http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belong-to-you.html

Google word frequency dataset for n-gram models. Available as a download or on DVDs from U of Penn. ~24 GB.