Search code examples
textmarkov

Is there a resource of lots of human text?


I just coded a Markov chain that talks based on learned data. I'd like a resource of a lot of text data online, but can't seem to find any (most sites like Wikipedia have a lot of junk, not plain text files).

Is there any site that would have a lot of text file that is suitable to test a Markov chain on?


Solution

  • gutenberg.org might have some resources for you. For example, here's what appears to be a bunch of Moby Dick, in text file form.

    http://www.gutenberg.org/files/2701/2701.txt