Search code examples
datasetplaintext

Huge amount of plaintext data for parsing experiment


I am developing a parser in ruby which parses some nonuniform text data. Can anybody tell me, where I can get a good number of plaintext data for that?


Solution

  • Here's you'll get a list of many:

    http://www.quora.com/Data/Where-can-I-get-large-datasets-open-to-the-public

    And my fav is:

    http://ftp.sunet.se/mirror/archive/ftp.sunet.se/pub/tv+movies/imdb/