Search code examples
datasetimdbimdbpy

Extract reviews and ratings from IMDB


The Internet Movie Database does not allow scraping from their website, but does provide an FTP site with text files that can be downloaded and used for research: http://www.imdb.com/interfaces

How can I extract reviews and the corresponding ratings from this FTP server, preferably in Python?


Solution

  • Reviews are not distributed by IMDb in their plain text data files.

    For all the other data, you can parse them and store them in a SQL database using IMDbPY, or look at it to see how to parse only the information that are relevant to you.