I am trying to crawl a list of sites with scrapy
. I tried to put the list of website urls as the start_urls
, but then I found I couldn't afford so much memory with it. Is there any way to set the scrapy
crawling one or two sites at a time?
You can try using concurrent_requests = 1
so that you don't overloaded with data
http://doc.scrapy.org/en/latest/topics/settings.html#concurrent-requests