
Optimal Configuration for Disguising the Identity of Scraping Hosts


I'm running a bunch of scripts that scrape data from a website. For reasons I won't bore you with, I can't run them all off the same host; instead, I need to set up six different hosts. I want to configure my hosting setup to disguise the fact that all six hosts have the same owner.

I have six different shared hosting accounts, each in a different geographical location. Is there anything else I need to do? Should I buy a different domain name for each host? If not, what domain should I give to each host?


Solution

  • You could set up multiple instances of Tor, give each its own SOCKS port, control port, and data directory, and run all your scrapes from one computer, with each script pointed at a different instance. Each instance builds its own circuits through the Tor network, so each script's HTTP requests will usually reach the target site from a different exit IP address. The address is not guaranteed to be unique, since two instances can occasionally pick the same exit relay.
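
To make the multi-instance setup concrete, here is a minimal sketch that generates one torrc file per instance. `SocksPort`, `ControlPort`, and `DataDirectory` are standard torrc options; everything else is an assumption made for illustration: the starting port 9060, the `tor0`, `tor1`, ... directory names, and the helper names `instance_ports`, `build_torrc`, `proxy_url`, and `write_configs`.

```python
from pathlib import Path
from typing import List, Tuple

BASE_PORT = 9060  # assumed starting port; any free port range would do


def instance_ports(index: int) -> Tuple[int, int]:
    """Return (socks_port, control_port) for the instance with this index."""
    socks_port = BASE_PORT + 2 * index
    return socks_port, socks_port + 1


def build_torrc(index: int, data_root: Path) -> str:
    """Build the torrc text for one instance (hypothetical layout)."""
    socks_port, control_port = instance_ports(index)
    data_dir = data_root / f"tor{index}"
    return (
        f"SocksPort {socks_port}\n"
        f"ControlPort {control_port}\n"
        f"DataDirectory {data_dir}\n"
    )


def proxy_url(index: int) -> str:
    """SOCKS proxy URL a scraper would use to reach this instance."""
    socks_port, _ = instance_ports(index)
    # "socks5h" asks the client to resolve hostnames through the proxy.
    return f"socks5h://127.0.0.1:{socks_port}"


def write_configs(count: int, data_root: Path) -> List[Path]:
    """Write one torrc file per instance and return the file paths."""
    data_root.mkdir(parents=True, exist_ok=True)
    paths = []
    for index in range(count):
        path = data_root / f"torrc.{index}"
        path.write_text(build_torrc(index, data_root))
        paths.append(path)
    return paths
```

Under these assumptions, each instance would be started with `tor -f <path to its torrc>`, and each scraping script would send its traffic through the matching `proxy_url(i)`. With the `requests` library, for example, that URL can be passed in the `proxies` argument once SOCKS support is installed.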