web-scraping web-crawler python-3.7 http-status-code-403

web scraping / web crawling showing 403 error on the site i want to crawl

import requests
from bs4 import BeautifulSoup
url ='https://www.vesselfinder.com/vessels'
headers= {'User-Agent': 'Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)'}
response = requests.get(url)
soup = BeautifulSoup(response.content, 'html.parser')
response.status_code

i tried different user agent but still not working, i tried other sites it work but this website not working, help me to crawl all vessel data from this site. thanks in advance!!!

Solution

Server wants an additional header for language

import requests

headers = {
    'user-agent': 'Mozilla/5.0',
    'accept-language': 'en-GB,en-US;q=0.9,en;q=0.8',
}

response = requests.get('https://www.vesselfinder.com/vessels', headers=headers)
response.status_code

Selenium headless broke after Chrome update
Puppeteer - Protocol error (Page.navigate): Target closed
Doubled elements in result array while scraping HTML content
403 Forbidden Error when scraping a site, user-agents already used and updated. Any ideas?
How can I get all the HTML in a document or node containing shadowRoot elements
How to Scrape NBA stats page using rvest
Why does web scraping a website using Python requests connect to a US server instead of a Greek one and return non-Greek content?
ImportXML XPath issue using Google Sheets on a simple web scraping query
How to focus on a window with python?
How can I export text from a specific div with class "swatch-option text" using Python and BeautifulSoup?
scrape a website which has the same url for multiple pages? with the page jump being an ajax request
Web scraping table from UniProt database
How to scrape hierarchical web data into tabular format using rvest?
I'm trying to scrape data from the at the races website but the scraper is not returning any results
Scrape Shadow root elements using Python Selenium
Web scraping for multiple classes using python
Getting Video Links from Youtube Channel in Python Selenium
retrieve links from web page using python and BeautifulSoup
Python investing.com historical stock data by curl request
scraping a website with a load more button Python
Webscraping doesnt return the expected html?
Using R to get download URL by link name
Trouble scraping BBC with Python Scrapy (2023)
Selenium ChromeDriver asking to set default search engine on startup
Scrapy + Splash: connection refused
Scraping fund meta data in Yahoo Finance (not prices)
scraping webpage with dynamic content - cheerio
Scraping Wikipedia tables with Python selectively
Failed to produce a JSON response containing a phone number based on a license number from a webpage using the requests module
How to use urbandictionary API built in API function random()