Search code examples
pythonbeautifulsouprequestpython-requestsinstagram

How would I log into Instagram using BeautifulSoup4 and Requests, and how would I determine it on my own?


I've looked at these two posts on Stack Overflow so far: I can't login to Instagram with Requests and Instagram python requests log in without API. Both of the solutions don't work for me.

How would I do this now, and how would someone go about finding what requests to make where? To make that clearer, if I were to send a post request to log in, how would I go about knowing what and where to send it?
I don't want to use Instagram's API or Selenium, as I want to try out Requests and (maybe) bs4. In case you'd want some code:

import requests

main_url = 'https://www.instagram.com/'
login_url = main_url+'accounts/login/ajax'
user_agent = 'User-Agent: Mozilla/5.0 (iPad; CPU OS 6_0_1 like Mac OS X) AppleWebKit/536.26 (KHTML, like Gecko) Version/6.0 Mobile/10A523 Safari/8536.25'

session = requests.session()
session.headers = {"user-agent": user_agent}
session.headers.update({'Referer': main_url})

req = session.get(main_url)
session.headers.update({'set-cookie': req.cookies['csrftoken']})
print(req.status_code)

login_data = {"csrfmiddlewaretoken": req.cookies['csrftoken'], "username": "myusername", "password": "mypassword"}


login = session.post(login_url, data=login_data, allow_redirects=True)
print(login.status_code)
session.headers.update({'set-cookie': login.cookies['csrftoken']})

cookies = login.cookies

print(login.headers)
print(login.status_code)

This gives me a 405 error.


Solution

  • you can use this code to login to instagram

    import re
    import requests
    from bs4 import BeautifulSoup
    
    from datetime import datetime
    
    link = 'https://www.instagram.com/accounts/login/'
    login_url = 'https://www.instagram.com/accounts/login/ajax/'
    
    time = int(datetime.now().timestamp())
    
    payload = {
        'username': 'login',
        'enc_password': f'#PWD_INSTAGRAM_BROWSER:0:{time}:your_password',
        'queryParams': {},
        'optIntoOneTap': 'false'
    }
    
    with requests.Session() as s:
        r = s.get(link)
        csrf = re.findall(r"csrf_token\":\"(.*?)\"", r.text)[0]
        r = s.post(login_url, data=payload, headers={
            "User-Agent": "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/77.0.3865.120 Safari/537.36",
            "X-Requested-With": "XMLHttpRequest",
            "Referer": "https://www.instagram.com/accounts/login/",
            "x-csrftoken": csrf
        })
        print(r.status_code)