Scraping a URL using BeautifulSoup

Hello, I am a beginner in data scraping. In this case I want to get URLs like "https:// . . .", with the result stored as a list in the link variable that contains all the links on the page. Here is the code:

import requests
from bs4 import BeautifulSoup
url = ''
page = requests.get(url)
soup = BeautifulSoup(page.content, 'html.parser')
artikel = soup.findAll('div', {'class' : 'list media_rows list-berita'})
p = 1
link = []
for p in artikel:
    s = p.findAll('a', href=True)['href']
    link.append(s)

Running the code above raises the following error:

TypeError                                 Traceback (most recent call last)
<ipython-input-141-469cb6eabf70> in <module>
3 link = []
4 for p in artikel:
5         s = p.findAll('a', href=True)['href']
6         link.append(s)
TypeError: list indices must be integers or slices, not str

What I want is a list of all the https:// . . . links inside <div class='list media_rows list-berita'>. Thank you in advance.


  • Code:

    import requests
    from bs4 import BeautifulSoup
    url = ''
    page = requests.get(url)
    soup = BeautifulSoup(page.content, 'html.parser')
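    articles = soup.find_all('div', {'class' : 'list media_rows list-berita'})
    links = []
    for article in articles:
        # find_all returns a list of tags, so loop over it instead of indexing it with 'href'
        hrefs = article.find_all('a', href=True)
        for href in hrefs:
            # each item is a single tag, so its attribute can be read by name
            links.append(href['href'])
    print(links)

  • Output:
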
    ['', '', '', '', 
    '', '', '', '', '']
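
  • Why the original code fails:

The TypeError comes from indexing the result of findAll directly. It returns a list-like ResultSet of tags, so ['href'] has to be applied to each tag, not to the list. Below is a minimal, self-contained sketch of that idea. The HTML string and the example.com URLs are made up for illustration, since the real page is not shown in the question. The startswith('https://') filter is only an assumption about what "links like https://" means here; drop it if relative links should be kept too.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the real page (not from the question)
html = """
<div class="list media_rows list-berita">
  <a href="https://example.com/a">A</a>
  <a href="/relative/b">B</a>
  <a>no href</a>
  <a href="https://example.com/c">C</a>
</div>
<div class="other"><a href="https://example.com/skip">skip</a></div>
"""

soup = BeautifulSoup(html, "html.parser")
containers = soup.find_all("div", class_="list media_rows list-berita")

# Reproduce the mistake: a string index on the list-like result raises TypeError
raised_type_error = False
try:
    containers[0].find_all("a", href=True)["href"]
except TypeError:
    raised_type_error = True

# Fix: iterate over the tags and read 'href' from each one
links = [
    a["href"]
    for div in containers
    for a in div.find_all("a", href=True)  # href=True skips <a> tags without href
    if a["href"].startswith("https://")    # assumed filter: absolute https links only
]

print(links)  # ['https://example.com/a', 'https://example.com/c']
```

The nested comprehension does the same thing as the two for loops in the answer above, just written in one expression.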