Search code examples
pythonbeautifulsouphref

Extracting href with Beautiful Soup


I use this code to get acces to my link :

links = soup.find("span", { "class" : "hsmall" })
links.findNextSiblings('a')
for link in links:
  print link['href']
  print link.string

Link have no ID or class or whatever, it's just a classic link with a href attribute.

The response of my script is :

print link['href']
TypeError: string indices must be integers

Can you help me to get href value ? Thx !


Solution

  • Okay, it works now with following code :

    linkSpan = soup.find("span", { "class" : "hsmall" })
    link = [tag.attrMap['href'] for tag in linkSpan.findAll('a', {'href': True})]
    for lien in link:
      print "LINK = " + lien`