Search code examples
rubyscreen-scrapingcss-selectorsnokogirihpricot

How to detect mailto links with Hpricot/Nokogiri


I want to match links like <a href="mailto:[email protected]">foo</a>, but this doesn't work only works in Nokogiri:

doc/'a[href ^="mailto:"]'

What's the right way of doing that? How do I do that with Hpricot?


Solution

  • This works on Hpricot:

    doc/'a[@href ^="mailto:"]'
    

    Couldn't figure out a way to do the xpath search though. Apparently, Hpricot doesn't support starts-with: http://wiki.github.com/hpricot/hpricot/supported-xpath-expressions