Search code examples
phpregexanchorhtml-parser

finding all anchor tags with pdf file as the source in php using regex or HTML parser


how can i find all the anchor tags with source pdf

$string="hello this is a dummy text <a href="../../abc.pdf"> 

i need only abc.pdf in the string variable


Solution

  • You should use DOMDocument instead of regex:

    $string='hello this is a dummy text <a href="../../abc.pdf">'; 
    $doc = new DOMDocument;
    $doc->loadHTML($string);
    
    $href = $doc->getElementsByTagName('a')->item(0)->getAttribute('href');
    echo basename($href); // abc.pdf