Search code examples
javascripthtmlfeedtext-extractionhtml-content-extraction

page using JS to display info - need to extract that info


I need to turn the results of the leaderboard here secondsync.com/leaderboard.html#Wed-May-22-2013/totals into a feed.

They use a js to pull this info. How do I indentify the JS and pull the data from another page?

Thanks.


Solution

  • Maybe you could use this tutorial
    http://www.1stwebdesigner.com/tutorials/php-crawler-tutorial/ (requires some PHP and REGEX skills)
    to crawl the content of the page then use REGEX to access the content you need.

    With regex you could match() the first:

    <td class="tweets">*<td>
    

    you should get something like

    <td class="tweets">91,621<td>
    

    Then replace() every <*> with ''

    and the number 91,621 remains.

    This could be modified to <'td class="peak">* , if you are interested in the peak or something similar, just look up how they (secondsync) describe it in their source code.

    Sorry for not helping better, I can´t think of a better solution right now.