Search code examples
phpnutchphpcrawl

Anyone has worked with a PHP API to read 'Nutch search engine' crawl results?


I have set up 'Nutch search engine' to crawl websites. Now,I need to write an php API to talk to the Nutch search engine. I need to do 2 things:

  1. using a PHP script I need to specify to Nutch as to which URLs to crawl (for this I have some pointers from http://www.cs.sjsu.edu/faculty/pollett/masters/Semesters/Fall07/sheetal/?Deliverable2.html

  2. using a PHP script I need to retrieve the crawl result from the Nutch crawl DB. I cant seem to find any help on this (or I might be too dumb to see the answer if it's already there :()

If anyone has used a PHP API to read Nutch crawl results, please share some pointers with me.

Desperately waiting for some help.


Solution

  • I'm looking for a real good way to do this too. But as of now, Im using a JSP API to display search results. This should start you off.

    You could alternatively use php to recieve your results as JSON objects.

    To kick you off in this direction, there's a interesting page to get you started on JSON using jquery. Google for other tutorials on JSON. There are plenty of'em.