Search code examples
solrweb-crawlernutch

Using Apache Solr to index Nutch data


I am attempting to set up Solr to index the results from my Nutch crawler. The tutorials I have found online require the file conf/schema.xml to be copied from Nutch into Solr as part of the configuration, however this file does not exist in my version of Nutch. Is there another way to do this or is there something I am missing? cheers


Solution

  • See the "official" Nutch tutorial:

    Note: due to NUTCH-2745 the schema.xml is not contained in the binary package. Please download the schema.xml from the source repository.

    NUTCH-2745 applies only to Nutch version 1.16 and will be fixed in 1.17.