Search code examples
hadoopnosqlweb-crawlernutchapache-kudu

Configuring Nutch to write to Apache Kudu


I am trying to configure Apache Nutch to write to Apache Kudu, but I cannot find anywhere informations about how to do it. I know I can write to Cassandra and HBase, but there is nothing about Kudu.

The Hadoop distribution that I am using is CDH 5.16.1

How do I configure Apache Nucth to write to Apache Kudu?


Solution

  • Apache Kudu is not support out of the box by Apache Nutch. Writing an integration should not be very complicated since Nutch has a pluggable output/indexing system.

    You could write an indexer plugin by implementing the IndexWriter interface. We always welcome new contributions & contributors.