Search code examples
javasearchdata-miningrtfapache-tika

Searching for Words in RTF Files through Java


I'm looking for ways to search for particular words in 300+ RTF documents using Java. I'd like to use the search ability to:

  1. Count how many times the particular word is used across all RTF documents.
  2. Figure out in what RTF documents the word is used.

I looked into using Apache TIKA to convert RTF docs into text files and search them, but I feel like there's an easier, faster way to do it.


Solution

  • Try using lucene? It supports fulltext search in a lot of file formats.