Search code examples
full-text-searchsearch-enginemultilingual

Full text search library for multilingual content


What is a good full-text search library (free/licensed) that works well with multilingual data?


Solution

  • Lucene has analyzers for a wide variety of languages. That only works, of course, if you know which language each document is in: either give them a language tag or use a language guesser.

    (Though if you want to do cross-lingual retrieval with machine translation in the pipeline, you're on your own, I'm afraid.)