Search code examples
sql-serverlocalizationfull-text-searchwordbreaker

Is there such a thing as third party SQL Server word breaker for Hungarian language?


I want to use CONTAINS on a fulltext index and use FORMSOF(...) on Hungarian data. Is it possible? I KNOW it is not supported by default in SQL Server.


Solution

  • SQL Server can load custom word breakers and stemmers, see Word Breakers and Stemmers. If you cannot find a Hungarian stemmer there is always the possibility of creating one your own: Word Breaker and Stemmer Sample, see also Getting the LRSAMPLE custom word-breaker to work on 64-bit SQL Server 2008. You don't have to implement the dictionary yourself, you could simply reuse for instance the libstemmer Hungarian Snowball algorithm and package it as a SQL Server stemmer.