Search code examples
postgresqlfull-text-searchstop-wordsaffix

How to create the affix and dic files for postgresql text search


I wanna use postgresql's text search capability for one of my projects. The language supported gonna be in english and indonesian. Unfortunately, I can't find indonesian config to be used. (example is here https://www.cs.hmc.edu/~geoff/ispell-dictionaries.html)

Looking at https://www.postgresql.org/docs/11/textsearch-dictionaries.html, it seems I can make my own custom stopwords, affix, and dic files. I already found and load the stopwords file (from https://github.com/masdevid/ID-Stopwords), meanwhile for the affix (the rule looks like here http://indodic.com/affixeng.html) I can't seem to find the file, or the way to write the affix/dic file.

Can someone point out to me how, or possibly where can I find some information in more details?


Solution

  • The Ispell link referenced in the PostgreSQL documentation has the following information:

    Complete documentation for ispell, including documentation of the affix-file format, comes with the distribution kit in Unix manual-page format. If you have ispell installed on your machine, you should be able to type "man ispell" to view it. For convenience and browsing, the primary manual page for ispell is also available online.

    You will have to study that, then you can create an affix file.