Search code examples
pythonnltkinformation-retrievalinformation-extractionlemmatization

Lemmatization of non-English words?


I would like to apply lemmatization to reduce the inflectional forms of words. I know that for English language WordNet provides such a functionality, but I am also interested in applying lemmatization for Dutch, French, Spanish and Italian words. Is there any trustworthy and confirmed way to go about this? Thank you!


Solution

  • Try pattern library from CLIPS, they have support for German, English, Spanish, French and Italian. Just what you needed: http://www.clips.ua.ac.be/pattern

    Unfortunately it only works with Python 2, no support for Python3 provided yet.