Search code examples
pythonnltkwordnet

How to use the Spanish Wordnet in NLTK?


I just downloaded a Spanish Wordnet from the project GRIAL, the format is XML. How can I use it in Python NLTK?

Besides that, in the same page you can download a tagged corpus in Spanish. How can I incorporate it as well?


Solution

  • Use XMLCorpusReader to load XML data as corpus

    Here's the code to do that

    from nltk.corpus.reader import XMLCorpusReader
    reader = XMLCorpusReader(dir, file)
    

    A fully working example which uses XMLCorpusReader is given here