I have installed the nltk package. Following that I am trying to download the supporting packages using nltk.download() and am getting error:
[Errno 11001] getaddrinfo
My machine / software details are:
OS: Windows 8.1 Python: 3.3.4 NLTK Package: 3.0
Below are the commands run in python:
Python 3.3.4 (v3.3.4:7ff62415e426, Feb 10 2014, 18:13:51) [MSC v.1600 64 bit (AMD64)] on win32
Type "copyright", "credits" or "license()" for more information.
import nltk
nltk.download()
showing info http://nltk.github.com/nltk_data/
True
nltk.download("all")
[nltk_data] Error loading all: <urlopen error [Errno 11001]
[nltk_data] getaddrinfo failed>
False
It looks like it is going to http://nltk.github.com/nltk_data/ whereas it should Ideally try to get the data from http://www.nltk.org/nltk_data/.
On another machine when we type http://nltk.github.com/nltk_data/ in the browser, it redirects to http://www.nltk.org/nltk_data/. I am not understanding why the redirection is not happening on my laptop.
I feel that this might be the issue.
I have added the command prompt screenshot.
Got the solution. The issue in my case was that when the NLTK downloader started it had the server index as - http://nltk.github.com/nltk_data/
This needs to be changed to - http://nltk.org/nltk_data/
You can change this by going into the NLTK Downloader window and the File->Change Server Index.
Regards, Bonson