Tags: python, python-3.x, jupyter-notebook, nltk

WinError 10060 Connection Timeout Error When Downloading 'punkt' in NLTK


I'm having trouble with the NLTK library in Python: downloading the 'punkt' tokenizer models fails with a WinError 10060 connection timeout error.

Context: I'm attempting to download the punkt package using the following code:

import nltk
nltk.download('punkt')

Troubleshooting Steps I've Taken:

Checked my internet connection, which seems to be working fine.
Temporarily disabled firewall and antivirus software, but the issue persists.
Attempted to use a different internet connection (e.g., mobile hotspot), but faced the same error.

Solution

  • A very strange error:

    https://github.com/nltk/nltk/issues/1981#issuecomment-1804689730

    https://github.com/nltk/nltk/issues/3104

    It seems that some networks (I've been told "Jio" is one of them) cannot reach raw.githubusercontent.com, the host that serves the NLTK data, e.g.: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/tokenizers/punkt.xml
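
    One way to confirm this diagnosis is to test whether the URL above is reachable from the affected machine, independently of NLTK. The following is only a minimal sketch using the standard library: the function name `can_reach`, the `opener` parameter, and the 10-second timeout are illustrative assumptions, not part of NLTK.

```python
import urllib.request

# URL quoted in the answer above (hosted on raw.githubusercontent.com).
PUNKT_XML_URL = (
    "https://raw.githubusercontent.com/nltk/nltk_data/"
    "gh-pages/packages/tokenizers/punkt.xml"
)


def can_reach(url, timeout=10.0, opener=urllib.request.urlopen):
    """Return True if `url` answers with HTTP 200 within `timeout` seconds.

    `opener` is a hypothetical injection point (an assumption of this
    sketch) so the check can be exercised without network access.
    """
    try:
        with opener(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # urllib.error.URLError and HTTPError subclass OSError, as do
        # socket timeouts, so this covers DNS failures, refused
        # connections, HTTP error statuses, and connection timeouts.
        return False
```

    Calling `can_reach(PUNKT_XML_URL)` on the problem network and getting `False` would suggest that the timeout comes from the network blocking that host rather than from NLTK itself.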