Gensim is just using a basic `urllib` HTTP request here. If you consistently get this same failure – a broken connection mid-download, possibly even always at the same place – it's likely something idiosyncratic about your network, and its path to the download source. (This could be anything from security policies on firewall-like hosts detecting & aborting your action to faulty network hardware or network misconfigurations.)
You could try another machine, or another network.
But also, for a variety of reasons from the value of Pythonic explicitness to robustness against potential software supply-chain attacks, I don't think the `gensim.downloader` module should exist, and I personally recommend against its use. My full argument has been archived in issue <
https://github.com/piskvorky/gensim/issues/2283>.
I suggest instead using a standard web browser to go to the original publishers of these datasets – like the GLoVe website <
https://nlp.stanford.edu/projects/glove/> – and directly download from there, to an explicit local path you've consciously chosen. (Web browsers can sometimes also resume interrupted-download files.)
Or, if you need to automate the download on a graphical-browserless machine (terminal or notebook host), first discover the source URL via a web browser, then use your own line or two of Python `urllib` or `requests` code, or other scriptable tools like command-line `wget` or `curl`, to do the download to your desired location (and unzip/untar/etc as necessary).
Then, a line or two of Gensim code (like `vecs = KeyedVectors.load_word2vec_format(filename)`) can load those file formats, without obscuring which code is being run, where data has landed in what format, or which kind of model object (class) is being provided. (In some cases, these alternate approaches can also report failures in more detail.)
- Gordon