Unicode decode error while loaading pretrained glove embeddings through Gensim

18 views
Skip to first unread message

anjalib...@gmail.com

unread,
Apr 4, 2019, 11:57:00 AM4/4/19
to GloVe: Global Vectors for Word Representation

I am trying to run some code with Gensim on Colab, and am using the Crawl file listed on the glove website for this:
f = KeyedVectors.load_word2vec_format('glove.42B.300d.zip',binary=True)

Running this is throwing the following error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x86 in position 11: invalid start byte

I tried a lot of fixes but none seems to work. Please suggest something for this.

Sandeep Pamidiparthi

unread,
Apr 19, 2019, 1:17:00 PM4/19/19
to GloVe: Global Vectors for Word Representation
This might help. First convert to another format and then load in gensim.

https://stackoverflow.com/questions/27139908/load-precomputed-vectors-gensim#41990999
Reply all
Reply to author
Forward
0 new messages