clustering on fasttext embeddings


ar...@capiot.com

Feb 8, 2018, 7:42:13 AM

to fastText library

Hi, I'm trying to use fastText for document classification. How do we convert the fastText embeddings for each document in a corpus into a pandas DataFrame, so that we can apply a clustering algorithm to the documents?

Matan Safriel

Feb 17, 2018, 6:41:02 AM

to fastText library

Dude, they are just vectors :-)) If you choose fastText to generate document embeddings rather than some other algorithm or tool, I guess you can generate your vectors with fastText's `print-sentence-vectors` command and then load them into a DataFrame as you would any other data. Hopefully you won't run out of memory. Hope this helps!
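For what it's worth, here is a rough sketch of that route in Python. It assumes the vectors were already dumped with something like `fasttext print-sentence-vectors model.bin < docs.txt > vectors.txt` (both file names are placeholders), and that each output line holds only the whitespace-separated vector components. Some fastText versions may print the sentence text before the vector, in which case you would need to strip it first. The helper names, the tiny inline sample, and the choice of k-means from scikit-learn are illustrative assumptions, not anything fastText prescribes.

```python
import io

import numpy as np
import pandas as pd
from sklearn.cluster import KMeans


def load_sentence_vectors(source):
    """Load one whitespace-separated vector per line into a DataFrame.

    `source` may be a file path or a file-like object. Assumes every line
    contains only floats (no leading sentence text).
    """
    # ndmin=2 keeps the result 2-D even if the file has a single line.
    matrix = np.loadtxt(source, ndmin=2)
    return pd.DataFrame(matrix)


def cluster_documents(vectors, n_clusters, seed=0):
    """Return one k-means cluster label per document (row)."""
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
    labels = model.fit_predict(vectors.to_numpy())
    return pd.Series(labels, index=vectors.index, name="cluster")


# Hypothetical stand-in for vectors.txt: four documents, dimension 3.
sample = io.StringIO(
    "0.10 0.20 0.30 \n"
    "0.11 0.19 0.31 \n"
    "-0.90 0.80 -0.70 \n"
    "-0.88 0.82 -0.71 \n"
)

df = load_sentence_vectors(sample)
labels = cluster_documents(df, n_clusters=2)

print(df.shape)  # (4, 3): one row per document, one column per dimension
print(labels.tolist())
```

With a real file you would call `load_sentence_vectors("vectors.txt")` instead of passing the in-memory sample. On the memory worry: the DataFrame holds roughly (number of documents) x (vector dimension) x 8 bytes as float64, so it may be worth estimating that before loading a large corpus.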
