clustering on fasttext embeddings

384 views
Skip to first unread message

ar...@capiot.com

unread,
Feb 8, 2018, 7:42:13 AM2/8/18
to fastText library
Hi, trying to use fasttext for document classification. How do we convert the fasttext embeddings for document in a corpus to a pandas dataframe so as to apply some clustering algorithm on documents

Matan Safriel

unread,
Feb 17, 2018, 6:41:02 AM2/17/18
to fastText library
Dude, they are just vectors :-)) if you choose fasttext to generate document embeddings and not some other algorithm/tool for document embeddings, I guess generate your vectors through the `print-sentence-vectors` option of fasttext, and just transform them into dataframe/s as you would transform any other data for a dataframe. Hopefully you won't memory overflow. Hope this helps!
Reply all
Reply to author
Forward
0 new messages