Word2vec/phrase on wikibrain


kjy...@gmail.com

Jan 13, 2015, 8:18:21 PM
to wiki...@googlegroups.com
Hi, I came across the WikiBrain project while searching for Java code for Word2Vec/Word2Phrase.
I noticed that WikiBrain contains two Java files, Word2Vec and Word2Phrase.

How can I use those algorithms in WikiBrain with a Wikipedia dump as the training dataset?

Thanks,
Jaeyong Kang.

shi...@gmail.com

Feb 1, 2015, 11:59:01 PM
to wiki...@googlegroups.com
Hi Jaeyong. Thanks for your question. Sorry I didn't respond earlier.

What exactly would you like to do with the created Word2Vec model? WikiBrain does build a Word2Vec model for Wikipedia, but it uses an internal (non-standard) format.

-Shilad

Jaeyong Kang

Feb 3, 2015, 12:26:09 AM
to wiki...@googlegroups.com, shi...@gmail.com
Thanks for your reply.
What I want to do is extract the most similar words for any given input word, using Word2Vec/Word2Phrase trained on Wikipedia.

In the Quickstart example, I can see the results of resolving "apple", such as Apple Inc., etc.
But how can I modify the code to see results from Word2Vec (and, if possible, Word2Phrase as well)?

Thanks.
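
For readers with the same question: the "most similar words" lookup described above is usually a cosine-similarity ranking over the trained word vectors. Below is a minimal sketch of that ranking in plain Java. It does not use WikiBrain's API or its internal model format. The class name `TopSimilarWords`, the method names, and the toy three-dimensional vectors are all hypothetical and exist only for illustration. It assumes the vectors have already been loaded into an in-memory map from token to `float[]`.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of WikiBrain: ranks tokens by cosine similarity.
class TopSimilarWords {

    /** Cosine similarity between two vectors of equal length (0 if either is all zeros). */
    static double cosine(float[] a, float[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        if (normA == 0 || normB == 0) {
            return 0.0;
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    /** Returns up to k tokens most similar to the query, best first; empty if the query is unknown. */
    static List<String> mostSimilar(Map<String, float[]> vectors, String query, int k) {
        List<String> result = new ArrayList<>();
        float[] q = vectors.get(query);
        if (q == null) {
            return result;
        }
        // Score every other token against the query vector.
        List<Map.Entry<String, Double>> scored = new ArrayList<>();
        for (Map.Entry<String, float[]> e : vectors.entrySet()) {
            if (!e.getKey().equals(query)) {
                scored.add(new AbstractMap.SimpleEntry<>(e.getKey(), cosine(q, e.getValue())));
            }
        }
        // Sort by similarity, highest first.
        scored.sort(Map.Entry.<String, Double>comparingByValue().reversed());
        for (int i = 0; i < Math.min(k, scored.size()); i++) {
            result.add(scored.get(i).getKey());
        }
        return result;
    }

    public static void main(String[] args) {
        // Toy vectors made up for this example; real ones come from a trained model.
        Map<String, float[]> vectors = new HashMap<>();
        vectors.put("apple", new float[] {0.9f, 0.1f, 0.0f});
        vectors.put("pear", new float[] {0.8f, 0.2f, 0.0f});
        vectors.put("iphone", new float[] {0.1f, 0.9f, 0.1f});
        vectors.put("car", new float[] {0.0f, 0.1f, 0.9f});

        System.out.println(mostSimilar(vectors, "apple", 2)); // prints [pear, iphone]
    }
}
```

If the training text was first passed through word2phrase, phrases appear as single tokens joined by an underscore (for example `apple_inc`), so they would simply be additional keys in the same map. A linear scan like this is fine for a sketch; a full Wikipedia vocabulary would likely need pre-normalized vectors or an approximate nearest-neighbor index.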