Dear all
I was trying my hands on gensim and going through the original Paragraph Vector paper by Le and Mikolov. I have got some queries :
As specified in the original paper of Paragraph Vector :
1. Just like
word vectors paragraph matrix is said to be randomly initialized. Each
column in a paragraph matrix represents a paragraph. Each column in the
word matrix represents a word. What the rows in the paragraph matrix
represent?
Suppose we have 3 documents :
para1: Two for tea and tea for two.You for me and me for you.
para2: Tea for me and tea for you
para3: You for me and me for you.Two for tea and tea for two.
with vocabulary V = [two,tea,me,you]
What would be the corresponding paragraph matrix and word matrix?Is it the simple term-document and one-hot representation?
2. How the word-ordering is preserved in case of paragraph vectors?
3. If for a certain set of documents we have all the word in the vocabulary
for e.g. d1 : Chris is a good boy.
d2 : Paris is a beautiful city.
V : [Chris,is,a,good,boy,Paris,beautiful,city]
What would be the document representation like? How would they differ vectorially?
4.
How the update to the Paragraph Matrix D in the next epoch is made? "
to get paragraph vectors D for new paragraphs (never seen before) by
adding more columns in D and gradient descending on D while holding
W,U,b fixed " as in the original paper?