Hello all. I made an LDA model with 86000+ tokens. When trying to infer new documents topic distributions, does the sum of topic distribution probability have to be equal to 1?
lda_model = models.LdaModel.load('/myldamodel.lda_model')
corpus_bow = [[(32439,1), (73079, 2), (73150, 1)],\
[(22949, 1), (73079, 1)],\
[(73150, 2)]]
doc_lda = lda_model[corpus_bow]
for i in doc_lda:
print i
the results are:
[(9, 0.40200000000000152), (26, 0.22707850347701716), (57, 0.17692149652298442)]
[(9, 0.33666666666666822), (35, 0.33666666666666817)]
[(26, 0.67000000000000337)]
0.40200000000000152 + 0.22707850347701716 + 0.17692149652298442 != 1
0.33666666666666822 + 0.33666666666666817 != 1
0.67000000000000337 != 1
I don't understand how should I interpret LDA inference results on new documents?