Hi there,
While trying to add new document including new terms (do not exist in original dic), i have realized that using add_documents will not work as the model itself doesnt agree with respect to number of terms (am I wrong here - I prefer this way I think).
I have ran into the following error :
C:\Python27\lib\site-packages\gensim\models\ldamodel.py:545: RuntimeWarning: overflow encountered in exp2
(perwordbound, numpy.exp2(-perwordbound), len(chunk), corpus_words))
[(0, '0.000*"set([u\'mother\'])" + 0.000*"set([u\'brother\'])" + 0.000*"set([u\'drive\'])"'), (1, '0.000*"set([u\'blood\'])" + 0.000*"set([u\'caus\'])" + 0.000*"set([u\'tension\'])"'), (2, '0.000*"set([u\'brocolli\'])" + 0.000*"set([u\'eat\'])" + 0.000*"set([u\'good\'])"')]
And print_topics gives me all zeros.
The only line I have changed in that code is dictionary = corpora.Dictionary(texts)
with dictionary = corpora.hashdictionary.HashDictionary(texts)
What am I am doing wrong here?
My goal is to train an LDA multicore model (which already done and working nicely) and then update the model sequently with new docs.
Many thanks
Yuval