I am seeing the same thing. I have a corpus of 14,000 documents and a vocabulary of 81,000 words.
lda_model_serial = gensim.models.ldamodel.LdaModel(corpus=word_frequency_map,
#lda_model_parallel = gensim.models.ldamulticore.LdaMulticore(corpus=word_frequency_map,
id2word=id2word,
num_topics=10,
random_state=100,
chunksize=100,
passes=8,
alpha='symmetric',
)
When I subjectively look at the results in pyLDAvis, serial is better. Also, the serial model gives a c_v coherence score of 0.6 on the training corpus. The parallel model gives 0.52.
Anyone have an idea why LdaMulticore doesn't give as good results as LdaModel?
Thanks,
David