Hi ,
My data consist of 1,24,196 sentences .
dataset = [d.split() for d in df_topic_modeling['stopwords_removed_str']]
dictionary = Dictionary(dataset)
corpus = [dictionary.doc2bow(doc) for doc in dataset]
print("Building LDA Multicore model")
lda_multicore_model_using_gensim = LdaMulticore(corpus=corpus, id2word=dictionary, iterations=50, num_topics=5, passes=10)
print("Computing Coherence")
df_topic_modeling['stopwords_removed_str_tokenize']= df_topic_modeling['stopwords_removed_str'].apply(word_tokenize)
cm = CoherenceModel(model=lda_multicore_model_using_gensim,texts=df_topic_modeling['stopwords_removed_str_tokenize'],corpus=corpus, dictionary=dictionary, coherence='c_v')
coherence_lda = cm.get_coherence()
print('\nCoherence Score: ', coherence_lda)
I am trying to get the coherence score for my LDA multicore model. But coherence score is not obtained. Please help