Gensim
1–30 of 3671
Welcome to the mailing list of Gensim, topic modelling for humans. Please read the FAQ before asking. Supporting Gensim helps us support you: https://github.com/sponsors/piskvorky
Caleb Fleming, … Subham Biswas
4
Jul 4
Speed of classifying new documents with get_document_topics
I have a doubt regarding corpus. When we use the saved model to give topic distributions on unseen
Subham Biswas
Jul 4
I want to get topic distributions on unseen documents using previously tuned and saved LDA method
Hi, I have built a corpus and trained an LDA model. Now I am trying to use it on unseen documents and
Arturo Godoy, Gordon Mohr
4
Jul 3
AttributeError: 'Doc2Vec' object has no attribute 'cum_table'
The `.most_similar()` method can take as arguments the keys of already-known vectors, or raw vectors
Emil Rijcken, … Gordon Mohr
7
Jun 29
Add new topic modeling algorithm to Gensim
If your preference is for the bulk of the FuzzyTM code to live in a separate code repo & PyPI
Oguzhan Alasehir, Gordon Mohr
3
Jun 28
Doc2Vec Training Documents (Separated vs merged)
Thanks for the details. I will try the alternatives. Best regards, Oguzhan ALASEHIR Gordon Mohr <
Odin Morón-García, Gordon Mohr
3
Jun 22
Newbie trying to use D2V for genomes
Thanks a lot, Gordon! I am afraid I was tired and with the aim of not digging into the biological details
Yonatan Shalita, … Andrey Kutuzov
7
Jun 17
Newbie question - error while using the load_word2vec_format function
I mean Gensim is not much tested on Windows, so lots of weird problems might arise. Can you try on
Patryk Bartkowiak, Gordon Mohr
2
Jun 12
MemoryError while building vocabulary for word2vec
MemoryErrors can be tricky because often the code that's using/holding too much memory is far
Jacob Celestine, Gordon Mohr
2
Jun 3
Ways to speed up LDA Seq model?
I'm not familiar with the particular performance characteristics or best practices of `
amjass, Gordon Mohr
7
May 31
Which vectors are used for cosine similarity in Word2Vec
Hi Gordon - OK, yes, this clarifies it completely. Thank you so much for clearing up my doubts and for
Marc gehring, Gordon Mohr
4
May 23
Gensim Model in the Software Field
(1) Collect the texts you want to use as training material (2) Follow some online example that
Halit Vural, Gordon Mohr
2
May 11
XML Code similarity with Doc2Vec, weighted embeddings
On Tuesday, May 10, 2022 at 10:31:30 AM UTC-7 bosna...@gmail.com wrote: Hello everyone, I am trying
Y P, Gordon Mohr
2
Apr 28
FastText load problem: “_pickle.UnpicklingError: NEWOBJ class argument isn't a type object”
I've not seen such an error in the context of Gensim before – but Googling reveals plenty of
Liron Bilya
Apr 25
Gensim dependencies
Hi, I have a question about the setup of gensim. When I try to install the package with a specific
Cristian Urtado Cunha Florido, Radim Řehůřek
2
Apr 23
Pre-trained Word Embedding
Hi Cristian, 1) Try the latest version of gensim. Not 2.0.0 – that one is 5 years old. 2) For the
Finn L
Apr 16
Newbie question, running tox/tests locally
Hi, I'm new to gensim, just starting out on a PR related to Phrases. I've followed the
mattyterm, … Matt Buckley
13
Apr 11
A bit of a newbie question, but trying to understand feasibility of LSA
Has anyone managed to solve this and could provide their solution? Thanks! On Friday, February 24,
Maria Sääksjärvi
Apr 7
WMD Pairwise similarity matrix
I was wondering how I can transform the results from WMD to a pairwise similarity matrix that I can
Pete Bleackley, Radim Řehůřek
3
Mar 22
LSI less accurate than TfIdf
Thanks. I suspect I may be underfitting. On Tue, 22 Mar 2022, 18:32 Radim Řehůřek, <me@
Denitsa Saynova
Mar 21
Fine-tuning off-the-shelf pretrained word2vec models
Hi, I am fine-tuning off-the-shelf pretrained word2vec models. These are typically distributed as
alistair...@gmail.com, Gordon Mohr
3
Mar 19
Doc2vec questions
Thank you, Gordon! I am comfortable with my alpha computations but really want to use the model.train
Tedo Vrbanec, Gordon Mohr
3
Mar 18
Gensim sentences extraction (or some other method of sentence extraction?)
I suspect that if you tried the old (`gensim.summarization`) sentence-splitter, you'd find its
visual gcp, … Gordon Mohr
5
Mar 16
Strange gensim word2vec behavior
It's still a toy-sized/synthetic example. As mentioned, a tiny vocabulary/corpus can't
amjass, Andrey Kutuzov
3
Mar 9
Can models loaded as keyed vectors in word2vec format be used for Procrustes analysis
I do not know how I missed this! Thank you for confirming! On Thursday, November 4, 2021 at 2:25:23
이준호, Gordon Mohr
3
Mar 8
Exact meaning of the parameter "window"
Thanks for your very, very kind answer. Thanks to your answer, I fully understood. I don't
Vít Novotný
Mar 7
Normalization in gensim.similarities.*MatrixSimilarity considered harmful
The SparseMatrixSimilarity and DenseMatrixSimilarity classes from the gensim.similarities.docsim module
Hajar Zankadi
Mar 7
Evaluate LDA output using classification metrics
I am working on a research paper and using 3 different algorithms including LDA. I trained LDA using
Martin
Mar 2
LSI incremental learning
Hello, this seems like a popular issue but I still didn't manage to get it to work after reading
Divya Gangwani, Gordon Mohr
2
Mar 1
Infer Vector Doc2Vec
Without the full error message – with multiple lines of 'traceback' info, showing involved
amjass, Gordon Mohr
5
Mar 1
Length of most_similar topn when passing in a list of positive words
Thank you! Indeed, it is idle curiosity and nothing more than that - I also like to see if things can