Groups
Sign in
Groups
Gensim
Conversations
About
Send feedback
Help
Gensim
Contact owners and managers
1–30 of 3751
Welcome to the mailing list of
Gensim, topic modelling for humans
. Please read the
FAQ
before asking. Supporting Gensim helps us support you:
https://github.com/
sponsors/piskvorky
Mark all as read
Report group
0 selected
Felix Selgert
,
Gordon Mohr
2
Jul 1
Epoch Logger for LDA Modell
Unfortunately the docs page you've linked isn't as clear as it probably should be about
unread,
Epoch Logger for LDA Modell
Unfortunately the docs page you've linked isn't as clear as it probably should be about
Jul 1
Joseph Emmens
Jun 23
mm matrix 1 index
Hello All, Just a quick question as I cannot find the solution. I have checked this SO post but no
unread,
mm matrix 1 index
Hello All, Just a quick question as I cannot find the solution. I have checked this SO post but no
Jun 23
Joseph Emmens
Jun 19
Author Topic Model get_new_author_topics
Hey forum, If I have an Author Topic Model (ATM) trained on a set of documents and their authors, and
unread,
Author Topic Model get_new_author_topics
Hey forum, If I have an Author Topic Model (ATM) trained on a set of documents and their authors, and
Jun 19
이팩터s
,
Gordon Mohr
2
Jun 12
Migration 3 to 4
I'd need more context on how you're using this (essentially abstract) superclassm to make any
unread,
Migration 3 to 4
I'd need more context on how you're using this (essentially abstract) superclassm to make any
Jun 12
Kranthi Kumar
,
Gordon Mohr
3
Jun 5
Mismatch of vector embedding length at inference
I'm not too familiar/experienced with the Gensim LsiModel, but some things I'd check: (1) How
unread,
Mismatch of vector embedding length at inference
I'm not too familiar/experienced with the Gensim LsiModel, but some things I'd check: (1) How
Jun 5
l1muba1
,
Gordon Mohr
2
Jun 4
fasttext model produces very different vectors from train to train
This is expected behavior, per the Q11 in the FAQ. Every model run creates a new "space".
unread,
fasttext model produces very different vectors from train to train
This is expected behavior, per the Q11 in the FAQ. Every model run creates a new "space".
Jun 4
Edward Tang
,
Gordon Mohr
2
May 31
Noob question - Unstable result of Doc2vec.infer_vector()
The reasons that these algorithms don't give the exact same results in subsequent training or
unread,
Noob question - Unstable result of Doc2vec.infer_vector()
The reasons that these algorithms don't give the exact same results in subsequent training or
May 31
Lorenzo Filitti
,
Gordon Mohr
2
May 31
Data Format
Without knowing the code you "found online", what sort of JSON it expects, what conversion
unread,
Data Format
Without knowing the code you "found online", what sort of JSON it expects, what conversion
May 31
bayerphi
May 31
coherence Score for mallet
Hey folks, I know that the Mallet implementation has been removed since the latest update. However,
unread,
coherence Score for mallet
Hey folks, I know that the Mallet implementation has been removed since the latest update. However,
May 31
Roy Becker
,
Gordon Mohr
2
Apr 10
noob question: gensim.downloader.load keeps getting stuck
Gensim is just using a basic `urllib` HTTP request here. If you consistently get this same failure –
unread,
noob question: gensim.downloader.load keeps getting stuck
Gensim is just using a basic `urllib` HTTP request here. If you consistently get this same failure –
Apr 10
Daniel
,
Gordon Mohr
2
Apr 10
cannot import name 'triu' from 'scipy.linalg'
Scipy recently removed these functions after a fairly brief (less than 1 year) 'deprecation'
unread,
cannot import name 'triu' from 'scipy.linalg'
Scipy recently removed these functions after a fairly brief (less than 1 year) 'deprecation'
Apr 10
Ferhat Arslan
,
Gordon Mohr
3
Apr 10
Unexpected performance decrease when shared objects are locally compiled
Thanks for the update & confirmation of a possible workaround. If by chance you're on a Linux
unread,
Unexpected performance decrease when shared objects are locally compiled
Thanks for the update & confirmation of a possible workaround. If by chance you're on a Linux
Apr 10
Alexey Shkarupin
, …
Gordon Mohr
4
Apr 1
Loading fasttext model from S3
I believe the docs are wrong to suggest that smart_open's S3 support is enough for this operation
unread,
Loading fasttext model from S3
I believe the docs are wrong to suggest that smart_open's S3 support is enough for this operation
Apr 1
Tomáš Holler
,
Gordon Mohr
2
Mar 28
Loading WikiCorpus
I haven't tested this, but have you tried specifying a no-op `tokenizer_func`, fitting the
unread,
Loading WikiCorpus
I haven't tested this, but have you tried specifying a no-op `tokenizer_func`, fitting the
Mar 28
Joseph Emmens
,
Gordon Mohr
3
Feb 15
Serialized author-topic-model incorrect document count
That's exactly right, that's my fault for copying the example on the gensim atm documents
unread,
Serialized author-topic-model incorrect document count
That's exactly right, that's my fault for copying the example on the gensim atm documents
Feb 15
santosh.b...@gmail.com
,
Andrey Kutuzov
3
Feb 8
How to incorporate timestamp into word embeddings?
Thank you so much, Andrey. I will peruse the articles you shared - beginning with your co-authored
unread,
How to incorporate timestamp into word embeddings?
Thank you so much, Andrey. I will peruse the articles you shared - beginning with your co-authored
Feb 8
squaaad yang
12/11/23
How to implement NMF dynamic topic modeling using gensim?
Hi everyone, Recently, I have been doing a dynamic topic modeling project using NMF. I thought the
unread,
How to implement NMF dynamic topic modeling using gensim?
Hi everyone, Recently, I have been doing a dynamic topic modeling project using NMF. I thought the
12/11/23
Jeff Winchell
,
Gordon Mohr
8
11/24/23
Streamed Restartable Iterables - Details (for corpus streaming)
I suspect if you were to test your conjecture that "Using a file system vs the fastest dbms is
unread,
Streamed Restartable Iterables - Details (for corpus streaming)
I suspect if you were to test your conjecture that "Using a file system vs the fastest dbms is
11/24/23
Sargentini, Thierry
,
Gordon Mohr
2
11/24/23
Python 3.9
AFAIK, Gensim builds & passes its test suite on Python-[3.8, 3.9, 3.10, 3.11] for [Ubuntu, MacOS,
unread,
Python 3.9
AFAIK, Gensim builds & passes its test suite on Python-[3.8, 3.9, 3.10, 3.11] for [Ubuntu, MacOS,
11/24/23
Andy Weasley
,
Gordon Mohr
8
11/23/23
Errors When Installing Version 3.8.3
Thanks for the update, but keep in mind that given the rather blatant failure-to-do-what-was-intended
unread,
Errors When Installing Version 3.8.3
Thanks for the update, but keep in mind that given the rather blatant failure-to-do-what-was-intended
11/23/23
Nathan Cassee
9/29/23
[Request] Experiment on Technical Debt prioritization
We're an international team of academic Software Engineering researchers investigating technical
unread,
[Request] Experiment on Technical Debt prioritization
We're an international team of academic Software Engineering researchers investigating technical
9/29/23
Danilo Tomasoni
,
Gordon Mohr
3
8/28/23
Hard limit on vocab size?
Glad it's sorted. If you *did* want to cap the number of words loaded, you can supply a `limit`
unread,
Hard limit on vocab size?
Glad it's sorted. If you *did* want to cap the number of words loaded, you can supply a `limit`
8/28/23
Jaden Rodriguez
,
Gordon Mohr
2
8/23/23
Fix Proposals and Troubles with Source
Without more details, unsure what specific source errors you're having. A general guide to
unread,
Fix Proposals and Troubles with Source
Without more details, unsure what specific source errors you're having. A general guide to
8/23/23
Felix Goldberg
,
Gordon Mohr
2
8/22/23
Noob question - how to train a doc2vec model using a built-in corpus?
The Gensim project source code (https://github.com/RaRe-Technologies/gensim/) contains in its `docs/
unread,
Noob question - how to train a doc2vec model using a built-in corpus?
The Gensim project source code (https://github.com/RaRe-Technologies/gensim/) contains in its `docs/
8/22/23
Jonathan Peters
8/1/23
Negative log_perplexity
Hello, I created an LDA model from control data and I am trying to calculate the perplexity of my
unread,
Negative log_perplexity
Hello, I created an LDA model from control data and I am trying to calculate the perplexity of my
8/1/23
Jeff Winchell
,
Gordon Mohr
2
7/19/23
Need tokenizer/preprocessor for popular pretrained embeddings models
I made a feature-request item in our issue-tracker for this – https://github.com/RaRe-Technologies/
unread,
Need tokenizer/preprocessor for popular pretrained embeddings models
I made a feature-request item in our issue-tracker for this – https://github.com/RaRe-Technologies/
7/19/23
pradeep t
,
Gordon Mohr
5
7/7/23
Add custom words to GoogleNews-vectors-negative300.bin pretrained model
Thank you so much for the updates On Fri, Jul 7, 2023 at 2:52 AM Gordon Mohr <goj...@gmail.com
unread,
Add custom words to GoogleNews-vectors-negative300.bin pretrained model
Thank you so much for the updates On Fri, Jul 7, 2023 at 2:52 AM Gordon Mohr <goj...@gmail.com
7/7/23
Danilo Tomasoni
,
Gordon Mohr
5
7/4/23
Load of FastText binary format with mmap='r'
thank you very much!! it works! Il giorno venerdì 30 giugno 2023 alle 19:59:27 UTC+2 Gordon Mohr ha
unread,
Load of FastText binary format with mmap='r'
thank you very much!! it works! Il giorno venerdì 30 giugno 2023 alle 19:59:27 UTC+2 Gordon Mohr ha
7/4/23
Thanos Tasakos
,
Gordon Mohr
5
6/30/23
Gensim KeyedVector load from s3
What a legend! I needed to also monkey-patch the numpyio module , to use smart_open instead of open,
unread,
Gensim KeyedVector load from s3
What a legend! I needed to also monkey-patch the numpyio module , to use smart_open instead of open,
6/30/23
pradeep t
,
Gordon Mohr
2
6/29/23
Pretrained model for doc2vec
I don't know of any I'd recommend, & that work with recent Gensim versions. (When I'
unread,
Pretrained model for doc2vec
I don't know of any I'd recommend, & that work with recent Gensim versions. (When I'
6/29/23