Gensim
Welcome to the mailing list of Gensim, topic modelling for humans. Please read the FAQ before asking. Supporting Gensim helps us support you: https://github.com/sponsors/piskvorky
Roy Becker, Gordon Mohr (2 messages), 10 Apr
noob question: gensim.downloader.load keeps getting stuck
Gensim is just using a basic `urllib` HTTP request here. If you consistently get this same failure –
Daniel, Gordon Mohr (2 messages), 10 Apr
cannot import name 'triu' from 'scipy.linalg'
Scipy recently removed these functions after a fairly brief (less than 1 year) 'deprecation'
Ferhat Arslan, Gordon Mohr (3 messages), 10 Apr
Unexpected performance decrease when shared objects are locally compiled
Thanks for the update & confirmation of a possible workaround. If by chance you're on a Linux
Alexey Shkarupin, …, Gordon Mohr (4 messages), 1 Apr
Loading fasttext model from S3
I believe the docs are wrong to suggest that smart_open's S3 support is enough for this operation
Tomáš Holler, Gordon Mohr (2 messages), 28 Mar
Loading WikiCorpus
I haven't tested this, but have you tried specifying a no-op `tokenizer_func`, fitting the
Joseph Emmens, Gordon Mohr (3 messages), 15 Feb
Serialized author-topic-model incorrect document count
That's exactly right, that's my fault for copying the example on the gensim atm documents
santosh.b...@gmail.com, Andrey Kutuzov (3 messages), 8 Feb
How to incorporate timestamp into word embeddings?
Thank you so much, Andrey. I will peruse the articles you shared - beginning with your co-authored
squaaad yang, 11 Dec 2023
How to implement NMF dynamic topic modeling using gensim?
Hi everyone, Recently, I have been doing a dynamic topic modeling project using NMF. I thought the
Jeff Winchell, Gordon Mohr (8 messages), 24 Nov 2023
Streamed Restartable Iterables - Details (for corpus streaming)
I suspect if you were to test your conjecture that "Using a file system vs the fastest dbms is
Sargentini, Thierry; Gordon Mohr (2 messages), 24 Nov 2023
Python 3.9
AFAIK, Gensim builds & passes its test suite on Python-[3.8, 3.9, 3.10, 3.11] for [Ubuntu, MacOS,
Andy Weasley, Gordon Mohr (8 messages), 23 Nov 2023
Errors When Installing Version 3.8.3
Thanks for the update, but keep in mind that given the rather blatant failure-to-do-what-was-intended
Nathan Cassee, 29 Sep 2023
[Request] Experiment on Technical Debt prioritization
We're an international team of academic Software Engineering researchers investigating technical
Danilo Tomasoni, Gordon Mohr (3 messages), 28 Aug 2023
Hard limit on vocab size?
Glad it's sorted. If you *did* want to cap the number of words loaded, you can supply a `limit`
Jaden Rodriguez, Gordon Mohr (2 messages), 23 Aug 2023
Fix Proposals and Troubles with Source
Without more details, unsure what specific source errors you're having. A general guide to
Felix Goldberg, Gordon Mohr (2 messages), 22 Aug 2023
Noob question - how to train a doc2vec model using a built-in corpus?
The Gensim project source code (https://github.com/RaRe-Technologies/gensim/) contains in its `docs/
Jonathan Peters, 1 Aug 2023
Negative log_perplexity
Hello, I created an LDA model from control data and I am trying to calculate the perplexity of my
Jeff Winchell, Gordon Mohr (2 messages), 19 Jul 2023
Need tokenizer/preprocessor for popular pretrained embeddings models
I made a feature-request item in our issue-tracker for this – https://github.com/RaRe-Technologies/
pradeep t, Gordon Mohr (5 messages), 7 Jul 2023
Add custom words to GoogleNews-vectors-negative300.bin pretrained model
Thank you so much for the updates.
Danilo Tomasoni, Gordon Mohr (5 messages), 4 Jul 2023
Load of FastText binary format with mmap='r'
Thank you very much!! It works!
Thanos Tasakos, Gordon Mohr (5 messages), 30 Jun 2023
Gensim KeyedVector load from s3
What a legend! I needed to also monkey-patch the numpyio module , to use smart_open instead of open,
pradeep t, Gordon Mohr (2 messages), 29 Jun 2023
Pretrained model for doc2vec
I don't know of any I'd recommend, & that work with recent Gensim versions. (When I'
Laura, Gordon Mohr (2 messages), 27 Jun 2023
Doc2vec with small corpus
That approach seems within the realm of reason - but ultimately whether it's better for your
Peter Mayhew, Gordon Mohr (11 messages), 13 Jun 2023
Saving Wikidump corpus into Memory map
Note that even training the exact same corpus twice won't result in the *same* vectors.
jeff yang, Gordon Mohr (4 messages), 31 May 2023
Is there anyway to adjust the weight of the node?
I'm not really sure why one would want to "reduce the density around a node". Do you
TRIXIA MAY BELGA, 29 May 2023
LDA topics for Clustering
My goal is to cluster the resulting LDA topics to reduce dimensionality. However I am not sure what
Yan Xu, Gordon Mohr (4 messages), 18 May 2023
Add the similarity threshold to gensim.models.keyedvectors.KeyedVectors.most_similar
That's a good point, given the extra memory required to return the list-of-(word, score) tuples.
Fred R, Gordon Mohr (2 messages), 9 May 2023
How to get context words in gensim word2vec models
Can you clarify with a bit more detail what you mean by "context words"? I ask because once
nicolas valderrama, Gordon Mohr (3 messages), 25 Apr 2023
"Lazily" add documents to TfIdf
Oh, we didn't know this was possible. I'm glad I asked here before making any change. Thanks a
Gabriel L, …, Gordon Mohr (12 messages), 20 Apr 2023
Implementation of Correlated Topic Model
I can understand why you might prefer techniques that exist over those that are purely imaginary,
Danilo Tomasoni, Gordon Mohr (16 messages), 12 Apr 2023
Very different performances if streaming data or reading data from disk
On Wednesday, April 12, 2023 at 5:36:04 AM UTC-7 danilot.l...@gmail.com wrote: Performance in my