Groups
Sign in
Groups
DKPro Similarity Users
Conversations
About
Send feedback
Help
DKPro Similarity Users
Contact owners and managers
1–30 of 44
Mark all as read
Report group
0 selected
Павел Кузнецов
,
Torsten Zesch
2
11/23/21
A problem with installation
Did you try to add the dependencies manually as described in the documentation? On Tuesday, November
unread,
A problem with installation
Did you try to add the dependencies manually as described in the documentation? On Tuesday, November
11/23/21
LL Z
1/9/21
Lucene Index for Wikipedia
Hi, Thanks a lot for developing this wonderful tool. I am particularly interested in the ESA
unread,
Lucene Index for Wikipedia
Hi, Thanks a lot for developing this wonderful tool. I am particularly interested in the ESA
1/9/21
Richard Eckart de Castilho
4/10/18
DKPro Similarity 2.3.0 released
We are pleased to announce the release of ### DKPro Similarity 2.3.0 an open source framework for
unread,
DKPro Similarity 2.3.0 released
We are pleased to announce the release of ### DKPro Similarity 2.3.0 an open source framework for
4/10/18
Alain Désilets
,
Torsten Zesch
2
3/19/18
Exclude stopwords when comparing similarity of documents
There is no such functionality in DKPro Similarity itself. The idea is to use eg StopWordRemover from
unread,
Exclude stopwords when comparing similarity of documents
There is no such functionality in DKPro Similarity itself. The idea is to use eg StopWordRemover from
3/19/18
Farjana Khan
,
Torsten Zesch
2
3/12/18
TypeTokenRatioComparator ----Stylistic Similarity (Package dkpro.similarity.algorithms.style)
This comparator requires Token and Lemma annotations. If you create them accordingly, instead of the
unread,
TypeTokenRatioComparator ----Stylistic Similarity (Package dkpro.similarity.algorithms.style)
This comparator requires Token and Lemma annotations. If you create them accordingly, instead of the
3/12/18
José Martínez
,
Torsten Zesch
7
3/12/18
WordNet based measures
Another comment: for debugging it would be interesting if you could share the resources.xml you are
unread,
WordNet based measures
Another comment: for debugging it would be interesting if you could share the resources.xml you are
3/12/18
Alain Désilets
,
Torsten Zesch
4
3/8/18
How to use Mallet Embeddings?
Dear Alain, what you describe sounds like the right way to go. AFAIK this does not exist for plain
unread,
How to use Mallet Embeddings?
Dear Alain, what you describe sounds like the right way to go. AFAIK this does not exist for plain
3/8/18
elgesto elge
,
Farjana Khan
2
2/2/18
Word embedding file for LSA similarity measure
Dear elgesto, Did you solve this issue.If solve but how.Could you share with me please... On Friday,
unread,
Word embedding file for LSA similarity measure
Dear elgesto, Did you solve this issue.If solve but how.Could you share with me please... On Friday,
2/2/18
Alain Désilets
,
Torsten Zesch
6
1/26/18
Speeding up ESA similarity
It's the first one. Word vectors to memory. Alain Désilets <alainde...@gmail.com>
unread,
Speeding up ESA similarity
It's the first one. Word vectors to memory. Alain Désilets <alainde...@gmail.com>
1/26/18
Ann
,
Torsten Zesch
4
3/6/17
Building ESA indexes
If you are not familiar with UIMA pipelines and/or how to extract a text version out of Wikipedia, it
unread,
Building ESA indexes
If you are not familiar with UIMA pipelines and/or how to extract a text version out of Wikipedia, it
3/6/17
Anna Kazantseva
, …
Torsten Zesch
15
3/6/17
using SemEval 2013 baseline
That is hard to resolve from here. Are you using the latest snapshot? I think you need to checkout at
unread,
using SemEval 2013 baseline
That is hard to resolve from here. Are you using the latest snapshot? I think you need to checkout at
3/6/17
Ann
,
Torsten Zesch
2
2/8/17
NaN as a result of "getSimilarity"
Hi, are you sure that the texts for which the similarity is computed are really the same? It might
unread,
NaN as a result of "getSimilarity"
Hi, are you sure that the texts for which the similarity is computed are really the same? It might
2/8/17
PW
, …
Torsten Zesch
8
2/7/17
setting up dkpro-similarity
Due to dependency problems, the wordnet-based modules are not on maven central (yet). However, you
unread,
setting up dkpro-similarity
Due to dependency problems, the wordnet-based modules are not on maven central (yet). However, you
2/7/17
Анна Крюкова
,
Nicolai Erbs
2
1/28/17
Similarity Measures and Wikipedia
Hi, we don't have any vector indexes for Russian pre-computed, but you can create your own using
unread,
Similarity Measures and Wikipedia
Hi, we don't have any vector indexes for Russian pre-computed, but you can create your own using
1/28/17
Aya Mohamed
11/30/16
Getting the similar sentences
Hi all, How to get the similar sentences and words that represent the similarity's percentage
unread,
Getting the similar sentences
Hi all, How to get the similar sentences and words that represent the similarity's percentage
11/30/16
Dmitry Kravchenko
10/3/16
No WikipediaBasedComparator usage sample
Hi, I need to use WikipediaBasedComparator, but I have not found any code with usage of
unread,
No WikipediaBasedComparator usage sample
Hi, I need to use WikipediaBasedComparator, but I have not found any code with usage of
10/3/16
Dmitry Kravchenko
,
Richard Eckart de Castilho
2
10/2/16
Starting the code first time
On 02.10.2016, at 14:22, Dmitry Kravchenko <mar.d...@gmail.com> wrote: > > Hi, > I
unread,
Starting the code first time
On 02.10.2016, at 14:22, Dmitry Kravchenko <mar.d...@gmail.com> wrote: > > Hi, > I
10/2/16
Chen Wu
,
Torsten Zesch
5
6/20/16
Problem about build my own ESA Index
Hi, I find this class now, but there are no function createLuceneWikipediaIndex() in esaindexer.
unread,
Problem about build my own ESA Index
Hi, I find this class now, but there are no function createLuceneWikipediaIndex() in esaindexer.
6/20/16
Josep Maria Formentí
,
Torsten Zesch
3
9/9/15
How to use Wikipedia index
Thanks, I'll continue investigating El dimarts, 8 setembre de 2015 15:07:23 UTC+2, Torsten Zesch
unread,
How to use Wikipedia index
Thanks, I'll continue investigating El dimarts, 8 setembre de 2015 15:07:23 UTC+2, Torsten Zesch
9/9/15
Paulo Avelar
,
Torsten Zesch
3
7/30/15
first timer on DKPro - I need help understanding how similarity works.
Hi Torsten, I'm experimenting with other types of "measure" and I'm getting better
unread,
first timer on DKPro - I need help understanding how similarity works.
Hi Torsten, I'm experimenting with other types of "measure" and I'm getting better
7/30/15
lingjia deng
6/9/15
Problem of locating txt files
hi everyone, I have successfully run the DKPro Similarity 2013 baseline trunk. Everything went well
unread,
Problem of locating txt files
hi everyone, I have successfully run the DKPro Similarity 2013 baseline trunk. Everything went well
6/9/15
Andrea Apicella
,
Torsten Zesch
8
3/14/15
similarity between categories in wikipedia
> <dependency> > <groupId>de.tudarmstadt.ukp.similarity.algorithms</groupId>
unread,
similarity between categories in wikipedia
> <dependency> > <groupId>de.tudarmstadt.ukp.similarity.algorithms</groupId>
3/14/15
Torsten Zesch
8/19/14
Problems with CosineSimilarity implementation
Dear users, recently, some serious issues with the implementation of CosineSimilarity have surfaced.
unread,
Problems with CosineSimilarity implementation
Dear users, recently, some serious issues with the implementation of CosineSimilarity have surfaced.
8/19/14
Torsten Zesch
8/15/14
New DKPro Similarity developers list
Dear dkpro developers and users, I have created a new dkpro similarity developers list mainly to have
unread,
New DKPro Similarity developers list
Dear dkpro developers and users, I have created a new dkpro similarity developers list mainly to have
8/15/14
Benjamin Klein
,
Torsten Zesch
8
8/6/14
Problem with Models for Lexical Substitution
ok, that looks correct. I am sorry, but all I can think of now is that you have to debug the TWSI
unread,
Problem with Models for Lexical Substitution
ok, that looks correct. I am sorry, but all I can think of now is that you have to debug the TWSI
8/6/14
Axel Schulz
, …
Raihan Ul Islam
5
7/18/14
Missing files for WordNet
Hi Torsten, Thanks for your valuable suggestion. I excluded the old version of jwnl from maven
unread,
Missing files for WordNet
Hi Torsten, Thanks for your valuable suggestion. I excluded the old version of jwnl from maven
7/18/14
Raihan Ul Islam
6/23/14
Function Word List
Dear All, I am trying to use the FunctionWordFrequenciesMeasure . There is a default list of function
unread,
Function Word List
Dear All, I am trying to use the FunctionWordFrequenciesMeasure . There is a default list of function
6/23/14
Raihan Ul Islam
6/23/14
Time Consumtion in different similarity algorithm
Dear Concern, I am using most of the text similarity algorithms in my project. It seems there are
unread,
Time Consumtion in different similarity algorithm
Dear Concern, I am using most of the text similarity algorithms in my project. It seems there are
6/23/14
Raihan Ul Islam
, …
Torsten Zesch
8
6/19/14
JiangConrathComparator for wikitionary is taking longer time to run
I agree, this should be fixed in LSR. However, I will probably be very slow to fix that. Someone
unread,
JiangConrathComparator for wikitionary is taking longer time to run
I agree, this should be fixed in LSR. However, I will probably be very slow to fix that. Someone
6/19/14
Raihan Ul Islam
,
Torsten Zesch
3
6/3/14
How to use LatentSemanticAnalysis without dkproCore
I have added a wrapper and a test to the sspace module in the latest snapshot. -Torsten 2014-06-03 14
unread,
How to use LatentSemanticAnalysis without dkproCore
I have added a wrapper and a test to the sspace module in the latest snapshot. -Torsten 2014-06-03 14
6/3/14