Semantic Vectors
1–30 of 357
Welcome to the Semantic Vectors group! This group is for users of the Semantic Vectors Package to share ideas and ask for help where necessary. Feel free also to search the project Wiki for ideas and solutions to common problems.
Ahmet Arslan
, …
Dominic Widdows
7
10/27/20
comparing two terms
Thanks again, Trevor. On Tue, Oct 27, 2020 at 6:59 AM Trevor Cohen <trev...@gmail.com> wrote:
Yi Sun
, …
Dominic
8
10/23/20
Generate features for millions of document
Thanks for the tip, Trevor - hopefully IndexFlatFilePositions works fine. (That's the advice so
TopicModeler
,
Trevor Cohen
3
4/7/18
Number of documents returned
Worked great! Thanks. On Saturday, April 7, 2018 at 11:43:49 AM UTC-5, Trevor Cohen wrote: These
TopicModeler
, …
Trevor Cohen
11
2/16/18
Document Searching
I'd suggest placing each phrase on its own line, removing the pipes, and running SearchBatch
needHelp
,
Dominic
2
2/5/18
initial phase to start work in my project.
Hello there, Please see https://github.com/semanticvectors/semanticvectors/wiki/GettingStarted for
Dhiraj Singh
,
Dominic
2
9/13/17
Command line flag not defined: vectorsearchfilterregex
Looks like documentation at https://github.com/semanticvectors/semanticvectors/wiki/
Rain
, …
Michael Ruepp
15
1/14/17
How to use Semantic Vector API like Lucene to index and search document
Hi Michael , Sorry but can you reup-load your software ? I can't download it @@ . See my picture
Teodor Dimov
,
Dominic Widdows
4
10/5/16
Random Indexing
My main thoughts for such a small corpus is to try various options and see what results you get. This
SoftwareEngineer
,
Dominic Widdows
3
8/8/16
Multiple fields
To say which fields you want indexed, use the --contentsfields flag. But only the strings get indexed
SoftwareEngineer
,
Dominic Widdows
3
8/5/16
Returning document instead of terms
Thanks Dominic.. Some of the links are still broken that I couldn't reach that page. On Monday,
Dominic Widdows
7/26/16
Re: Unrecognized option : -luceneindexpath
There's no classpath after your -cp. On Tue, Jul 26, 2016 at 4:27 PM, SoftwareEngineer <hadeel
Dominic Widdows
2
7/26/16
Re: BuildIndex command line
The tokenization (or rather, basic string splitting) is done in this line: https://github.com/
Hadeel Maryoosh
,
Dominic Widdows
2
7/22/16
Trying to Use BuildIndex class to generate term vector file
Hi Hadeel, If you want to search your own index using the example client, you don't want to build
Dominic Widdows
4
7/22/16
Re: Errors when implementing search using API instead of command lines
BuildIndex is a class not a method. BuildIndex.main is a standard java main method. It doesn't
Hadeel Maryoosh
,
Dominic Widdows
16
7/18/16
Using Semantic Vectors to override similarity in Lucene
Hi Hadeel, These lines: VectorSearcher searcher = new VectorSearcher.VectorSearcherCosine(
Tim Hearn
,
Dominic Widdows
6
7/13/16
IndexTooNewException - for Lucene 4.10?
Yes, I still get a few personal inquiries from time to time as well. Traffic on this list is
Nikola Morena
,
Dominic Widdows
2
11/17/15
Zero document vectors for default corpus
Hi Nikola, As I mentioned offline, I'd like to confirm that this is a regression (that I should
Tim Hearn
,
Dominic
4
11/10/15
NullPointerException when building Index from Solr Index
Belatedly, I took a look at this and the message ending with "Check that -docidfield was set
José Tomás Atria
,
Dominic
5
10/20/15
API questions
I'll see what I can come up with :) So far I only added a few method signatures here and there
Michael Ruepp
, …
Dominic Widdows
4
8/7/15
Negative Values Scores from CompareTerms
Yes, higher scores give greater similarities. People frequently assume that similarities would be
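As a rough illustration of why scores can be negative: assuming real-valued vectors compared by cosine similarity, the score ranges from -1 to 1, and higher still means more similar. The sketch below is plain Java and does not use the Semantic Vectors API; the class and method names are hypothetical.

```java
public class CosineSketch {
    // Cosine similarity of two equal-length real vectors.
    // Returns a value in [-1, 1]; a zero vector yields NaN (0.0 / 0.0).
    static double cosine(double[] a, double[] b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    public static void main(String[] args) {
        double[] x = {1.0, 0.0};
        double[] same = {2.0, 0.0};      // same direction as x
        double[] opposite = {-1.0, 0.0}; // opposite direction to x
        System.out.println(cosine(x, same));
        System.out.println(cosine(x, opposite));
    }
}
```

Vectors pointing in opposite directions score below zero, so a negative value is a valid (low) similarity rather than an error.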
Michael Ruepp
,
Dominic
4
8/7/15
Sometimes getting NaN back as Score with CompareTerms
OK I got it, i will check against Double.NaN and exclude it from the result. On Thursday, August 6,
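One caveat when excluding NaN scores in Java: a comparison such as `score == Double.NaN` is always false, so `Double.isNaN` is the check to use. The sketch below is plain Java with hypothetical names; it does not call the Semantic Vectors API, and the cause of the NaN scores in this thread is not shown in the snippet.

```java
import java.util.ArrayList;
import java.util.List;

public class NaNFilterSketch {
    // Returns the scores with every NaN entry removed, preserving order.
    static List<Double> dropNaN(double[] scores) {
        List<Double> kept = new ArrayList<>();
        for (double s : scores) {
            // Double.isNaN is required here: (s == Double.NaN) never matches.
            if (!Double.isNaN(s)) {
                kept.add(s);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        double[] scores = {0.5, Double.NaN, -0.2};
        System.out.println(dropNaN(scores));
    }
}
```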
Laurent Kevers
,
Dominic Widdows
3
8/7/15
Default options values for BuildIndex
Thank you Dominic for this answer! I currently use Semantic Vectors 5.8 JAR from the Maven Repository
Dominic
,
Laurent Kevers
3
8/6/15
Completed move to github
Thanks a lot Laurent. I've fixed those ones. There are probably more kicking around the place
Michael Ruepp
,
Dominic
2
7/31/15
-matchcase programmatically does not work (no search output but vector found), cli works though...
Sorry, this is strange and I'm unlikely to have time to investigate for at least a few days. A
Michael Ruepp
,
Trevor Cohen
3
7/23/15
Getting no vectors when using two terms
Hi, yes, I did not construct the Sentences as String[] of Words but as one String. Thanks for the
Michael Ruepp
,
Dominic Widdows
5
7/22/15
Stop Indexing Task
Hi Michael, Option 3 sounds like the most disruptive to the codebase, so while you're welcome to
Al
,
Dominic Widdows
12
7/19/15
terms weights
Hi, Dominic Thanks to "unblock" me, and well, I understand that my options are not yet
Michael Ruepp
, …
Michael Ruepp
3
7/18/15
Second Termvectorsfile when using buildIndex and -trainingcycles <NUMBER> -docindexing incremental
So trainingcycles not mandatorily improves search quality? Could I live without? Also, is there any
Michael Ruepp
, …
Michael Ruepp
8
7/15/15
Increase parallelism with index building
Also whats interesting, if I use docindexing incremental, I end up by having a second file named:
Michael Ruepp
,
Dominic Widdows
2
7/15/15
Lucene positional Index vs no position information
Hi Michael, If you have the space and the patience, then always build positional indexes. You are