--
You received this message because you are subscribed to the Google Groups "Semantic Vectors" group.
To unsubscribe from this group and stop receiving emails from it, send an email to semanticvecto...@googlegroups.com.
To post to this group, send email to semanti...@googlegroups.com.
Visit this group at https://groups.google.com/group/semanticvectors.
For more options, visit https://groups.google.com/d/optout.
Thanks Dominic for replying. Brain storming ideas will be extremely helpful.Originally, I would like to improve an existed application we have that is mainly using Lucene for indexing and searching. Lucene is using TF-IDF with cosine as the similarity metric for scoring, but I found that might not be efficient cause our data is short for each document ( by the way we have like a million of documents), and in my experience, I found that LSA concept works much better and returns efficient results.Because I'm new to Lucene and Java, I implemented the first application of Lucene in the link I mentioned earlier ( just the single page) for this data set, it's just about simple indexing and searching in Lucene, I didn't go to the other pages in the link for now. Now, My next step is to change the similarity that Lucene is using ( TF-IDF) with LSA, the details is in here https://lucene.apache.org/core/3_5_0/api/core/org/apache/lucene/search/package-summary.html#changingSimilaritySo in the official website of Lucene, it's said you can change the similarity when you know your data needs to, so that's why I need! I want Lucene to keep doing what it's doing for indexing and searching, but instead of using (TF-IDF) as explained in the link above, I want to use LSA ( semantic vector), and I found this package. Do you think semantic Vector package will be helpful? I found that link to build the models by command lines https://github.com/semanticvectors/semanticvectors/wiki/InstallationInstructions#to-build-and-search-a-modelbut were I able to do the regualar Indexwriter, indexSearcher ( indexing and searching procedures of Lucene) with the Semantic Vector Package? If not, Can I integrate the SV Package in somehow?
So I got that the user will write the input query from the command line. Thanks for that. However, I'm waiting for the answers of the rest of the questions. Thanks again.
On Monday, July 18, 2016 at 11:53:16 AM UTC-6, Hadeel Maryoosh wrote:Sorry but one more question, Is the term vector here the same as the index in Lucene? Cause I already have the index in some directory, so if it is the same, this would save for me some efforts I think..