java pitt.search.semanticvectors.LSA -termweight idf -luceneindexpath positional_index/
Set up LSA indexer.
Dimension: 200 Minimum frequency = 0 Maximum frequency = 2147483647 Number non-alphabet characters = 2147483647
There are -1 terms (and 480 docs).
Exception in thread "main" java.lang.NegativeArraySizeException
    at pitt.search.semanticvectors.LSA.smatFromIndex(LSA.java:96)
    at pitt.search.semanticvectors.LSA.main(LSA.java:240)

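The exception itself can be reproduced outside SemanticVectors: Java throws NegativeArraySizeException whenever an array is allocated with a negative length, which is consistent with the "There are -1 terms" line in the log. The snippet below is only a sketch of that mechanism under that assumption. `TermCountCheck` and `allocateTermCounts` are hypothetical names, not part of SemanticVectors, and nothing here claims to show what LSA.java actually does at line 96.

```java
public class TermCountCheck {
    // Hypothetical helper (not SemanticVectors code): allocate one slot per
    // term, but fail with a readable message when the count is negative.
    static int[] allocateTermCounts(int numTerms) {
        if (numTerms < 0) {
            throw new IllegalStateException(
                "Index reported " + numTerms
                + " terms; refusing to allocate a negative-sized array.");
        }
        return new int[numTerms];
    }

    public static void main(String[] args) {
        int reportedTerms = -1; // the value printed in the log above

        // Unchecked allocation: this is what raises NegativeArraySizeException.
        try {
            int[] counts = new int[reportedTerms];
            System.out.println("Allocated " + counts.length + " slots");
        } catch (NegativeArraySizeException e) {
            System.out.println("Unchecked allocation failed: " + e);
        }

        // Checked allocation: same input, clearer diagnostic.
        try {
            allocateTermCounts(reportedTerms);
        } catch (IllegalStateException e) {
            System.out.println("Checked allocation failed: " + e.getMessage());
        }
    }
}
```

A guard like this does not fix the underlying index problem; it only makes the failure point to the term count rather than to the array allocation.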
java pitt.search.semanticvectors.BuildPositionalIndex -windowradius 2 -luceneindexpath positional_index/
Building positional index, Lucene index: positional_index/, Seedlength: 10, Vector length: 200, Vector type: REAL, Minimum term frequency: 0, Maximum term frequency: 2147483647, Number non-alphabet characters: 2147483647, Window radius: 2, Fields to index: [contents]
Created basic term vectors for 128740 terms (and 480 docs).
Processed 0 documents ... Created 128740 term vectors ...
Normalizing term vectors.
About to write 128740 vectors of dimension 200 to Lucene format file: termtermvectors.bin ... finished writing vectors.
Writing vectors incrementally to file docvectors.bin ...
Oct 19, 2013 3:35:12 AM pitt.search.semanticvectors.IncrementalDocVectors trainIncrementalDocVectors
SEVERE: No term vector for document 246
Finished writing vectors.
--
You received this message because you are subscribed to the Google Groups "Semantic Vectors" group.
To unsubscribe from this group and stop receiving emails from it, send an email to semanticvecto...@googlegroups.com.
To post to this group, send email to semanti...@googlegroups.com.
Visit this group at http://groups.google.com/group/semanticvectors.
For more options, visit https://groups.google.com/groups/opt_out.
I can attest that it was indeed parallelcolt that was missing. Thanks!
--
You received this message because you are subscribed to a topic in the Google Groups "Semantic Vectors" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/semanticvectors/LOO1hPro5lI/unsubscribe.
To unsubscribe from this group and all its topics, send an email to semanticvecto...@googlegroups.com.
I'm using Solr 4.5 (which I assume uses a Lucene 4.5 index) and I have the same problem with SV 4.0.
Any suggestions, or is there no way to use LSA with Solr 4.5?
java -cp ../semanticvectors-5.4.jar org.apache.lucene.demo.IndexFiles -docs bible_chapters -index luceneindex
java -Xmx1G -cp ../semanticvectors-5.4.jar pitt.search.semanticvectors.LSA -luceneindexpath luceneIndex
Set up LSA indexer.
Dimension: 200 Minimum frequency = 0 Maximum frequency = 2147483647 Number non-alphabet characters = 2147483647
There are 12785 terms (and 1190 docs).
Starting SVD using algorithm LAS2 ...
Wrote 12785 term vectors incrementally to file termvectors.
Wrote 1190 document vectors incrementally to file docvectors. Done.