Hi folks, I am trying to build an index with this package, using the following versions:
v 5.8 Semantic Vectors
v 5.0.0 Apache Solr
v 4.10.0 (backed-up Lucene index)
When I try to build an index, I use the following command:
java -classpath semanticvectors-5.8.jar pitt.search.semanticvectors.BuildIndex -luceneindexpath C:\Users\tim\workspace\solr-5.0.0\server\solr\collection1\data\index -contentsfields textProperty
I get the following error:
Seedlength: 10, Dimension: 200, Vector type: REAL, Minimum frequency: 0, Maximum frequency: 2147483647, Number non-alphabet characters: 2147483647, Contents fields are: [textproperty]
Initialized LuceneUtils from Lucene index in directory: C:\Users\tim\workspace\solr-5.0.0\server\solr\collection1\data\index
Creating term vectors as superpositions of elemental document vectors ...
Initialized LuceneUtils from Lucene index in directory: C:\Users\tim\workspace\solr-5.0.0\server\solr\collection1\data\index
Creating semantic term vectors ...
There are 24878 terms (and 8592 docs).
Training term vectors for field textproperty
Processed 0 terms ... Exception in thread "main" java.lang.NullPointerException
at pitt.search.semanticvectors.LuceneUtils.getExternalDocId(LuceneUtils.java:191)
at pitt.search.semanticvectors.TermVectorsFromLucene.trainTermVectors(TermVectorsFromLucene.java:158)
at pitt.search.semanticvectors.TermVectorsFromLucene.createTermVectorsFromLuceneImpl(TermVectorsFromLucene.java:115)
at pitt.search.semanticvectors.TermVectorsFromLucene.createTermVectorsFromLucene(TermVectorsFromLucene.java:98)
at pitt.search.semanticvectors.BuildIndex.main(BuildIndex.java:117)
What can I do or change to get this to work?