When using this cli (or the same programatically):
java -cp ~/Downloads/semanticvectors-5.8.jar pitt.search.semanticvectors.BuildIndex -trainingcycles <NUMBER> -luceneindexpath ../luceneIndex/Text -termtermvectorsfile ./termsvectorsfile.bin -docvectorsfile ./docvectorsfile.bin -docindexing incremental
I get two termvectors outputfiles:
The first is "TERMVECTORSFILENAME.BIN", the Second is "TERMVECTORSFILENAME<NUMBER>.BIN"
So if i use trainingcycles 2 i get termsvectorsfile2.bin, if i use 3, its termsvectorsfile3.bin additionally to the default termsvectorsfile.bin
This behaviour is not observed when using LSA or Positional Indexing with the same parameters.
Is this a bug or intended? Which termsvectorsfile should I use for search?
Thanks,
Michael