Hmm, I made the suggested changes, and my new class IndexFilesPositionsLines worked fine, but BuildPositionalIndex now fails while writing the document vectors:
Processed 2000000 documents
Created 268384 term vectors ...
About to write 268384 vectors of dimension 200 to Lucene format file: elementalvectors.bin ... finished writing vectors.
Initialized LuceneUtils from Lucene index in directory: positional_index
Fields in index are: line_number, modified, contents
Writing vectors incrementally to file docvectors.bin ...
Oct 22, 2020 7:47:05 PM pitt.search.semanticvectors.LuceneUtils getExternalDocId
SEVERE: Failed to get external doc ID from doc no. 0 in Lucene index.
This is almost certain to lead to problems.
Check that -docidfield was set correctly and exists in the Lucene index
Exception in thread "main" java.lang.NullPointerException
    at pitt.search.semanticvectors.LuceneUtils.getExternalDocId(LuceneUtils.java:200)
    at pitt.search.semanticvectors.IncrementalDocVectors.trainIncrementalDocVectors(IncrementalDocVectors.java:122)
    at pitt.search.semanticvectors.IncrementalDocVectors.createIncrementalDocVectors(IncrementalDocVectors.java:93)
    at pitt.search.semanticvectors.BuildPositionalIndex.main(BuildPositionalIndex.java:176)