IndexTooNewException - for Lucene 4.10?

111 views
Skip to first unread message

Tim Hearn

unread,
Jan 15, 2016, 12:48:04 AM1/15/16
to Semantic Vectors
Hi all,

I am attempting to run Semantic Vectors with the following specs:

Semantic Vectors v 5.6
Apache solr v 5.0.0
Lucene Index v 4.10.0

I would like to build an index to begin.  When I try to build the index with the following commands I get the following error:

Commands:

"-docindexing", "incremental"
-contentsfields", "textproperty",
"-docidfield", "id",
"-termvectorsfile", vectorStoreLocStr + "/termvectorsreg.bin", 
"-docvectorsfile", vectorStoreLocStr + "/docvectorsreg.bin",
"-luceneindexpath" , luceneDir };

luceneDir is a directory created with a solr index, match version 4.10


Caused by: org.apache.lucene.index.IndexFormatTooNewException: Format version is not supported (resource: BufferedChecksumIndexInput(MMapIndexInput(path="C:\Users\tim\workspace\solrWorkspace\solr-5.0.0\server\solr\collection1\data\index\segments_2"))): 4 (needs to be between 0 and 3)
at org.apache.lucene.codecs.CodecUtil.checkHeaderNoMagic(CodecUtil.java:156)
at org.apache.lucene.index.SegmentInfos.read(SegmentInfos.java:351)
at org.apache.lucene.index.StandardDirectoryReader$1.doBody(StandardDirectoryReader.java:57)
at org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:913)
at org.apache.lucene.index.StandardDirectoryReader.open(StandardDirectoryReader.java:53)
at org.apache.lucene.index.DirectoryReader.open(DirectoryReader.java:67)
at pitt.search.semanticvectors.LuceneUtils.<init>(LuceneUtils.java:108)




Can anyone help? 

Dominic Widdows

unread,
Jan 15, 2016, 12:55:05 AM1/15/16
to semanti...@googlegroups.com
Hi Tim,

I'm travelling at the moment so won't be available to help much. I'm surprised because Lucene 4.10 and SemanticVectors 5.6 work together (at least under normal curcumstances). See https://github.com/semanticvectors/semanticvectors/wiki/LuceneCompatibility

Is there any way one can verify that an index created by Solr 5.0.0 can be read with plain Lucene 4.10? It would be good to at least eliminate that as a possible source of the problem.

Best wishes,
Dominic

--
You received this message because you are subscribed to the Google Groups "Semantic Vectors" group.
To unsubscribe from this group and stop receiving emails from it, send an email to semanticvecto...@googlegroups.com.
To post to this group, send email to semanti...@googlegroups.com.
Visit this group at https://groups.google.com/group/semanticvectors.
For more options, visit https://groups.google.com/d/optout.

Tim Hearn

unread,
Jan 15, 2016, 12:03:53 PM1/15/16
to Semantic Vectors
I was surprised too.  You have a good point.  All indexes created by solr are lucene indexes, so they should be readable by each other.


I changed the luceneMatchVersion to 5.0.0 just in case, and it still did not work - I got the exact same exception (it still thinks that it's looking at a v4 index!)

Tim Hearn

unread,
Jan 25, 2016, 4:17:35 PM1/25/16
to Semantic Vectors
Have you found any new info?

Dominic

unread,
Jan 28, 2016, 2:58:44 PM1/28/16
to Semantic Vectors
Hi Tim,

I haven't investigated because it sounded like a Lucene / SOLR problem. Do you have an index that you can read with Lucene but can't read with semanticvectors using the same version of Lucene? If so that's a semanticvectors problem. On the other hand if you have an index created by SOLR that you can't open at all with Lucene APIs, that's a problem to raise on Lucene forums.

Best wishes,
Dominic
Message has been deleted

Dominic Widdows

unread,
Jul 13, 2016, 4:47:46 PM7/13/16
to semanti...@googlegroups.com
Yes, I still get a few personal inquiries from time to time as well. Traffic on this list is typically sporadic, and it hasn't been busy recently.

-Dominic

On Wed, Jul 13, 2016 at 12:10 PM, Hadeel Maryoosh <hadeel....@gmail.com> wrote:
Is anyone still using Semantic Vectors? It looks like this is the most recent post here, which was 6 months ago!

--
Reply all
Reply to author
Forward
0 new messages