Hi Julien,
It is a bit difficult to know whether I have got the SolR config right without understanding what "right" is.
For instance I cant understand this code from the SolRWriter.java
// TODO document this param on the wiki
// process solr.annotations.list
String list = job.get("solr.annotations.list");
if (list == null || list.trim().length() == 0) {
return;
}
String[] names = list.split("\\s+");
for (String name : names) {
// support all annotations denoted by '*'
if (name.equals("*")) {
includeAllAnnotations = true;
} else {
String solrFieldName = "annotation_" + name;
populateMapping(solrFieldName, name);
}
}
}
I set that parameter to "*" not knowing what I should set it to. It now seems that SolR is choking when trying to import documents sent to it by Behemoth with the error....
ERROR - 2013-10-07 16:13:33.308; org.apache.solr.common.SolrException; org.apache.solr.common.SolrException: ERROR: [doc=file:///home/alex/projects/documentFileNameRedacted] unknown field 'annotation_meta.content'
at org.apache.solr.update.DocumentBuilder.toDocument(DocumentBuilder.java:174)
at org.apache.solr.update.AddUpdateCommand.getLuceneDocument(AddUpdateCommand.java:73)
at org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandler2.java:210)
Now I dont know whether I am supposed to have a field annotation_meta, or not. What is the relevance of the ".content" on the end.
I have added other fields into SolR eg
<field name="persontitle" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="jobtitle" type="text_general" indexed="true" stored="true"/>
<field name="skill" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="organisation" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="company" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="address" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="person" type="text_general" indexed="true" stored="true"/>
<field name="qualification" type="text_general" indexed="true" stored="true" multiValued="true"/>
<field name="university" type="text_general" indexed="true" stored="true" multiValued="true"/>
Should I also add
<field name="annotation_meta" type="text_general" indexed="true" stored="true" multiValued="true"/>
And others I dont know about?
My issue is that unless there is a simple example I don't really know if a problem is a Behemoth or a SolR problem.
Alex