OK, after a lot of work, I found a solution. The author fields extracted by Tika are referenced as 'literal' fields, which can be extracted directly. Unfortunately, there are several of them: 'Author', 'Last-Author', and 'creator'.
So I created a field called 'author' in schema.xml that is multivalued, stored, and indexed. Then I added these lines to the /update/extract requestHandler in solrconfig.xml:
<str name="fmap.author">author</str>
<str name="fmap.creator">author</str>
<str name="fmap.last_author">author</str>
Hence, all values found in any of these fields are stored in 'author'.
Furthermore, I introduced another field 'authorname' into schema.xml, which is not multivalued, but has a custom type 'author'.
The custom type 'author' is created using the custom add script found in the conf/custom_schema directory:
{
  "name":"author",
  "class":"solr.TextField",
  "positionIncrementGap":"100",
  "analyzer" : {
    "charFilters":[
      {
        "class":"solr.PatternReplaceCharFilterFactory",
        "pattern":",.*?$",
        "replacement":""
      }
    ],
    "filters":[
      {"class":"solr.TrimFilterFactory"},
      {
        "class":"solr.PatternReplaceFilterFactory",
        "pattern":"(^\\b[a-zA-Z0-9/-_äöüÄÖÜ]+\\s)|(\\b[a-zA-Z0-9/-_äöüÄÖÜ]+(.?)\\s)",
        "replacement":""
      },
      {"class":"solr.TrimFilterFactory"},
      {"class":"solr.LowerCaseFilterFactory"},
      {"class":"solr.CapitalizationFilterFactory",
       "onlyFirstWord":"false"}
    ],
    "tokenizer":{
      "class":"solr.KeywordTokenizerFactory"
    }
  }
}
This is not strictly needed, but it reduces the retrieved names to the family name, if one is provided.
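To get a feeling for what the 'author' type does to a value, here is a rough JavaScript re-creation of the analyzer chain. It is only a sketch under a few assumptions: reduceToFamilyName is a made-up helper name, the two patterns are copied verbatim from the type definition (including the `/-_` range inside the character class), and JavaScript treats \b differently from Java for non-ASCII letters, so names with umlauts may come out differently in Solr.

```javascript
// Rough re-creation of the 'author' analyzer chain; a sketch, not Solr code.
// reduceToFamilyName is a hypothetical helper name used only for illustration.
function reduceToFamilyName(raw) {
  // PatternReplaceCharFilterFactory: drop everything from the first comma on.
  let s = raw.replace(/,.*?$/g, "");
  // KeywordTokenizerFactory keeps the whole value as a single token, so the
  // remaining steps operate on the full string.
  s = s.trim(); // TrimFilterFactory
  // PatternReplaceFilterFactory: remove words (and initials such as "A.")
  // that are followed by whitespace, which leaves the last word.
  const leadingWords = new RegExp(
    "(^\\b[a-zA-Z0-9/-_äöüÄÖÜ]+\\s)|(\\b[a-zA-Z0-9/-_äöüÄÖÜ]+(.?)\\s)",
    "g"
  );
  s = s.replace(leadingWords, "");
  s = s.trim();        // TrimFilterFactory
  s = s.toLowerCase(); // LowerCaseFilterFactory
  // CapitalizationFilterFactory with onlyFirstWord=false: capitalize the
  // first letter of each space-separated word.
  return s.replace(/(^|\s)(\S)/g, (match, pre, ch) => pre + ch.toUpperCase());
}

console.log(reduceToFamilyName("John Doe"));
console.log(reduceToFamilyName("Doe, John"));
console.log(reduceToFamilyName("John A. Doe"));
```

All three calls print Doe, so "First Last", "Last, First", and names with a middle initial should end up in the same facet bucket.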
The 'datafari' update processor chain is then extended with:
<processor class="solr.LastFieldValueUpdateProcessorFactory">
  <str name="fieldName">author</str>
</processor>
<processor class="solr.TrimFieldUpdateProcessorFactory">
  <str name="fieldName">author</str>
</processor>
<processor class="solr.CloneFieldUpdateProcessorFactory">
  <str name="source">author</str>
  <str name="dest">authorname</str>
</processor>
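The effect of these three processors on a document can be sketched in plain JavaScript. This is an illustration only, not Solr code: the function names and the sample document are made up, and which author value ends up last depends on the order in which Tika delivers the metadata.

```javascript
// Hypothetical stand-ins for the three Solr update processors above.

// LastFieldValueUpdateProcessorFactory: keep only the last value of a field.
function lastFieldValue(doc, field) {
  const v = doc[field];
  if (Array.isArray(v) && v.length > 0) {
    return { ...doc, [field]: v[v.length - 1] };
  }
  return doc;
}

// TrimFieldUpdateProcessorFactory: strip leading/trailing whitespace.
function trimField(doc, field) {
  const v = doc[field];
  return typeof v === "string" ? { ...doc, [field]: v.trim() } : doc;
}

// CloneFieldUpdateProcessorFactory: copy the source value into dest
// (dest is assumed to be empty beforehand in this sketch).
function cloneField(doc, source, dest) {
  return source in doc ? { ...doc, [dest]: doc[source] } : doc;
}

// Apply the processors in the same order as in the chain.
function runAuthorChain(doc) {
  const last = lastFieldValue(doc, "author");
  const trimmed = trimField(last, "author");
  return cloneField(trimmed, "author", "authorname");
}

// Made-up sample document with values collected via the fmap.* mappings.
const result = runAuthorChain({ id: "doc1", author: [" Jane Roe", "John Doe "] });
console.log(result);
```

For the sample document, both author and authorname end up as the trimmed last value, "John Doe"; the reduction to the family name then happens when authorname is analyzed at index time.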
I also added authorname to the qf/pf fields in solrconfig.xml.
Voilà! The rest has to be done as described in the link above. I also added a custom facet 'authors' to search.js, which displays them:
Manager.addWidget(new AjaxFranceLabs.TableWidget({
  elm : $('#facet_author'),
  id : 'facet_author',
  field : 'authorname',
  name : window.i18n.msgStore['author'],
  pagination : true,
  selectionType : 'OR',
  sort : 'AtoZ',
  maxDiplay: 100,
  returnUnselectedFacetValues : true
}));
Now it works!