RE: [VuFind-Tech] Source Code Solrmarc

41 views
Skip to first unread message

Demian Katz

unread,
Aug 29, 2012, 7:01:59 AM8/29/12
to Hannah Ullrich, VuFind Tech Mailinglist, solrma...@googlegroups.com
You can find the source code for tagged releases of SolrMarc here if you need a specific older version:

http://code.google.com/p/solrmarc/source/browse/#svn%2Ftags

(VuFind 1.3 uses 2.3, VuFind 1.1/1.2 uses 2.2).

You should also be able to build from the trunk code successfully.

It doesn't matter which version of Solr is bundled with the SolrMarc source -- when you run the ant build routine, you can specify which version to compile against, so you can point the compiler to the .war file included with your copy of VuFind. SolrMarc 2.3 or higher should be able to handle interactions with Solr 3.x.

Also, I'm not sure what you're trying to do, but keep in mind that you can use BeanShell scripts for custom indexing routines, so you may not even have to worry about compiling a custom SolrMarc unless you're doing something especially complex or you need absolute top performance (though BeanShell performs pretty well as-is).

I hope this is helpful, but let me know if you still have questions.

- Demian
________________________________________
From: Hannah Ullrich [hannah....@ub.uni-freiburg.de]
Sent: Wednesday, August 29, 2012 4:54 AM
To: VuFind Tech Mailinglist
Subject: [VuFind-Tech] Source Code Solrmarc

Hi @ all.

we want to integrate custom indexer-routines.
In the VuFindIndexer-README.txt is a link to
http://code.google.com/p/solrmarc/.

But there is only a very old version of Solr (1.4) integrated.
Where can we find the sourcecode for the Solrmarc-version, you use in
VuFind?


Greetings from Germany

Hannah

--
Hannah Ullrich
Fachinformatikerin

Universitaetsbibliothek Freiburg
EDV Dezernat
Rempartstr. 10-16
79098 Freiburg
Tel: +49-761 / 203-3877


Hannah

unread,
Aug 29, 2012, 9:54:49 AM8/29/12
to solrma...@googlegroups.com, Hannah Ullrich
Demian,

thanks for the quick answer.

There a different points, why we want a newer Version of Solr for SolrMarc

- It seems, that solrmarc 'out of the box' cannot handle large (100mb+) syonym-files. With Vufind we can do this.
- We want to use for example ICUTokenizer ....
- We want to seperate frontend from backend.

at the moment we are using SolrMarc with custom Indexroutines, and we don't want to change this way of indexing our datas.


Am Mittwoch, 29. August 2012 13:02:01 UTC+2 schrieb Demian Katz:
You can find the source code for tagged releases of SolrMarc here if you need a specific older version:

http://code.google.com/p/solrmarc/source/browse/#svn%2Ftags

(VuFind 1.3 uses 2.3, VuFind 1.1/1.2 uses 2.2).

 I'm a bit confused...
May be I understand the output of the solr admin info site wrong.

katalog:8080/solr/biblio/admin/registry.jsp
Vufind:
Solr Specification Version: 3.5.0.2011.11.22.14.54.38
Solr Implementation Version: 3.5.0
Lucene Specification Version: 3.5.0
Lucene Implementation Version: 3.5.0

VS.
katlaog:8983/solr/admin/registry.jsp

SolrMarc:

Solr Specification Version: 1.4.0
Solr Implementation Version: 1.4.0 833479
Lucene Specification Version: 2.9.1
Lucene Implementation Version: 2.9.1

You should also be able to build from the trunk code successfully.

It doesn't matter which version of Solr is bundled with the SolrMarc source -- when you run the ant build routine, you can specify which version to compile against, so you can point the compiler to the .war file included with your copy of VuFind.  SolrMarc 2.3 or higher should be able to handle interactions with Solr 3.x.

 We tried this, but it doesn't work....

/local_build/build.properties: solrmarc.solr.war.path = /usr/local/vufind/solr/jetty/webapps/solr.war
then 'ant dist'

Indexing data:
First error Message:

INFO:      Using Solr data dir /data/solrmarc-ng/dist/jetty/solr/data
29.08.2012 15:08:38 org.apache.solr.core.Config getLuceneVersion
WARNUNG: the luceneMatchVersion is not specified, defaulting to LUCENE_24 emulation. You should at some point declare and reindex to at least 3.0, because 2.4 emulation is deprecated and will be removed in 4.0. This parameter will be mandatory in 4.0.
WARNUNG: Deprecated syntax found. <highlighting/> should move to <searchComponent/>
29.08.2012 15:08:38 org.apache.solr.analysis.BaseTokenStreamFactory assureMatchVersion
WARNUNG: WhitespaceTokenizerFactory is using deprecated LUCENE_24 emulation. You should at some point declare and reindex to at least 3.0, because 2.x emulation is deprecated and will be removed in 4.0
29.08.2012 15:08:38 org.apache.solr.analysis.BaseTokenStreamFactory assureMatchVersion
WARNUNG: SynonymFilterFactory is using deprecated LUCENE_24 emulation. You should at some point declare and reindex to at least 3.0, because 2.x emulation is deprecated and will be removed in 4.0
.....
many warnings.....
.....
29.08.2012 15:08:39 org.apache.solr.common.SolrException log
SCHWERWIEGEND: java.lang.AbstractMethodError: org.apache.lucene.analysis.TokenStream.incrementToken()Z
        at org.apache.solr.analysis.SlowSynonymFilter.nextTok(SlowSynonymFilter.java:201)
        at org.apache.solr.analysis.SlowSynonymFilter.incrementToken(SlowSynonymFilter.java:91)
        at org.apache.lucene.analysis.FilteringTokenFilter.incrementToken(FilteringTokenFilter.java:48)

##########################################################
Output after writing the line
<luceneMatchVersion>LUCENE_35</luceneMatchVersion> into the solrconfig.xml (this tag is also in the solrconfig of VuFind)

INFO:      Using Solr data dir /data/solrmarc-ng/dist/jetty/solr/data
29.08.2012 15:19:43 org.apache.solr.core.SolrConfig <init>
WARNUNG: Deprecated syntax found. <highlighting/> should move to <searchComponent/>
29.08.2012 15:19:43 org.apache.solr.analysis.BaseTokenStreamFactory warnDeprecated
WARNUNG: EnglishPorterFilterFactory is deprecated. use PorterStemFilterFactory (Porter1) or SnowballPorterFilterFactory with 'English' (Porter2) instead
29.08.2012 15:19:43 org.apache.solr.analysis.BaseTokenStreamFactory warnDeprecated
WARNUNG: EnglishPorterFilterFactory is deprecated. use PorterStemFilterFactory (Porter1) or SnowballPorterFilterFactory with 'English' (Porter2) instead
29.08.2012 15:19:43 org.apache.solr.request.XSLTResponseWriter <init>
WARNUNG: org.apache.solr.request.XSLTResponseWriter is deprecated. Please use the corresponding class in org.apache.solr.response
29.08.2012 15:19:43 org.apache.solr.handler.component.SpellCheckComponent inform
WARNUNG: No queryConverter defined, using default converter
29.08.2012 15:19:43 org.solrmarc.marc.MarcHandler loadReader
INFO: Attempting to open data file: /home/marcdaten/sot_utf8.mrc
29.08.2012 15:19:43 org.apache.solr.common.SolrException log
SCHWERWIEGEND: java.lang.AbstractMethodError: org.apache.lucene.analysis.TokenStream.incrementToken()Z
        at org.apache.lucene.analysis.synonym.SynonymFilter.parse(SynonymFilter.java:314)
        at org.apache.lucene.analysis.synonym.SynonymFilter.incrementToken(SynonymFilter.java:553)


What else we have to do to upgrade?

Hannah


Robert Haschart

unread,
Aug 29, 2012, 12:10:20 PM8/29/12
to solrma...@googlegroups.com
Hannah,

SolrMarc is bundled with a copy of Solr 1.4 that it uses for testing and to provide a quick and easy out-of-the-box configuration.  However SolrMarc has always had the design goal of being able to work with any version of Solr without needing to be recompiled.   There was an incompatibility with Solr 3.x in versions of SolrMarc 2.2 and earlier, however as of SolrMarc 2.3 this incompatibility should be fixed.

By changing which the config.properties  value 
 solrmarc.solr.war.path
to reference a 3.5 version of solr.war   and changing the solr.path to point to the location of your solr 3.5 config that uses ICUTokenizer you should be good to go.

If you have difficulties making this work, contact the list (or me directly) and I'll try to help.

-Bob Haschart
--
You received this message because you are subscribed to the Google Groups "solrmarc-tech" group.
To view this discussion on the web visit https://groups.google.com/d/msg/solrmarc-tech/-/Akgcmy4sc6kJ.
To post to this group, send email to solrma...@googlegroups.com.
To unsubscribe from this group, send email to solrmarc-tec...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/solrmarc-tech?hl=en.

Reply all
Reply to author
Forward
0 new messages