Datafari 3 - NoClassDefFoundError when ingesting source code files

Minh Tran-Ngoc

3 Oct 2016, 3:46:01
to Datafari
Hello everybody,

I'm trying to set up Datafari 3 on an Ubuntu 14.04 system.

When I try to crawl my shares, Solr throws an exception when it tries to ingest source code files.

java.lang.NoClassDefFoundError: com/uwyn/jhighlight/renderer/XhtmlRendererFactory
    at org.apache.tika.parser.code.SourceCodeParser.getRenderer(SourceCodeParser.java:131)
    at org.apache.tika.parser.code.SourceCodeParser.parse(SourceCodeParser.java:108)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
    at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:228)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:69)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:155)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:2082)
    at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:651)
    at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:458)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:229)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:184)
    at org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1652)
    at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:585)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
    at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:577)
    at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:223)
    at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1127)
    at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:515)
    at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:185)
    at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1061)
    at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
    at org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:215)
    at org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:110)
    at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:97)
    at org.eclipse.jetty.server.Server.handle(Server.java:499)
    at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:310)
    at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:257)
    at org.eclipse.jetty.io.AbstractConnection$2.run(AbstractConnection.java:540)
    at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:635)
    at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:555)
    at java.lang.Thread.run(Thread.java:745)

I tried to copy jhighlight.1.0.3.rar to /opt/datafari/solr/solrcloud/FileShare/lib but it did not help.

Could someone help me?

Thanks in advance and best regards
Minh


Olivier Tavard

3 Oct 2016, 3:57:43
to Minh Tran-Ngoc, Datafari
Hi,

That folder is the correct place for your new lib. Did you stop and restart Datafari after copying it?
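
A quick way to confirm that the file in that folder really provides the class named in the NoClassDefFoundError is sketched below. It is only an illustration: the helper name `jar_provides_class` is hypothetical, and the check relies on the fact that a .jar is a zip archive, so a .rar file or a truncated download would fail it.

```python
import zipfile

# Class named in the stack trace, as it would appear inside a jar.
MISSING_CLASS = "com/uwyn/jhighlight/renderer/XhtmlRendererFactory.class"


def jar_provides_class(jar_path, class_entry=MISSING_CLASS):
    """Return True if jar_path is a readable zip/jar containing class_entry."""
    # A .jar is a zip archive; anything else (for example a .rar file)
    # cannot be opened by the JVM class loader, so report it as a miss.
    if not zipfile.is_zipfile(jar_path):
        return False
    with zipfile.ZipFile(jar_path) as jar:
        return class_entry in jar.namelist()
```

Calling `jar_provides_class(...)` on the file copied to /opt/datafari/solr/solrcloud/FileShare/lib should return True; if it returns False, the archive is either not a jar or does not contain the class.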

Best regards,

Olivier 


--
You received this message because you are subscribed to the Google Groups "Datafari" group.
To unsubscribe from this group and stop receiving emails from it, send an email to datafari+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Minh Tran-Ngoc

3 Oct 2016, 8:48:48
to Datafari, miner...@gmail.com
It's working now; I had to reboot the server.

Now I receive the following two errors:
Error: Unhandled SolrServerException: No live SolrServers available to handle this request:[http://localhost:8983/solr/FileShare
Error: Unhandled SolrServerException: org.apache.solr.common.SolrException: Not enough nodes to handle the request
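
One way to check whether the Solr node named in the first error is reachable at all is to call the ping handler of the FileShare collection. This is a minimal sketch, assuming the ping handler is enabled at its usual `/admin/ping` path; the function names `ping_url` and `solr_is_alive` are hypothetical.

```python
import http.client
import json
import urllib.request


def ping_url(base_url, collection):
    """Build the ping URL for a collection (assumes the default /admin/ping path)."""
    return f"{base_url.rstrip('/')}/{collection}/admin/ping?wt=json"


def solr_is_alive(base_url="http://localhost:8983/solr",
                  collection="FileShare", timeout=5.0):
    """Return True only if the ping handler answers with status OK."""
    try:
        with urllib.request.urlopen(ping_url(base_url, collection),
                                    timeout=timeout) as resp:
            body = json.load(resp)
    except (OSError, ValueError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error, or a non-JSON answer.
        return False
    return isinstance(body, dict) and body.get("status") == "OK"
```

If `solr_is_alive()` returns False while the crawl is running, the node is down or unresponsive rather than merely slow to index.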

Thanks for your help,
Minh

Fried Wil

27 Oct 2016, 17:17:19
to Datafari, miner...@gmail.com
Hi Minh

I get the same error when I crawl large files (> 2 GB).

Error: Unhandled SolrServerException: org.apache.solr.common.SolrException: Not enough nodes to handle the request

Did you solve your issue? It would help me too ^_^

Regards, 

Olivier Tavard

28 Oct 2016, 2:31:40
to Fried Wil, Datafari, Minh Tran-Ngoc
Hi Minh and Fried,

Could you tell us how much total RAM your server has and how much of it is allocated to Solr, please?

Best,

Olivier


Fried Wil

28 Oct 2016, 9:00:58
to Datafari, wilfried...@gmail.com, miner...@gmail.com
Hi Olivier ^_^

$ free -m
             total       used       free     shared    buffers     cached
Mem:          3965        338       3627          8         69        108
-/+ buffers/cache:        160       3804
Swap:         3813          0       3813


Is there a specific page or configuration parameter that sets the RAM dedicated to Solr? I guess it's Xms, but ...

-Xmx1024m
-Xms256m
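
For reference, -Xmx sets the maximum JVM heap and -Xms the initial heap. The sketch below puts the two flags next to the `free -m` total and a 2 GB file size. It is only arithmetic on the numbers quoted in this thread; the helper name `heap_flags_mb` is hypothetical, and whether a large file has to fit in the heap during extraction is an assumption, not something confirmed here.

```python
import re

# Conversion factors to megabytes for the JVM size suffixes k, m, g.
_UNITS_MB = {"k": 1 / 1024, "m": 1, "g": 1024}


def heap_flags_mb(jvm_opts):
    """Map 'Xmx'/'Xms' to their sizes in MB from a JVM options string.

    Only flags with an explicit k/m/g suffix are recognised.
    """
    sizes = {}
    for name, value, unit in re.findall(r"-(Xm[xs])(\d+)([kKmMgG])", jvm_opts):
        sizes[name] = int(value) * _UNITS_MB[unit.lower()]
    return sizes


heap = heap_flags_mb("-Xmx1024m -Xms256m")
total_ram_mb = 3965        # "Mem: total" column of the free -m output above
file_mb = 2 * 1024         # a 2 GB file, the size mentioned in this thread

print("max heap (MB):", heap["Xmx"])
print("share of total RAM:", round(heap["Xmx"] / total_ram_mb, 2))
print("file larger than max heap:", file_mb > heap["Xmx"])
```

With these values the maximum heap is about a quarter of the physical RAM, and a single 2 GB file is twice the size of the maximum heap.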



I attached some screenshots of the Solr RAM monitor.


Just to add some details: I need to crawl large files/archives (at least 2 GB each) from a Samba share, or from a local copy if it is impossible to crawl 4 GB files from a Samba share.


Thanks for your help


Regards, 

Capture d’écran 2016-10-28 à 08.58.19.png
Capture d’écran 2016-10-28 à 08.58.11.png