HBase indexer with Solr 5.3 error

Sofia Panagiotidi

unread,

Mar 1, 2016, 9:45:34 AM3/1/16

to HBase Indexer Users

Hello

I wrote a few months ago but I got no reply :(

I was wondering whether I can make HBase Indexer work with my Solr version 5.3.1. What I am facing is problems with the Zookeeper node structure that seems to be changed after Solr 5 and I am not sure how to overcome this.

I start my one node Solr with

solr start -c -Dsolr.directoryFactory=HdfsDirectoryFactory -Dsolr.lock.type=none -Dsolr.hdfs.home=hdfs://master:8020/user/ubuntu/solr -s node1/solr -z master:2181

and I create the HBase index as follows

hbase-indexer add-indexer --name myIndexer2 --indexer-conf ~/Desktop/indexdemo-indexer2.xml --cp solr.zk=master:2181 --cp solr.collection=sofiacollection51 --zookeeper master:2181

The indexer gets created all right, but when I try to add something to HBase, the error at the indexer server is

16/03/01 16:30:27 INFO zookeeper.ClientCnxn: Session establishment complete on server master-VirtualBox/192.168.1.44:2181, sessionid = 0x153322b8b9100a3, negotiated timeout = 30000

16/03/01 16:30:27 INFO cloud.ConnectionManager: Watcher org.apache.solr.common.cloud.ConnectionManager@71cb27d6 name:ZooKeeperConnection Watcher:master:2181 got event WatchedEvent state:SyncConnected type:None path:null path:null type:None

16/03/01 16:30:27 INFO cloud.ConnectionManager: Client is connected to ZooKeeper

16/03/01 16:30:27 INFO cloud.ZkStateReader: Updating cluster state from ZooKeeper...

16/03/01 16:30:27 ERROR indexer.DirectSolrInputDocumentWriter: Error updating Solr

org.apache.solr.common.SolrException: Collection not found: sofiacollection51

at org.apache.solr.client.solrj.impl.CloudSolrServer.getCollectionList(CloudSolrServer.java:338)

at org.apache.solr.client.solrj.impl.CloudSolrServer.request(CloudSolrServer.java:219)

at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:117)

at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:116)

at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:102)

at com.ngdata.hbaseindexer.indexer.DirectSolrInputDocumentWriter.retryAddsIndividually(DirectSolrInputDocumentWriter.java:123)

at com.ngdata.hbaseindexer.indexer.DirectSolrInputDocumentWriter.add(DirectSolrInputDocumentWriter.java:108)

at com.ngdata.hbaseindexer.indexer.Indexer.indexRowData(Indexer.java:156)

at com.ngdata.hbaseindexer.indexer.IndexingEventListener.processEvents(IndexingEventListener.java:99)

at com.ngdata.sep.impl.SepEventExecutor$1.run(SepEventExecutor.java:97)

at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)

at java.util.concurrent.FutureTask.run(FutureTask.java:262)

at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)

at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)

at java.lang.Thread.run(Thread.java:745)

^C16/03/01 16:30:27 INFO mortbay.log: Stopped SelectChann...@0.0.0.0:11060

16/03/01 16:30:28 INFO supervisor.IndexerSupervisor: IndexerWorker.EventWorker interrupted.

16/03/01 16:30:28 INFO zookeeper.ZooKeeper: Session: 0x153322b8b9100a1 closed

16/03/01 16:30:28 INFO zookeeper.ClientCnxn: EventThread shut down

16/03/01 16:30:28 INFO ipc.RpcServer: Stopping server on 41136

16/03/01 16:30:28 INFO ipc.RpcServer: RpcServer.listener,port=41136: stopping

16/03/01 16:30:28 INFO ipc.RpcServer: RpcServer.responder: stopped

16/03/01 16:30:28 INFO ipc.RpcServer: RpcServer.responder: stopping

When I check on the zookeeper's side I can see the collection "sofiacollection51" though:

[zk: master(CONNECTED) 5] ls /collections

[aliases.json, clusterstate.json, ngdata, sofiacollection51]

Any help appreciated

Sofia

Gabriel Reid

unread,

Mar 1, 2016, 10:04:09 AM3/1/16

to Sofia Panagiotidi, HBase Indexer Users

Hi Sofia,

Which build of hbase-indexer are you using? Did you build it yourself
(i.e. checked the code out of GitHub), or did you download binaries
somewhere?

If you've built it yourself, could you specify which version you've
built, and which maven profile (if any) you used when building it?

If you downloaded binaries for hbase-indexer, could you specify which
version you're using?

Thanks,

Gabriel

> --
> You received this message because you are subscribed to the Google Groups
> "HBase Indexer Users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to hbase-indexer-u...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

Sofia Panagiotidi

unread,

Mar 1, 2016, 10:11:47 AM3/1/16

to HBase Indexer Users, sof...@gmail.com

Hi Gabriel

If I remember correctly I downloaded and built it with

mvn clean install -DskipTests -Dhbase.api=0.98

My HBase version is 1.1.2 but I don't think this is of any problem

Cheers

> email to hbase-indexer-user+unsub...@googlegroups.com.

Gabriel Reid

unread,

Mar 1, 2016, 10:15:13 AM3/1/16

to Sofia Panagiotidi, HBase Indexer Users

The current master of hbase-indexer in GitHub is built against Solr 4.4.0 [1]

The first thing I would suggest trying is just setting the Solr
version in the pom file to your Solr version, and rebuilding
hbase-indexer. In the best case, this will just work and it should
resolve the issues.

It is not entirely unlikely that you will encounter compilation errors
due to changes in the Solr API though, in which case you'd need to
make some small modifications to hbase-indexer in order to allow
building it against Solr 5.3.

- Gabriel

1. https://github.com/NGDATA/hbase-indexer/blob/master/pom.xml#L22

>> > email to hbase-indexer-u...@googlegroups.com.

>> > For more options, visit https://groups.google.com/d/optout.
>

> --
> You received this message because you are subscribed to the Google Groups
> "HBase Indexer Users" group.
> To unsubscribe from this group and stop receiving emails from it, send an

> email to hbase-indexer-u...@googlegroups.com.

Sofia Panagiotidi

unread,

Mar 1, 2016, 12:30:12 PM3/1/16

to HBase Indexer Users, sof...@gmail.com

I just tried and after fixing some dependencies I got

[ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:2.0.2:compile (default-compile) on project hbase-indexer-engine: Compilation failure: Compilation failure:
[ERROR] /home/sofia/hbase-indexer/hbase-indexer-engine/src/main/java/com/ngdata/hbaseindexer/indexer/SolrServerFactory.java:[43,15] error: incompatible types
[ERROR] 
[ERROR] could not parse error message:   required: SolrServer
[ERROR] found:    CloudSolrServer
[ERROR] /home/sofia/hbase-indexer/hbase-indexer-engine/src/main/java/com/ngdata/hbaseindexer/indexer/SolrServerFactory.java:49: error: no suitable method found for add(HttpSolrServer)
[ERROR] result.add(new HttpSolrServer(shard, httpClient));
[ERROR] ^

I am not sure I would be able to get into the code and do the fixing, I might give it a try later on.

Cheers

>> > email to hbase-indexer-user+unsub...@googlegroups.com.

>> > For more options, visit https://groups.google.com/d/optout.
>
> --
> You received this message because you are subscribed to the Google Groups
> "HBase Indexer Users" group.
> To unsubscribe from this group and stop receiving emails from it, send an

> email to hbase-indexer-user+unsub...@googlegroups.com.

Ravi K

unread,

Mar 16, 2016, 12:40:28 PM3/16/16

to HBase Indexer Users

Gabriel,

Will hbase-indexer be updated anytime soon to account for recent Solr API changes?

got event WatchedEvent state:SyncConnected type:None path:null path:null type:None

2016-03-16 10:31:59,030 INFO org.apache.solr.common.cloud.ConnectionManager: Client is connected to ZooKeeper

2016-03-16 10:31:59,030 INFO org.apache.solr.common.cloud.SolrZkClient: Using default ZkACLProvider

2016-03-16 10:31:59,032 INFO org.apache.solr.common.cloud.ZkStateReader: Updating cluster state from ZooKeeper...

2016-03-16 10:31:59,040 ERROR com.ngdata.hbaseindexer.indexer.DirectSolrInputDocumentWriter: Error updating Solr

org.apache.solr.common.SolrException: Could not find collection : hpfcollection

at org.apache.solr.common.cloud.ClusterState.getCollection(ClusterState.java:162)

at org.apache.solr.client.solrj.impl.CloudSolrServer.directUpdate(CloudSolrServer.java:305)

at org.apache.solr.client.solrj.impl.CloudSolrServer.request(CloudSolrServer.java:539)

at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:124)