Connecting ArangoDB to other technologies?

257 views
Skip to first unread message

zachk...@gmail.com

unread,
Jul 19, 2016, 1:39:09 PM7/19/16
to ArangoDB
Hi everyone,

We are trying to integrate ArangoDB into our current setup and are wondering if:

1. Is there any connector between ArangoDB and Elasticsearch for indexing? Or must all indexing done natively in Arango?

2. Is there any connector/offloader for ArangoDB and Spark so that long-running and intensive queries can be given to a Spark cluster? (I believe there is something similar for Titan/Tinkerpop).

3. We are going to be making an API for graph querying, and we have noticed that the same question can have different performance depending on how it is constructed into a query. Is there any internal automatic query optimization?

Thanks for any and all help!

zachk...@gmail.com

unread,
Jul 19, 2016, 2:13:18 PM7/19/16
to ArangoDB
Regarding #1, I did see elasticsearch-river-arangodb but it doesn't list 3.0 compatibility. Since it was last updated before the 3.0 release, I wasn't sure if it is compatible with 3.0 or not.

Regarding #3, looks like there is automatic query optimization.

Frank Celler

unread,
Jul 19, 2016, 4:30:55 PM7/19/16
to ArangoDB
Hi everyone,

We are trying to integrate ArangoDB into our current setup and are wondering if:

1. Is there any connector between ArangoDB and Elasticsearch for indexing? Or must all indexing done natively in Arango?
You can do the standard indexing in ArangoDB.

We had a river-plugin, which unfortunately is no longer supported by Elastic. There is basic logstash plugin from Pere Urbon (https://github.com/purbon/logstash-output-arangodb). I assume that with some more effort it should be possible to get it up and running, so that you can use Elastic for full-text search.


2. Is there any connector/offloader for ArangoDB and Spark so that long-running and intensive queries can be given to a Spark cluster? (I believe there is something similar for Titan/Tinkerpop).
In principle, you can use the Foxx framework to offload queries via HTTP.  What can be done and what not, would require some more investigation. If you like, you can send more details to hackers (at) arangodb.org 

A colleague of mine is working on a Spark-connector, which will load data into Spark.


3. We are going to be making an API for graph querying, and we have noticed that the same question can have different performance depending on how it is constructed into a query. Is there any internal automatic query optimization?

There is a query optimizer. Especially with graph queries, the way the queries are formulated and the data is modeled, it can have a great impact on the performance.

Hope that helps a little to understand ArangoDB better
  Frank
Reply all
Reply to author
Forward
0 new messages