Hi everyone,
I want to build the following pipeline (I'm prototyping right now):
Description:
Data -> Map/Reduce Giraph Pipeline -> graph -> Neo4j -> application
The system would run nightly, rebuilding and replacing the graph. The result would then be loaded into a graph DB so that the application layer can query it.
The graph currently has 20 million vertices and 200 million edges. It is in edge-list format, with string vertex IDs and key/value pair data on the edges (one of the pairs is the edge type). The application layer only reads from the graph.
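For illustration, one edge-list line of this shape could be parsed as sketched below. The tab separator, the key=value encoding of the edge data, and the function name parse_edge are assumptions made for the sketch; the real file layout may differ.

```python
from typing import Dict, Tuple


def parse_edge(line: str) -> Tuple[str, str, Dict[str, str]]:
    """Parse one edge-list line into (source_id, target_id, properties).

    Assumed (hypothetical) layout: source<TAB>target<TAB>key=value<TAB>key=value...
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 2:
        raise ValueError(f"expected at least source and target: {line!r}")

    source, target = fields[0], fields[1]

    # Remaining fields are the key/value pairs attached to the edge.
    props: Dict[str, str] = {}
    for field in fields[2:]:
        key, sep, value = field.partition("=")
        if not sep:
            raise ValueError(f"malformed key/value pair: {field!r}")
        props[key] = value

    return source, target, props


if __name__ == "__main__":
    # Made-up sample line, only to show the shape of the result.
    print(parse_edge("user:42\titem:7\ttype=BOUGHT\tweight=3\n"))
```

The edge type ends up as an ordinary entry in the properties dict (here under the assumed key "type"), so an import step could pick it out to use as the relationship type.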
Here are my questions:
1. Is Neo4j the right tool for the job? (I have no updates, no transactions but lots of queries)
2. What is the best way to import the data into Neo4j? (I have heard the batch import can be slow for large data, and this would be a bottleneck.)
3. Is there a simple online query tool I can hand to the application developer to "browse" the graph?
Cheers,
Matin