Spark OLAP Traversing is able to find the table I specified in properties.

249 views
Skip to first unread message

prasad.d...@gmail.com

unread,
Jan 15, 2018, 6:57:08 PM1/15/18
to Gremlin-users
Hi,

I am new to JanusGraph and trying to do OLAP traversal on a Graph.
I dont understand why gremlin could not find the table (created with JanusGraphFactory) even when I specified 
table with property storage.hbase.table and still looking for default graph janusgraph.

My properties:
==========
cat /home/hadoop/spark-janus-hbase-axe-indv.properties

gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph

gremlin.hadoop.graphInputFormat=org.janusgraph.hadoop.formats.hbase.HBaseInputFormat

storage.backend=hbase

cache.db-cache = true

cache.db-cache-clean-wait = 20

cache.db-cache-time = 180000

cache.db-cache-size = 0.5

storage.hbase.table=spark-axe-janusgraph



storage.hostname= 172.31.40.228


spark.master=local[*]

spark.executor.memory=1g

spark.driver.cores=1

spark.submit.deployMode=client

spark.executor.instances=4



Gremlin Console

================

gremlin> hadoopGraph =GraphFactory.open('/home/hadoop/spark-janus-hbase-axe-indv.properties')

==>hadoopgraph[hbaseinputformat->no-writer]


gremlin> hadoopGraph.traversal().withComputer(SparkGraphComputer).V()

23:21:29 WARN  org.apache.tinkerpop.gremlin.spark.process.computer.SparkGraphComputer  - No gremlin.hadoop.graphWriter property provided and thus, persistence options are unknown -- assuming all options are possible

org.apache.hadoop.hbase.TableNotFoundException: Table 'janusgraph' was not found, got: hypergraph.

Type ':help' or ':h' for help.

Display stack trace? [yN]



Any help is much appreciated. Thanks in advance....



Regards,

Prasad Dokuparthi.


HadoopMarc

unread,
Jan 16, 2018, 8:19:15 AM1/16/18
to Gremlin-users
Hi Prasad,

Nothing seems wrong, so maybe check:

config = new PropertiesConfiguration('/home/hadoop/spark-janus-hbase-axe-indv.properties')
config.getProperty('storage.hbase.table')

Cheers,     Marc

Op dinsdag 16 januari 2018 00:57:08 UTC+1 schreef prasad.d...@gmail.com:

prasad.d...@gmail.com

unread,
Jan 16, 2018, 11:43:35 AM1/16/18
to Gremlin-users
Thanks a lot for your reply.

The configuration I am giving seems to be reflecting but not working.

gremlin> graph = GraphFactory.open('/home/hadoop/spark-janus-hbase-axe-indv.properties')

==>hadoopgraph[hbaseinputformat->no-writer]

gremlin> 

gremlin> graph.configuration()

==>[spark.executor.memory,1g]

==>[spark.master,local[*]]

==>[spark.driver.cores,1]

==>[gremlin.graph,org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph]

==>[cache.db-cache-size,0.5]

==>[spark.submit.deployMode,client]

==>[cache.db-cache-clean-wait,20]

==>[cache.db-cache,true]

==>[storage.backend,hbase]

==>[storage.hostname,172.31.47.151]

==>[gremlin.hadoop.graphReader,org.janusgraph.hadoop.formats.hbase.HBaseInputFormat]

==>[storage.hbase.table,spark-axe-janusgraph]

==>[gremlin.hadoop.inputLocation,/user/hbase/data/default/spark-axe-janusgraph]

==>[spark.executor.instances,4]

==>[cache.db-cache-time,180000]

gremlin> 

gremlin> 


Regards,
Prasad.

HadoopMarc

unread,
Jan 16, 2018, 11:56:38 AM1/16/18
to Gremlin-users
OK, maybe try to add the following property (also present in the Cassandra example in conf/hadoop-graph):

gremlin.hadoop.inputLocation=none

HTH,   Marc

Op dinsdag 16 januari 2018 17:43:35 UTC+1 schreef prasad.d...@gmail.com:

prasad.d...@gmail.com

unread,
Jan 16, 2018, 4:01:01 PM1/16/18
to Gremlin-users
Hi Marc,

Thanks a lot, I got it worked with 

janusgraphmr.ioformat.conf.storage.hbase.table=spark-acxiom-janusgraph


But I am surprised to see that JanusGraph is not picking with storage.hbase.table 
and there is no good documentation in one place to explain how configuration should be set up to make gremlin work with spark.

There is tons of documentation present to traverse, process and do operations on the stupid default JanusGraph with 6 vertices.
but what to do with that if every user is facing trouble on how to setup gremlin configuration , create a userdefined graph and open it and traverse it. 

I cant even find a sample code to load data to spark with InputRDD menthod anywhere.

Sorry I dont want to be rude, but series of exceptions and not getting help is causing frustration. 
Thanks a lot for your help.

Regards,
Prasad.

HadoopMarc

unread,
Jan 17, 2018, 8:51:39 AM1/17/18
to Gremlin-users
Hi Prasad,

Your comments are fair enough. The thing is, many developers familiar with the old Titan-0.5.x, have had to wait very long for the JanusGraph releases and are eager to update their own projects. Nevertheless, everyone should realize that the JanusGraph community needs new users to keep it healthy on the long term. I am busy right now to bulkload the Friendster graph (https://snap.stanford.edu/data/com-Friendster.html) into JanusGraph and hope to blog about it in the near future and provide users with a reference big data graph example.

The janusgraphmr.ioformat.conf prefix for hadoop is tricky indeed and I feel bad that I missed that one.


Cheers,    Marc


Op dinsdag 16 januari 2018 22:01:01 UTC+1 schreef prasad.d...@gmail.com:
Reply all
Reply to author
Forward
0 new messages