I am having trouble running GiraphGraphComputer in the Gremlin Console. I have successfully installed and activated the tinkerpop.giraph plugin, and the HADOOP_GREMLIN_LIBS variable is set to the ext/giraph-gremlin/lib directory.
I am running the Gremlin Console on a VM. The graph I am trying to read loads successfully with SparkGraphComputer, but when I try to use Giraph, I get the following output:
plugin activated: tinkerpop.giraph
gremlin> graph = GraphFactory.open('conf/hadoop-graph/hadoop-script.properties')
==>hadoopgraph[scriptinputformat->graphsonoutputformat]
gremlin> g = graph.traversal(computer(GiraphGraphComputer))
==>graphtraversalsource[hadoopgraph[scriptinputformat->graphsonoutputformat], giraphgraphcomputer]
gremlin> g.V().count()
17:54:13 WARN org.apache.hadoop.util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
17:54:14 INFO org.apache.tinkerpop.gremlin.hadoop.process.computer.giraph.GiraphGraphComputer - HadoopGremlin(Giraph): TraversalVertexProgram[GraphStep([],vertex), CountGlobalStep, ComputerResultStep]
java.lang.IllegalStateException: checkLocalJobRunnerConfiguration: When using LocalJobRunner, must have only one worker since only 1 task at a time!
Here is my conf/hadoop-graph/hadoop-script.properties file:
gremlin.graph=org.apache.tinkerpop.gremlin.hadoop.structure.HadoopGraph
gremlin.hadoop.graphInputFormat=org.apache.tinkerpop.gremlin.hadoop.structure.io.script.ScriptInputFormat
gremlin.hadoop.graphOutputFormat=org.apache.tinkerpop.gremlin.hadoop.structure.io.graphson.GraphSONOutputFormat
gremlin.hadoop.jarsInDistributedCache=true
gremlin.hadoop.inputLocation=data/mygraph.txt
gremlin.hadoop.scriptInputFormat.script=data/script-input-tinkerpop.groovy
gremlin.hadoop.outputLocation=output
#####################################
# GiraphGraphComputer Configuration #
#####################################
giraph.minWorkers=1
giraph.maxWorkers=2
giraph.useOutOfCoreGraph=true
giraph.useOutOfCoreMessages=true
mapred.map.child.java.opts=-Xmx1024m
mapred.reduce.child.java.opts=-Xmx1024m
giraph.numInputThreads=4
giraph.numComputeThreads=4
# giraph.maxPartitionsInMemory=1
# giraph.userPartitionCount=2
####################################
# SparkGraphComputer Configuration #
####################################
spark.master=local[4]
# spark.master=yarn-client
spark.executor.memory=1g
spark.serializer=org.apache.spark.serializer.KryoSerializer
# spark.kryo.registrationRequired=true
# spark.storage.memoryFraction=0.2
spark.eventLog.enabled=true
spark.eventLog.dir=tmp/spark-event-logs
# spark.ui.killEnabled=true
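Reading the exception text against this file, giraph.maxWorkers=2 stands out, since the message asks for only one worker when LocalJobRunner is in use. A variant of the Giraph section to try, assuming that setting is what the check rejects (an untested guess based only on the error message, not a confirmed fix):

```properties
# Assumption: LocalJobRunner accepts a single worker, so cap both bounds at 1.
giraph.minWorkers=1
giraph.maxWorkers=1
```

The remaining settings would stay as they are; running with more than one worker would presumably require a real or pseudo-distributed Hadoop cluster rather than the local job runner.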
Has anyone else been having similar issues or figured out how to run the GiraphGraphComputer?