Questions about adding vertices and maintaining a global value


肖俊鹏

Nov 25, 2013, 9:29:28 AM
to graph...@googlegroups.com
Hi,
I would be very grateful for any advice on the following.
  • I need to load papers and novels into GraphLab as a word graph, where each distinct word is a single vertex and the value of that vertex is the number of times the word appears. I have searched this group and the documentation but did not find a solution. I can write a line_parser, but I don't know how to handle duplicate vertices (the same word appearing again) or how to change a vertex's value while parsing. I know I could pre-process the files, but I would like to know whether there is a way to solve this without pre-processing.
  • The other question is about global values. I want to maintain a global value during the GAS (gather-apply-scatter) procedure. The only way I have found is map_reduce_vertices<T>, but that function can only be used after the GAS procedure has finished. To use it, I have to split my algorithm into several separate runs, which slows it down because vertices can no longer be signaled continuously. Is there a good way to maintain a global value while the GAS procedure is running?
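To make the first question concrete, here is a minimal sketch of the behaviour I am after, written in plain C++ with no GraphLab calls at all. The idea is to derive the vertex ID from a hash of the word, so that every occurrence of a word maps to the same vertex, and to combine duplicate records by summing their counts. All names here (WordVertex, word_to_id, parse_line, merge_into) are hypothetical and only for illustration; std::unordered_map stands in for the graph's vertex storage, and hash collisions between different words are only detected by an assert, not handled.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <sstream>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical vertex payload: the word and how often it has been seen.
struct WordVertex {
    std::string word;
    std::uint64_t count = 0;

    WordVertex() = default;
    WordVertex(std::string w, std::uint64_t c) : word(std::move(w)), count(c) {}

    // Combine two partial records for the same vertex by summing counts.
    // Summation is commutative and associative, so the order in which
    // duplicates arrive does not matter.
    WordVertex& operator+=(const WordVertex& other) {
        // Assumption: two different words never hash to the same ID.
        assert(word.empty() || other.word.empty() || word == other.word);
        if (word.empty()) word = other.word;
        count += other.count;
        return *this;
    }
};

using VertexId = std::uint64_t;
// Stand-in for the graph's vertex storage (not a GraphLab type).
using VertexTable = std::unordered_map<VertexId, WordVertex>;

// Derive the vertex ID from the word itself, so the same word always
// maps to the same vertex no matter which file or line it came from.
VertexId word_to_id(const std::string& word) {
    return static_cast<VertexId>(std::hash<std::string>{}(word));
}

// Stand-in for a line parser: split on whitespace and fold every
// occurrence of a word into its vertex record.
void parse_line(const std::string& line, VertexTable& table) {
    std::istringstream in(line);
    std::string word;
    while (in >> word) {
        table[word_to_id(word)] += WordVertex(word, 1);
    }
}

// Merge the partial table built by one parser (for example, one machine)
// into another, again by summing the counts of duplicate vertices.
void merge_into(VertexTable& dst, const VertexTable& src) {
    for (const auto& entry : src) {
        dst[entry.first] += entry.second;
    }
}
```

For example, parsing "the cat saw the dog" into one table and "the dog" into another, then calling merge_into on the two, leaves a single vertex for "the" with a count of 3. What I don't know is how to get this "combine on duplicate" behaviour when the vertices are added to the distributed graph from a line parser.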

BTW, it is a little hard to debug on a distributed platform. If there is a good way to debug, please let me know.

Thanks for taking the time to read this long post. Looking forward to your reply.

肖俊鹏

Nov 26, 2013, 8:45:36 AM
to graph...@googlegroups.com
I'm trying to enable GraphLab snapshots, but I cannot find the snapshot files during or after processing. The GraphLab version is 2.2 (or maybe 2.1). Here is the core code related to snapshots:
clopts.attach_option("snapshot_interval", interval, "interval");
clopts.attach_option("snapshot_path", snappath, "path");
graphlab::synchronous_engine<testclass> engine(dc, graph, clopts);
And these are the command-line options I use:
--snapshot_interval 0 
--snapshot_path="hdfs://master:54322/user/dummy/snap_out"
Do I need to set anything else to make this work?