run scc in GPS


Yi Lu

May 17, 2014, 3:30:36 AM
to stanford...@googlegroups.com, ya...@ust.hk
Hi Everyone,

I found an SCC package in the GPS source code. Could anyone please tell me how to run the state-of-the-art implementation of SCC (with all the optimization techniques) from the following paper?


Which parameters do I need to specify to run it in GPS?

Thanks.

Semih Salihoglu

May 17, 2014, 8:50:36 PM
to Yi Lu, stanford...@googlegroups.com, ya...@ust.hk
Hi Yi,

You should use the Master and Vertex classes under the scc.hybrid package and pass the fbts (finish backward traversals serially) and frss (finish remaining SCCs serially) options. Then, with nefsc (num edges for serial computation), you can choose when to trigger the serial computation. You can start with the default value, which is 1M edges, and then experiment from there.

Best,

semih



Yi Lu

May 18, 2014, 1:32:11 AM
to stanford...@googlegroups.com, Yi Lu, ya...@ust.hk, se...@stanford.edu
Hi Semih,

Thanks for your reply. 

I am using the following parameters. Are they correct?


./start_gps_nodes.sh 120 scc-pokec \
"-ifs /pullgel/pokec/part_* \
-hcf /usr/local/hadoop/conf/core-site.xml \
-jc gps.examples.scc.hybrid.JobConfiguration \
-mcfg /gps/machine8.cfg \
-pt 10 \
-log4jconfig /home/yanda/gps/conf/log4j.config \
-other fbts###true###frss###true###nefsc###1000000"
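
The -other value above packs option names and values into a single string, with ### between every token. As a minimal sketch, assuming only that format, the string could be assembled from key=value pairs in bash, which may help when trying several nefsc values. The join_other helper is hypothetical and not part of GPS.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of GPS): joins key=value pairs into the
# "key###value###key###value" form used by the -other flag in this thread.
join_other() {
  local out="" kv
  for kv in "$@"; do
    # ${kv%%=*} is the text before the first "=", ${kv#*=} the text after it.
    out+="${out:+###}${kv%%=*}###${kv#*=}"
  done
  printf '%s\n' "$out"
}

other_opts=$(join_other fbts=true frss=true nefsc=1000000)

# Print the flag so it can be pasted into the start_gps_nodes.sh command.
printf '%s\n' "-other ${other_opts}"
```

Changing nefsc=1000000 to another edge count is then a one-token edit rather than a hand-edited string.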


By the way, what input format is used? Is it the same as the "adj" format in GraphLab? http://docs.graphlab.org/graph_formats.html


On Sunday, May 18, 2014 at 8:50:36 AM UTC+8, Semih Salihoglu wrote:

Yi Lu

May 18, 2014, 3:31:45 AM
to stanford...@googlegroups.com, Yi Lu, ya...@ust.hk, se...@stanford.edu
Hi Semih,

I have another question. Is there a parameter I can set to run a fixed number of rounds of the forward and backward labeling algorithm? For example, I want to run 4 rounds of labeling, followed by 1 round of the single-machine algorithm.

Thanks

On Sunday, May 18, 2014 at 8:50:36 AM UTC+8, Semih Salihoglu wrote:

Semih Salihoglu

May 18, 2014, 10:54:09 PM
to Yi Lu, stanford...@googlegroups.com, ya...@ust.hk
Hi Yi,

Your setup looks correct to me, and the input is in adjacency list format. You can't specify the number of forward (fw) and backward (bw) traversals, but you can cap the total number of supersteps with msstr (max supersteps to run). Alternatively, you can easily change the code to make the algorithm run a fixed number of fw and bw traversals.
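
As a rough sketch of the msstr suggestion: the snippet below extends the -other string from the earlier command with a superstep cap. It assumes that msstr is passed through -other in the same ### form as fbts, frss, and nefsc, which this thread does not confirm, and the value 30 is purely illustrative.

```shell
#!/usr/bin/env bash
# Hypothetical cap: the thread names msstr but does not suggest a value.
max_supersteps=30

# Assumption: msstr goes through -other like the other SCC options.
other_opts="fbts###true###frss###true###nefsc###1000000###msstr###${max_supersteps}"

# Print the flag instead of launching GPS, so this runs without a cluster.
printf '%s\n' "-other ${other_opts}"
```

The printed line would replace the -other line of the start_gps_nodes.sh command. Note that msstr limits total supersteps, not the number of fw and bw rounds, so it only approximates "4 rounds of labeling, then serial".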

Hope this helps,

semih 

Yi Lu

May 18, 2014, 11:03:51 PM
to stanford...@googlegroups.com, Yi Lu, ya...@ust.hk, se...@stanford.edu
Hi Semih,

Could you please give me a hint about which file I should modify? There are so many files under the SCC package.

Again, thanks a lot.

On Monday, May 19, 2014 at 10:54:09 AM UTC+8, Semih Salihoglu wrote:

Semih Salihoglu

May 18, 2014, 11:11:32 PM
to Yi Lu, stanford...@googlegroups.com, ya...@ust.hk
Probably SCCBaseMaster.

semih