I use the WCC code in the examples folder and run the connected components algorithm on a road-network. There are 3774768 vertices and 16522438 edges, the max degree is 9. Therefore, during the computation, there are not so many msgs need to transmit across the machines. However, I notice the minimum superstep time is around 3 seconds, even though there are less than 100 msgs need to transmit. Overall, GPS needs 17502.681 seconds to run CC on my graph. I also run the algorithm in graphlab, it only needs 2982.3 to complete the computation. I also notice the minimum superstep time in Giraph is also high, about 2~3 seconds, Giraph finishes the computation in 9518.985s.
I wonder whether there is some constant overhead during each superstep, is this a common problem for JAVA implementation of pregel?
Thanks a lot. Have a good time in your trip.