Precise timing of operation execution


John Lavoie

Jun 30, 2015, 12:25:56 PM
to cascadi...@googlegroups.com
Our application is very performance sensitive.  We have been trying to identify where the application is spending execution time so that we can focus on optimizing the slow parts.  We have been using hprof for low-level profiling, but we would also like to collect high-level metrics by recording the total execution time of each custom operation in our workflows.

The numbers don't seem to add up correctly though.  Is what we're trying to do impossible?  Are we implementing it wrong?

    public void operate(FlowProcess flowProcess, FunctionCall<CoresWrapper> functionCall) {
        long startTime = System.nanoTime();
        // do heavy lifting
        long endTime = System.nanoTime();
        incrementExecutionCounter(flowProcess, endTime - startTime);
    }

And these utility methods:
    protected final void incrementExecutionCounter(FlowProcess flowProcess, long value) {
        incrementCounter(flowProcess, getOperationName(), PlatformConstants.ACC_COUNTER_EXEUCTION_TIME,
                Utils.toMilliSeconds(value));
    }
    protected final void incrementCounter(FlowProcess flowProcess, String group, String counter, long value) {
        if (flowProcess instanceof HadoopFlowProcess) {
            ((HadoopFlowProcess) flowProcess).getReporter().getCounter(group, counter).increment(value);
        }
    }
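
(Utils.toMilliSeconds is not shown above; assuming it is just a nanosecond-to-millisecond conversion, it would be roughly equivalent to this sketch, not our actual Utils code:)

    // Hypothetical stand-in for Utils.toMilliSeconds: convert a System.nanoTime()
    // delta into milliseconds (requires java.util.concurrent.TimeUnit).
    protected static long toMilliSeconds(long nanos) {
        return TimeUnit.NANOSECONDS.toMillis(nanos);
    }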

What we are seeing is that the total times for our two operations in the reduce phase add up to more time than the reduce tasks were actually executing!  As an example:
Wall clock time: 2.15 hours
Map Tasks: 606
Reduce Tasks: 207
Avg Map Time: 117 seconds
Avg Reduce Time: 945 seconds

CPU_MILLISECONDS: 20.18891 days (For Reduce only)
VCORES_MILLIS_REDUCES: 16.5976 days 
Custom Operation 1: 13.34499 days
Custom Operation 2: 13.19409 days

I would have expected Custom Operation 1 + Custom Operation 2 + shuffle + sort to roughly equal one of the total reduce time metrics, but that isn't the case.  It looks like something is going wrong as these numbers are accumulated.  These aren't fast operations; the entire reduce phase averages around 0.5-1 second per tuple.

Are we doing something wrong? Or is it possible that some multi-threading is happening in the reduce tasks?
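
For what it's worth, one sanity check would be to record a call count alongside the elapsed time, so we can divide total time by call count and compare the average against the ~0.5-1 second per tuple we observe.  A sketch only; the counter name below is illustrative, not one of our real constants:

    // Sketch: count invocations next to the existing time counter so total time
    // divided by call count can be compared to the observed per-tuple latency.
    protected final void incrementExecutionCount(FlowProcess flowProcess) {
        incrementCounter(flowProcess, getOperationName(), "EXECUTION_COUNT", 1L);
    }

Calling this next to incrementExecutionCounter() in operate() would give both numbers per task.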

We are running Cascading 2.6.1 on MapR 4.0.1 (planning to upgrade very soon).

Chris K Wensel

Jun 30, 2015, 1:23:29 PM
to cascadi...@googlegroups.com
I would suggest profiling your app in Driven (http://www.cascading.org/driven/) to see where the issues are before trying to measure individual operation performance.

unfortunately Cascading 3 isn’t supported by Driven EAP just yet (getting close), but when it is, on Tez you can also enable the flight recorder to profile individual nodes.


along with the planner traces, you can pinpoint the Tez node you want to profile.

that all said, if you really think the bottlenecks are in the operations, just use Cascading local mode and a JVM profiler in your IDE. there are no Hadoop deps, so you can see your code much more easily.
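
if it helps, a minimal local-mode harness might look something like the sketch below (the class name, file names, and MyHeavyFunction are placeholders for your own operation and data, not anything from your post):

    import java.util.Properties;

    import cascading.flow.Flow;
    import cascading.flow.local.LocalFlowConnector;
    import cascading.pipe.Each;
    import cascading.pipe.Pipe;
    import cascading.scheme.local.TextLine;
    import cascading.tap.SinkMode;
    import cascading.tap.Tap;
    import cascading.tap.local.FileTap;

    public class LocalProfilingHarness {
        public static void main(String[] args) {
            // plain local-filesystem taps -- no Hadoop dependencies on the classpath
            Tap source = new FileTap(new TextLine(), "sample-input.txt");
            Tap sink = new FileTap(new TextLine(), "local-output.txt", SinkMode.REPLACE);

            // attach the operation under test; MyHeavyFunction stands in for your Function
            Pipe pipe = new Pipe("profile");
            pipe = new Each(pipe, new MyHeavyFunction());

            Flow flow = new LocalFlowConnector(new Properties()).connect(source, sink, pipe);
            flow.complete(); // run this under the IDE's JVM profiler
        }
    }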

ckw


