I would like to determine exactly how long the execution of my workflow takes, not including loading/parsing the data. Is it possible to load the input data first and then time only the algorithm? In this case, I am running kmeans with CSV files as input.
Thanks!
--
You received this message because you are subscribed to the Google Groups "stratosphere-dev" group.
One caveat though!
Stratosphere currently pipelines data between operators. The source and the operator that follows it always run concurrently, so reading the input still influences the successor operator's measured time. You can see this in the execution time breakdown (accessible after the program has completed), where the operator runtimes overlap.
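
To illustrate the caveat outside of Stratosphere, here is a minimal plain-Python sketch. It does not use any Stratosphere API; `read_records`, `algorithm`, and `timed` are hypothetical names, and the `sleep` call only stands in for CSV reading/parsing cost. It compares timing a consumer fed by a lazy (pipelined) source against timing the same consumer on data that was fully loaded beforehand.

```python
import time


def read_records(n, delay=0.001):
    """Hypothetical lazy source: yields one record at a time.

    The sleep is an assumption that mimics per-record read/parse cost.
    """
    for i in range(n):
        time.sleep(delay)
        yield float(i)


def algorithm(records):
    """Stand-in for the real computation (here just a sum)."""
    total = 0.0
    for r in records:
        total += r
    return total


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


# Pipelined: the algorithm pulls records while they are being read,
# so its measured time includes the reading time.
piped_result, piped_secs = timed(algorithm, read_records(200))

# Materialized: load everything first, then time only the algorithm.
data, load_secs = timed(list, read_records(200))
mat_result, algo_secs = timed(algorithm, data)

print(f"pipelined algorithm time: {piped_secs:.4f}s")
print(f"load time: {load_secs:.4f}s, algorithm-only time: {algo_secs:.4f}s")
```

In the pipelined case the reading cost shows up inside the algorithm's measurement, which is the same effect as the overlapping operator runtimes described above. Only when the input is fully materialized before the timer starts does the measurement reflect the algorithm alone.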