I'm trying to run a script using scald.rb. It seems to rsync the jar and then hang while compiling the job. Has anyone seen this behavior before? Here is the output:[INFO] Found Job Class: BuildBFLModelCompleteConnecting to adsnest3.smf1.twitter.com13/05/23 14:56:09 INFO util.HadoopUtil: resolving application jar from found main method on: com.twitter.scalding.Tool$13/05/23 14:56:09 INFO planner.HadoopPlanner: using application jar: /home/mmiller/ads-batch-deploy.jar13/05/23 14:56:09 INFO property.AppProps: using app.id: 9397446BB6825C965870E7D7DA3EE207After this point it will sit there for literally 1 hour.
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
--
You received this message because you are subscribed to a topic in the Google Groups "cascading-user" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/cascading-user/jsG9NZ5w1VI/unsubscribe?hl=en.
To unsubscribe from this group and all its topics, send an email to cascading-use...@googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
Can you share a test job with Chris? My guess is there is some bad scaling of some scheduling algorithm that only shows up in some graphs.
13/06/07 14:33:54 INFO uc.UCGenerator: Fetching type2 events :: [/var/tmp/idm/localtest-type2]13/06/07 14:42:00 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId=
sorry, I cannot provide any help on this without it being a raw cascading application I can debug. hopefully Nathan or Sam can jump in.ckw
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user.
I have created a small test to simulate this scenario. I am using Jcascalog on top of cascading.Code is in my github: https://github.com/sourabhchaki/cascalog-cascading-test/blob/master/src/main/java/com/home/test/CascadingTestInJcascalog.java.
Looping Nathan as this code is written in Jcascalog.Here I am doing self join for the same input for a given depth. For every depth cascading creates 2 jobs. Thus by changing depth I was able check the preparation time for multiple cascading jobs. Here I can see cascading job preparation time is increasing in non linear fashion as number of jobs increases.Execution steps for this test is here.depth=5,step 10, time taken:1 sec[17/06/2013:14:13:19 IST] [INFO] [cascading.property.AppProps main]: using app.id: FC106638099703F5450E89B08BB7442F[17/06/2013:14:13:20 IST] [INFO] [cascading.util.Version flow]: Concurrent, Inc - Cascading 2.0.0[17/06/2013:14:13:20 IST] [INFO] [cascading.flow.Flow flow]: [] starting......[17/06/2013:14:13:20 IST] [INFO] [cascading.flow.Flow flow]: [] starting jobs: 10[17/06/2013:14:13:20 IST] [INFO] [cascading.flow.Flow flow]: [] allocating threads: 1[17/06/2013:14:13:20 IST] [INFO] [cascading.flow.FlowStep pool-1-thread-1]: [] starting step: (6/10)depth=10, steps: 20: Time taken: 15 mins.[17/06/2013:14:14:50 IST] [INFO] [cascading.property.AppProps main]: using app.id: 264A79523E9A9AF21EB04D2814FBCF9F[17/06/2013:14:29:54 IST] [INFO] [cascading.util.Version flow]: Concurrent, Inc - Cascading 2.0.0[17/06/2013:14:29:54 IST] [INFO] [cascading.flow.Flow flow]: [] starting.....[17/06/2013:14:29:54 IST] [INFO] [cascading.flow.Flow flow]: [] starting jobs: 20I tried with depth =15, so jobs= 30, and waited for 1 hrs but the application never started.Hope this will help you to investigate the problem.Let me know if you need any more details.ThanksSourabh
On Friday, 24 May 2013 03:58:32 UTC+5:30, Oscar Boykin wrote:
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user.
Anyone else see cascading take a long time in flow planning phase?
We've seen some very long cases when the graph gets up to 30-35 steps.
--Oscar Boykin :: @posco :: http://twitter.com/posco
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
Hi Sam,I have generated dot files doing the following steps://Api.execute(new StdoutTap(), newMappings);Flow flow = Api.compileFlow(new StdoutTap(), newMappings);flow.writeDOT(depth*2+"jobs.dot");Dot files for 10jobs and 20jobs are attached.Here also I can see the same problem. For 20jobs it is taking 15 minutes to generate the dot file. Where as for 10jobs it is taking less than 1 sec to generate this dot file. I have generated these dot files in local. I saw the same result in linux machine as well.I am new to Jcascalog and yet to get experience in cascading. It will be really helpful if you can convert my example to cascading only if Chris need that for debugging.Thanks in advance.Regards,Sourabh
On Friday, May 24, 2013 3:58:32 AM UTC+5:30, Oscar Boykin wrote:
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user.
For more options, visit https://groups.google.com/groups/opt_out.
Anyone else see cascading take a long time in flow planning phase?
We've seen some very long cases when the graph gets up to 30-35 steps.
--Oscar Boykin :: @posco :: http://twitter.com/posco
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
Did any one of you get a chance to look into it?
ThanksSourabh
On Friday, 24 May 2013 03:58:32 UTC+5:30, Oscar Boykin wrote:
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user.
For more options, visit https://groups.google.com/groups/opt_out.
Anyone else see cascading take a long time in flow planning phase?
We've seen some very long cases when the graph gets up to 30-35 steps.
--Oscar Boykin :: @posco :: http://twitter.com/posco
--
You received this message because you are subscribed to the Google Groups "cascading-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cascading-use...@googlegroups.com.
To post to this group, send email to cascadi...@googlegroups.com.
Visit this group at http://groups.google.com/group/cascading-user?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/f506c2e4-44e8-4d82-9144-d2471637bd1b%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/2bc09db2-3f33-42d6-9f43-239cf04fd996%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/DAA38007-794A-4EFD-84F6-B7E7D2A1F0B1%40wensel.net.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/bb63ba24-ea99-4a9c-8e71-183487f0fc83%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
...<span class="Apple-style-span" style="border-collapse: separate; color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: norm
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/47a44e67-b3ce-44c9-97de-46301491012d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/cascading-user/FB1F496E-CB0F-40A6-9E0B-5904E078EB10%40wensel.net.
For more options, visit https://groups.google.com/d/optout.
...