2017-02-07 22:56:21,500 INFO Sleeping for 5 seconds...zzz...
2017-02-07 22:56:26,592 INFO Launching a rocket!
2017-02-07 22:56:26,616 INFO No jobs exist in the LaunchPad for submission to queue!
2017-02-07 22:56:26,616 ERROR ----|vvv|----
2017-02-07 22:56:26,616 ERROR Error with queue launcher rapid fire!
2017-02-07 22:56:26,618 ERROR Traceback (most recent call last):
File "/atlas/u/jkuck/software/anaconda2/envs/anaconda_venv/lib/python2.7/site-packages/fireworks/queue/queue_launcher.py", line 216, in rapidfire
raise RuntimeError("Launch unsuccessful!")
RuntimeError: Launch unsuccessful!
2017-02-07 22:56:26,619 ERROR ----|^^^|----
It looks like the queue launcher thinks a firework is ready to launch, but then finds the queue is empty after calling launch_rocket_to_queue(). Any tips would be appreciated!
Thanks,
Jonathan
--
You received this message because you are subscribed to the Google Groups "fireworkflows" group.
To unsubscribe from this group and stop receiving emails from it, send an email to fireworkflow...@googlegroups.com.
To post to this group, send email to firewo...@googlegroups.com.
Visit this group at https://groups.google.com/group/fireworkflows.
To view this discussion on the web visit https://groups.google.com/d/msgid/fireworkflows/848dd390-ba00-4ad9-8daf-815882c89347%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
1) Can you paste the output of "lpad get_fws -s READY -d count" after the script crashes?
2) Would you mind running the script again with strm_lvl="DEBUG" and pasting the output again?
2017-02-08 10:53:34,428 INFO Job submission was successful and job_id is 1176878
2017-02-08 10:53:34,428 INFO Sleeping for 5 seconds...zzz...
2017-02-08 10:53:39,455 INFO Finished a round of launches, sleeping for 60 secs
2017-02-08 10:54:39,516 INFO Checking for Rockets to run...
2017-02-08 10:54:39,555 INFO The number of jobs currently in the queue is: 0
2017-02-08 10:54:39,555 INFO 0 jobs in queue. Maximum allowed by user: 20
2017-02-08 10:54:39,640 INFO Launching a rocket!
2017-02-08 10:54:39,647 DEBUG getting queue adapter
2017-02-08 10:54:39,733 INFO Created new dir /atlas/u/jkuck/rbpf_fireworks/block_2017-02-08-17-35-21-007249/launcher_2017-02-08-18-54-39-731710
2017-02-08 10:54:39,733 INFO moving to launch_dir /atlas/u/jkuck/rbpf_fireworks/block_2017-02-08-17-35-21-007249/launcher_2017-02-08-18-54-39-731710
2017-02-08 10:54:39,734 DEBUG writing queue script
2017-02-08 10:54:39,740 INFO submitting queue script
2017-02-08 10:54:41,842 INFO Job submission was successful and job_id is 1176879
2017-02-08 10:54:41,843 INFO Sleeping for 5 seconds...zzz...
2017-02-08 10:54:46,933 INFO Launching a rocket!
2017-02-08 10:54:46,940 DEBUG getting queue adapter
2017-02-08 10:54:46,961 INFO No jobs exist in the LaunchPad for submission to queue!
2017-02-08 10:54:46,961 ERROR ----|vvv|----
2017-02-08 10:54:46,962 ERROR Error with queue launcher rapid fire!
2017-02-08 10:54:46,965 ERROR Traceback (most recent call last):
File "/atlas/u/jkuck/software/anaconda2/envs/anaconda_venv/lib/python2.7/site-packages/fireworks/queue/queue_launcher.py", line 216, in rapidfire
raise RuntimeError("Launch unsuccessful!")
RuntimeError: Launch unsuccessful!
2017-02-08 17:11:05,635 INFO Launching a rocket!
2017-02-08 17:11:05,637 DEBUG getting queue adapter
2017-02-08 17:11:05,673 INFO No jobs exist in the LaunchPad for submission to queue!
2017-02-08 17:11:05,673 ERROR ----|vvv|----
2017-02-08 17:11:05,673 ERROR Error with queue launcher rapid fire!
2017-02-08 17:11:05,674 ERROR Traceback (most recent call last):
File "/home/kuck/.local/lib/python2.7/site-packages/fireworks/queue/queue_launcher.py", line 221, in rapidfire
raise RuntimeError("Launch unsuccessful!")
RuntimeError: Launch unsuccessful!
2017-02-08 17:11:05,675 ERROR ----|^^^|----
Best,
Jonathan
Hi Anubhav,
Correct me if I'm wrong, but I think the queue launcher is crashing before creating the launch directory. It looks like 'atlas/u/jkuck/rbpf_fireworks/block_2017-02-08-17-35-21-007249/launcher_2017-02-08-18-54-39-731710 ' is the directory created by the successful submission.The problem seems to be that somehow launchpad.run_exists(fworker) is evaluating to True in the while loop in rapidfire() in queue_launcher.py, but then false in launch_rocket_to_queue().Best,
Jonathan
To unsubscribe from this group and stop receiving emails from it, send an email to fireworkflows+unsubscribe@googlegroups.com.
To post to this group, send email to firewo...@googlegroups.com.
Visit this group at https://groups.google.com/group/fireworkflows.
To view this discussion on the web visit https://groups.google.com/d/msgid/fireworkflows/774f992d-1ece-4152-9f0d-88db91cbe1e3%40googlegroups.com.
To unsubscribe from this group and stop receiving emails from it, send an email to fireworkflow...@googlegroups.com.
To post to this group, send email to firewo...@googlegroups.com.
Visit this group at https://groups.google.com/group/fireworkflows.
To view this discussion on the web visit https://groups.google.com/d/msgid/fireworkflows/774f992d-1ece-4152-9f0d-88db91cbe1e3%40googlegroups.com.
--Best,
Anubhav