Failed to load done file.

Mark Miller

Jul 31, 2017, 8:32:10 AM
to genie
I have an EMR Spark job that I trigger from Genie. The job runs fine from Data Pipeline, but when launched from Genie it frequently fails with nothing more than a status message reporting:

    Failed to load done file.

Looking at the log files, it even seems like the job finished successfully. I can find literally nothing online or in the docs that explains what to do. All I find when searching is references to the source code.

Mark Miller

Jul 31, 2017, 8:33:04 AM
to genie
Forgot to mention: this is Genie 3.0.7.


Marco Primi

Jul 31, 2017, 12:43:57 PM
to Mark Miller, genie
I think we saw this error a few days ago, but after digging a bit further it turned out to be due to the `ps` command not being installed in the Docker image (that’s something you can check; it’s been fixed).
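
One quick way to check for this is to look for `ps` on the `PATH` inside the image. The following is only a minimal sketch, assuming Python is available in the container; the helper name `missing_commands` is hypothetical and not part of Genie:

```python
import shutil


def missing_commands(commands=("ps",)):
    """Return the entries of `commands` that cannot be found on PATH."""
    # shutil.which returns None when no executable with that name is found.
    return [cmd for cmd in commands if shutil.which(cmd) is None]


if __name__ == "__main__":
    missing = missing_commands()
    if missing:
        print("Missing from PATH: " + ", ".join(missing))
    else:
        print("All required commands found.")
```

If `ps` is reported missing, installing the package that provides it (on Debian-based images this is typically `procps`) should be enough to rule this cause out.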

If this is a different issue, it’s probably platform-specific, since we haven’t seen it in our environments (which run thousands of jobs a day), so we’ll need some more info to see what’s going on.
Can you send the full stack trace (plus any errors you might see earlier in the logs)?

Thanks,
M.



Mark Miller

Jul 31, 2017, 5:18:26 PM
to genie
We upgraded to 3.0.11 (latest version with the fix) and that seems to have eliminated the error. We're getting other errors now, but I have to look into them first to determine what's happening. Thank you.