Node fail?

2,142 views
Skip to first unread message

Palle Duun Rohde

unread,
Oct 2, 2014, 6:12:25 AM10/2/14
to genome-au-c...@googlegroups.com

Hi


I am having problems with my jobs terminating because of 'Node fail'?

For example I get this message in e-mail alert:

Run time 01:09:50, NODE_FAIL, ExitCode 0


Has this anything to do with my job or the nodes? It happens to different jobs and the time when it fails seems random.


Cheers

Palle

Anders Halager

unread,
Oct 2, 2014, 6:25:53 AM10/2/14
to genome-au-c...@googlegroups.com
Can you give me one of the job ids this happened with?

Anders

Palle Duun Rohde

unread,
Oct 2, 2014, 6:30:25 AM10/2/14
to genome-au-c...@googlegroups.com


SLURM Job_id=1340896 Name=CV Failed, Run time 00:07:57, NODE_FAIL, ExitCode 0

Palle Duun Rohde

unread,
Oct 2, 2014, 6:36:33 AM10/2/14
to genome-au-c...@googlegroups.com

SLURM Job_id=1338030 Name=binaryDMU Failed, Run time 00:11:36, NODE_FAIL, ExitCode 

Anders Halager

unread,
Oct 2, 2014, 6:38:26 AM10/2/14
to genome-au-c...@googlegroups.com
Thanks, I am looking at it.

Anders

Rune Møllegaard Friborg

unread,
Oct 15, 2014, 5:01:10 PM10/15/14
to genome-au-c...@googlegroups.com

This issue has been identified and should only happen in rare cases, where communication to the node fails.

/ Rune

Reply all
Reply to author
Forward
0 new messages