Errors using 1864 (qsub failed with code 32512)

alkalo...@gmail.com

Jul 13, 2016, 6:13:19 AM
to grid-control
Dear Fred

I am using the latest version and have resolved all the issues I had with my grid certificate, but running now produces this error:


All logfiles were moved to /nfs/dust/cms/user/alkaloge/grid-jobs/gc_Data80X_METtest/error.tar
2016-07-13 12:11:12 - Job 50314  state changed from INIT to FAILED
2016-07-13 12:11:13 - wms.local:WARNING - /usr/sge/bin/lx-amd64/qsub failed:
2016-07-13 12:11:13 - WARNING: qsub failed with code 32512
2016-07-13 12:11:13 -
Unable to read script file because of error: error opening site: No such file or directory
/bin/sh: line 1: os: command not found
/bin/sh: line 2: h_rt: command not found
/bin/sh: line 3: h_vmem: command not found

Any help is highly appreciated.

Regards

Alexis

Fred Stober

Jul 13, 2016, 12:26:42 PM
to grid-control
Hi,

There were some issues with the NAF batch system scheduler today.
However, I have just successfully run docs/example/Example02_local.conf on nafhh-cms03 with the latest stable release, r1864.
Have you tried it again since then?

Cheers,
Fred

alkalo...@gmail.com

Jul 13, 2016, 1:36:40 PM
to grid-control
Hi
No, still the same issue. I can indeed run Example02, but that is a far simpler config. The config I am using is located here:


/nfs/dust/cms/user/alkaloge/TauAnalysis/new/new/CMSSW_8_0_12/src/DesyTauAnalyses/NTupleMaker/test/gc_Data80X.config

Can you maybe have a look?

Thanks

Alex

Fred Stober

Jul 14, 2016, 8:10:33 AM
to grid-control
Instead of:

submit options = 
        site => hh
        os => sld6
;       h_rt => 167:59:00
        h_rt => 5:59:00 
        h_vmem => 4000M

you should use:

[jobs]
memory = 4000
wall time = 5:59

"os=sld6" has been the NAF default since 29.10.2015, and the "site" option has had no effect since the transition to NAF 2.0.
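
On the code in the subject line: assuming the number reported is the raw wait status of the shell that ran qsub (an assumption, not something confirmed in this thread), 32512 equals 127 << 8, i.e. exit code 127, which /bin/sh returns for "command not found". That would match the log above, where the later lines of the multi-line submit options ("os", "h_rt", "h_vmem") appear to have been run as separate shell commands. A minimal sketch of the decoding; decode_wait_status is a hypothetical helper, not part of grid-control:

```python
def decode_wait_status(status):
    """Split a raw 16-bit POSIX wait status into (exit_code, signal_number).

    Assumption: `status` uses the traditional encoding, with the exit code
    in the high byte and the terminating signal in the low 7 bits.
    """
    exit_code = (status >> 8) & 0xFF   # high byte: exit code of the process
    signal_number = status & 0x7F      # low 7 bits: signal number, 0 if none
    return exit_code, signal_number


exit_code, signal_number = decode_wait_status(32512)
print(exit_code, signal_number)  # prints: 127 0
```

So the failure probably says more about how the command line was assembled than about qsub or the scheduler itself.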

Cheers,
Fred