Hi All
Can another advise the possibilities of me encountering the error message as below when submitting a job ?
sbatch: error: memory allocation failure
The same script use work perfectly fine until I include #SBATCH --nodelist=(compute[015-046]) (once removed it work as it should)
The issues
Any suggestion
Thanks
Mike
Mike,
You don't include your entire sbatch script, so it's really hard to say what's going wrong when we only have a single line to work with. Based on what you have told us, I'm guessing you are specifying a memory requirement per node greater than 128000. When you specify a nodelist, Slurm will assign your job to all of those nodes, not a subset that matches the other job specifications (--mem or --mem-per-cpu, or --tasks, etc.):
- -w, --nodelist=<node name list>
- Request a specific list of hosts. The job will contain all of these hosts and possibly additional hosts as needed to satisfy resource requirements.
Prentice