Meme parallel fails on multiple nodes


Some Xyz


Sep 11, 2014, 7:22:52 AM

to meme-...@googlegroups.com

Dear all,

When I try to run MEME in parallel on 8 nodes with 16 cores each, it runs for 6 iterations and then gives the following message:

meme:14413 terminated with signal 11 at PC=428d49 SP=7fff26d2f2d0. Backtrace:

/lib64/libc.so.6(__libc_start_main+0xfd)[0x33e341ecdd]

meme[0x402ee9]

meme[0x428d49]

meme[0x4290a4]

meme[0x424bf2]
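
Signal 11 in the message above is SIGSEGV on Linux, i.e. a segmentation fault. As a minimal sketch, the signal name can be confirmed from the shell, and the backtrace addresses can be mapped to function names. The second part is an assumption: it only gives useful output if `addr2line` (GNU binutils) is installed, `meme` on the PATH is the actual compiled binary rather than a wrapper, and it was built with debug symbols.

```shell
# Translate the signal number from the crash message into its name.
sig_name=$(kill -l 11)
echo "signal 11 = SIG${sig_name}"

# Map the backtrace addresses to function names and source lines.
# Assumption: meme is on PATH and carries debug info; otherwise
# addr2line prints "??" placeholders or fails, which is ignored here.
if command -v addr2line >/dev/null 2>&1 && command -v meme >/dev/null 2>&1; then
    addr2line -f -e "$(command -v meme)" 0x428d49 0x4290a4 0x424bf2 || true
fi
```

If the addresses resolve, the innermost frame (0x428d49, the PC in the message) is the place to look first.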

Our cluster uses SGE (Grid Engine). The following script is used for job submission:

#!/bin/bash

# Script jobs # Comment

#$ -q all.q # Use specific queue

#$ -N meme_CSC # Job Name

#$ -cwd # Change to current working Directory

#$ -S /bin/bash # Job run shell

#$ -V # Export environment variables into script

#$ -j y # Merge standard out and error into one file

#$ -pe mpich 128 # MPI parallel environment; request 128 slots (8 nodes x 16 cores)

meme K27L_K4I_expanded.fa -oc K27L_expanded1 -p $NSLOTS -mod zoops -dna -nmotifs 10 -minw 5 -maxw 30 -maxsize 20000000;
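
The `-pe mpich 128` request corresponds to 8 nodes x 16 cores. A small sketch of lines that could be placed in the job script before the `meme` call to log how the scheduler actually distributed the slots. `NSLOTS` and `PE_HOSTFILE` are set by SGE inside a parallel job; the variable names `nodes`, `cores_per_node`, and `expected_slots` are made up for illustration.

```shell
# Expected slot count for the layout described in the post.
nodes=8
cores_per_node=16
expected_slots=$((nodes * cores_per_node))
echo "expected slots: ${expected_slots}"

# Inside an SGE parallel job, PE_HOSTFILE lists one host per line;
# the first two fields are the hostname and the slots granted on it.
if [ -n "${PE_HOSTFILE:-}" ] && [ -r "${PE_HOSTFILE}" ]; then
    awk '{ printf "%s: %s slots\n", $1, $2 }' "${PE_HOSTFILE}"
fi

# Warn if the scheduler granted a different number of slots.
if [ -n "${NSLOTS:-}" ] && [ "${NSLOTS}" -ne "${expected_slots}" ]; then
    echo "warning: NSLOTS=${NSLOTS}, expected ${expected_slots}" >&2
fi
```

Outside a Grid Engine job both variables are unset, so only the expected slot count is printed.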

But when I run it on a single node with 16 processors, it does not give an error.

Please tell us why MEME is failing.

CharlesEGrant


Oct 6, 2014, 9:14:48 PM

to meme-...@googlegroups.com

My first guess would be that the head node is getting killed for using too much memory. What is the default amount of memory allocated to a node? You may need to tell SGE that your job requires more memory.

You might try something like this:

#$ -l h_vmem=4G
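
How much memory that request amounts to depends on the site configuration. On many Grid Engine installations `h_vmem` is a per-slot consumable, so the value is multiplied by the number of slots on each host; that is an assumption here, not something stated in this thread. A rough sketch of the arithmetic for 16 slots per node, plus a guarded check of how the cluster defines `h_vmem`:

```shell
# Per-node memory implied by h_vmem=4G if the limit is applied per slot
# (assumption: h_vmem is configured as a per-slot consumable).
per_slot_gb=4
slots_per_node=16
per_node_gb=$((per_slot_gb * slots_per_node))
echo "h_vmem=${per_slot_gb}G x ${slots_per_node} slots = ${per_node_gb}G per node"

# Show how the cluster defines h_vmem (only runs where qconf is available).
if command -v qconf >/dev/null 2>&1; then
    qconf -sc | grep -E '^(#name|h_vmem)' || true
fi
```

If the per-node total exceeds the physical memory of a node, the job may stay queued, so the value is worth checking against the hardware before raising it.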
