juicer.sh script stuck on "srun job queued and waiting for resources" and memory issues

1,369 views
Skip to first unread message

Ben Nestor

unread,
Nov 8, 2020, 6:27:21 AM11/8/20
to 3D Genomics

Hi,

I'm having trouble running the cluster juicer.sh script for HiC reads. I'm just using a subsample of practice data (genome size ~4Mb) before I use my real data (genome size ~800Mb).

I'm running juicer.sh with:

$MYSCRATCH/cluster_juicer/scripts/juicer.sh -C 7333940 -q workq -l workq -Q 1440 -L 1440 -g cherry -s HindIII -A pawsey0149 -z $MYSCRATCH/cluster_juicer/references/cherry_pilon3.fasta -y $MYSCRATCH/cluster_juicer/restriction_sites/cherry_HindIII.txt -p $MYSCRATCH/cluster_juicer/references/chrom.sizes -d $MYSCRATCH/cluster_juicer/work -D $MYSCRATCH/cluster_juicer

(lines are messed up a bit here though)

 

It stops on the output "srun: job 5609221 queued and waiting for resources". I waited a few hours and nothing happened so I hit ctrl C and the following errors appeared. I've tried this several times. Around four jobs get submitted to squeue after it starts, but they disappear after a few seconds to a minute. 

1.PNG

debug/head-5609218.err shows errors for spack in a few lines: e.g. /var/spool/slurmd/job5609218/slurm_script: line 10: spack: command not found

 

I made the following edits to juicer.sh to try and get it working

1.    Line 115. Added own load_java and commented out spack commands

2.PNG

My cluster has bwa 0.7.17 and GNU awk 4.1.0 loaded by default. The compiler is PrgEnv-cray/6.0.4 by default, but I also got the same errors when changing to PrgEnv-gnu/6.0.4 before running the script. I'm not sure how to add this compiler swap to the juicer.sh script if that's needed.

I commented out $load_bwa, $load_awk, $load_gpu anywhere else they appeared in    the script.

 

2.    Line 123. Changed queue names to workq since that is the only partition in my cluster.

 3.PNG

24 hours is the time limit for this partition so I also seached for all "time" or "-t" and changed these if they were above 24 hours.

 

3. Line 369. Changed memory to below 58G. It was 80G here previously.

4.PNG

I also searched for all "mem" in the script and changed these to 58G if they were above this (64G is the max for my cluster). I assumed 8 CPUs for the mem-per-cpu options.

 

Running the modified juicer script gives similar problems:

5.PNG

This time there are no errors in cat debug/head* though

Cat debug/split * shows that the splitting seemed to work:

6.PNG

Content of the created splits folder:

9.PNG

The HIC_tmp and aligned directories get created but are empty.

 

I figured out that this (line 576) is the line causing the program to wait at "srun job __ queued and waiting for resources":

srun -c 1 -p "$queue" -t 1 -o $debugdir/wait-%j.out -e $debugdir/wait-%j.err -d $dependsplit -J "${groupname}_wait" sleep 1  

 

But it looks like the -d $dependsplit should be fulfilled by the output in split * above. If I remove -d $dependsplit it runs without pausing, but I get several errors like "srun: error: Unable to resolve "nid01185": Unknown host" before the spam of "sbatch: error: Batch job ...". nid01185 is the node my group uses on the cluster. I tried adding -N 1 and -n 1 to the srun command to limit it to one node and task but that didn't change anything.

 

Any ideas? Sorry for the long post, but I was thinking more information is better for error solving.

Ben Nestor

unread,
Nov 8, 2020, 9:05:15 PM11/8/20
to 3D Genomics
Ok so I think it's fixed, fingers crossed though. I had to run it on a different cluster with higher memory and also remove the isRice and isBCM checks at lines 71-72 (just made them equal 0).

Aiden Lab

unread,
Nov 11, 2020, 2:37:43 PM11/11/20
to 3D Genomics
Hi Ben,
I have a Juicer configured for Zeus and Magnus cluster on Pawsey. If you want to give them a try, you can download from here: https://www.dropbox.com/s/hfwm7sjxtdi9hrd/juicer.tar.gz?dl=0
Use juicer.sh for Zeus and juicer-magnus.sh on magnus cluster. I think the only thing you need to adjust is the juiceDir="..." variable. (can be set also as a parameter)

Best,
David Weisz
Aidenlab

Ben Nestor

unread,
Nov 12, 2020, 12:39:57 AM11/12/20
to 3D Genomics
That is very convenient, thanks David! I have the pipeline running at the moment, but will give it a try if I can't the script I edited to work.

thapap...@gmail.com

unread,
Feb 11, 2021, 6:00:19 PM2/11/21
to 3D Genomics
Hi Ben.ne,

Did this work for you? If so could you pleas share it?

Best

Ben Nestor

unread,
Feb 11, 2021, 8:37:35 PM2/11/21
to 3D Genomics
Hi thapap,

Here is the script I used. All the changes I've made are labelled by #Ben. They should be explained by my original post although I might have changed some numbers or commented out different lines. It was a while ago, but I think the most important changes were running it on a cluster with higher memory and removing the isRice and isBCM checks at lines 71-72. You could also try the script David posted. Hope this helps!

Regards,
Ben
editedjuicer.sh

thapap...@gmail.com

unread,
Feb 16, 2021, 3:23:22 PM2/16/21
to 3D Genomics
David 's script does not work for me.  Thanks ben, it works great.  

Best
Reply all
Reply to author
Forward
0 new messages