Dear Nextflow,
A performance issue occurred on the server for my nextflow pipeline. My pipeline was very simple, it ran bcftools against 1300 chr21 vcfs with -resume option. When about 200 samples had finished, I observed the vm for submitting job was not responsive. The active java process for nextflow was using 171 threads. I had to stop my pipeline then VM was responsive.
here is the process:
process BCFtoolsIsec {
cache true
cpus 3
executoe 'lsf'
queue 'largemem'
scratch '/data'
stageInMode 'copy'
stageOutMode 'move'
clusterOptions '-M 5G -e./log.err -o ./log.out -R "select[avx2]"'
input:
set id, file(a_vcf), file (b_vcf) from map_ch
script:
"""
source activate /path/vcf-env/
bcftools view -s ${id} $a_vcf > ${id}.a.vcf
bcftools view -s ${id} $b_vcf > ${id}.b.vcf
more commands
"""
}
Thank you for any help you can provide,
Xin