recalibrate: gatk running on a single CPU thread for multiple samples

34 views
Skip to first unread message

Ivan De Dios

unread,
Aug 12, 2016, 10:16:15 PM8/12/16
to biovalidation
Hi Brad,

I noticed gatk base-recalibrator starting up on only a single thread for a run with 96 bam files. This is from a previously failed run where variant calling finished for vardict and platypus but mutect2 failed due to "malformed header" errors in the bam files. So I adjust the yaml config to include realignment and recalibration for gatk.

I ran it as bcbio_nextgen.py ../config/run.yaml -n 28

Brad Chapman

unread,
Aug 13, 2016, 6:03:52 AM8/13/16
to Ivan De Dios, biovalidation

Ivan;
If you're re-running in the same directory and want to re-parallelize steps
you need to remove `checkpoints_parallel/*.done` files to tell bcbio that
you're re-running these steps:

http://bcbio-nextgen.readthedocs.io/en/latest/contents/parallel.html#troubleshooting

Hope that gets the analysis running in parallel as expected,
Brad


> [ text/plain ]
> --
> You received this message because you are subscribed to the Google Groups "biovalidation" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to biovalidatio...@googlegroups.com.
> To post to this group, send email to bioval...@googlegroups.com.
> Visit this group at https://groups.google.com/group/biovalidation.
> For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages