Speed up "gather-pvalues-for-each-gene"


Ashley Qin

Dec 1, 2020, 6:20:20 PM
to PheWeb-UMich
Hi Peter, is there any way to speed up the script "gather-pvalues-for-each-gene"? I have almost 900 phenotypes to process, which translates to about 20,000 tasks. I'm running it through the SLURM cluster scheduler, but even after 4 days the job hits the time limit before it finishes.

pjvandehaar

Feb 2, 2021, 4:37:35 PM
to PheWeb-UMich
Sorry about getting back to you so late.  Presumably you've figured it out by now.

Yeah, that step is too slow.  It could run 4x faster with a better approach, and I wrote out how to do it at the top of https://github.com/statgen/pheweb/blob/master/pheweb/load/gather_pvalues_for_each_gene.py .  I'll get to it eventually, but if you'd like to work on it I will help.