Running for a long time and stuck in "Finding the best partitioning scheme"


Nayeli Gutiérrez

Jun 28, 2022, 7:20:10 PM
to PartitionFinder
Hi Rob, thanks for this great software! This is my first time running PartitionFinder2 with a big dataset (12,997 loci, 40 spp), and I would like to know whether it is normal for it to take more than 14 days to finish using 28 CPUs on a cluster. It finished analyzing the subsets, and it's been stuck in the "Finding the best partitioning scheme" part for five days with no error message.

Command:
python /home/ngutierrez/array1/software/partitionfinder-2.1.1/PartitionFinder.py /home/ngutierrez/array1/trees/allspp/partitionfinder/ --raxml -p 28

The log file is attached (I had to zip it because it was too big to attach here).

Thanks!
Nayeli
log.txt.zip

Rob Lanfear

Jun 28, 2022, 7:22:58 PM
to PartitionFinder
Hi Nayeli,

I think the answer is in the logfile...

```
INFO     | 2022-06-15 22:01:01,359 | analysis_m |    PartitionFinder will have to analyse 1469956744 subsets to complete this analyses
WARNING  | 2022-06-15 22:01:01,365 | progress   |    1469956744 is a lot of subsets, this might take a long time to analyse. Perhaps consider using a different search scheme instead (see the Manual)
```

About 1.47 billion is a lot of subsets to analyse. I'd suggest taking a look through the manual and choosing the fastest possible approach for your dataset.

Rob
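
For anyone wanting to check their own run before waiting days: below is a minimal sketch, assuming the log line is worded exactly as in the excerpt quoted above, that pulls the planned subset count out of a PartitionFinder log. The function name `planned_subsets` is hypothetical and not part of PartitionFinder, and other versions may word the message differently.

```python
import re

# Pattern based on the log line quoted in this thread (assumption:
# other PartitionFinder versions may phrase this message differently).
SUBSET_LINE = re.compile(r"will have to analyse (\d+) subsets")


def planned_subsets(log_text):
    """Return the subset count announced in the log text, or None if absent."""
    match = SUBSET_LINE.search(log_text)
    return int(match.group(1)) if match else None


# The INFO line from the log excerpt in this thread.
sample = (
    "INFO     | 2022-06-15 22:01:01,359 | analysis_m |    "
    "PartitionFinder will have to analyse 1469956744 subsets "
    "to complete this analyses"
)

count = planned_subsets(sample)
print(f"{count:,} subsets planned")
```

To use it on a real run, read the log file into a string and pass it to the function; a count in the billions is a sign that a faster search scheme is worth considering.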

Nayeli Gutiérrez

Jul 14, 2022, 6:54:15 PM
to PartitionFinder
Hi Rob, thanks for your answer. Do you think the combination below will do? I am using 80 CPUs and 500 GB of memory on a cluster.

--raxml -p 80 --rcluster-max 1000 --rcluster-percent 10

Best,
Nayeli
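
For reference, here is a rough sketch of how those flags would combine with the original invocation from the first post. It only assembles and prints the command and does not run PartitionFinder. The helper `build_command` is hypothetical, the paths are copied from the first post, and it is an assumption that the rcluster options only matter when the search in the .cfg file is set to rcluster (worth confirming in the Manual).

```python
import shlex

# Paths copied from the original command earlier in this thread.
SCRIPT = "/home/ngutierrez/array1/software/partitionfinder-2.1.1/PartitionFinder.py"
FOLDER = "/home/ngutierrez/array1/trees/allspp/partitionfinder/"


def build_command(cpus, rcluster_max, rcluster_percent):
    """Assemble the PartitionFinder argument list (hypothetical helper)."""
    return [
        "python", SCRIPT, FOLDER,
        "--raxml",
        "-p", str(cpus),
        "--rcluster-max", str(rcluster_max),
        "--rcluster-percent", str(rcluster_percent),
    ]


cmd = build_command(cpus=80, rcluster_max=1000, rcluster_percent=10)
# shlex.join needs Python 3.8+; it quotes arguments for safe pasting into a shell.
print(shlex.join(cmd))
```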

Nayeli Gutiérrez

Jul 16, 2022, 1:30:53 PM
to PartitionFinder

It did: Total processing time: 1 day, 12:39:56 (h:m:s) !!!

:D 

Rob Lanfear

Oct 4, 2022, 6:58:12 PM
to PartitionFinder
Cool!

R
