several partitions

andrea nino

unread,

Apr 27, 2021, 7:03:17 PM4/27/21

to PartitionFinder

Dear Rob,

I used a strategy by partitioned my dataset (13500 pb * 150 species) by gen and by codon to Protein coding Genes, and one by each rRNA with model_selection : bic and search: greedy. All was ok with the partitionFinder run, and Iqtree, Mrbayes, RaxML analysis. However, to my Beast analysis (using the same partition scheme and 7 calibration points) I cannot arrive to optimize my ESS values after several N° of generations. I supposed it could arrive one day running several analysis and many hr in a cluster. Do you believe that optimize the number of generation could help to optimize the Beast analysis?

What could be the best strategy? Grouping the partition by gen, or by position? or by position and same model ? Or this does not have any sense.

Or relented a new scheme partition in PartitionFinder?

Thank for the advises

Best regards,

Andrea

Subset | Best Model | # sites | subset id | Partition names

1 | GTR+I+G | 777 | aa6d08870474a261954d433ad287322a | 12S

2 | GTR+I+G | 1137 | c770cc808857d12eb2d67da7e0758b9b | 16S

3 | GTR+I+G | 1112 | 2ef2eb84dc48341275890c9886e931cd | COII_pos1, ATP6_pos1, COIII_pos1, CYTB_pos1

4 | TVM+I+G | 1014 | 3bd306a9701fb1001b9b06f8625942f2 | COI_pos2, COIII_pos2, ATP6_pos2

5 | TVM+G | 423 | 14af0a2a5e4771d8fcec37f54c19c4fa | ND3_pos3, ATP6_pos3, ATP8_pos3

6 | HKY+I+G | 70 | 09c06b283a6ba155b862fbbb38eeaae7 | ATP8_pos1

7 | GTR+I+G | 283 | 604c00155ccf4a0be8e7dbd1ec888904 | ND6_pos2, ATP8_pos2

8 | GTR+I+G | 517 | 63e10bda2d363c9dee5e81713c85b4ae | COI_pos1

9 | GTR+I+G | 752 | 2b5edf5b945982971c25c94bf9a8fc9f | COI_pos3, COII_pos3

10 | GTR+I+G | 615 | 97a01e0ce67a8045a0482aad57a39f0e | COII_pos2, CYTB_pos2

11 | TIM+G | 265 | e98e1a37d5da6b8c54cd5707f0eafde4 | COIII_pos3

12 | TIM+I+G | 380 | ed77f53d9829f38e52853f1e541a6764 | CYTB_pos3

13 | GTR+I+G | 1455 | cc75992d6172e44b09acff9311e955e3 | ND5_pos1, ND1_pos1, ND4L_pos1, ND4_pos1

14 | GTR+I+G | 314 | 837d34d1fd73ea9fed0c069125ab885a | ND1_pos2

15 | TVM+I+G | 314 | 719cf8dbcbc87def839e14b6f74e2304 | ND1_pos3

16 | GTR+I+G | 374 | 1f9895b348e888b7f719089ffb41e425 | ND2_pos1

17 | TVM+I+G | 495 | 7b4514749caa8db90ae3ae7e6e46972b | ND2_pos2, ND3_pos2

18 | GTR+G | 374 | c1c9c0bacf26d0bd81b6f430703637e1 | ND2_pos3

19 | GTR+I+G | 334 | 643fb50e0a0911a120c27a203a85bdef | ND6_pos1, ND3_pos1

20 | GTR+I+G | 1141 | 4e0dde062f976253d07b25928fb79cea | ND4L_pos2, ND4_pos2, ND5_pos2

21 | TIM+I+G | 101 | d59867060c15e1b8554ccb066da1647e | ND4L_pos3

22 | TRN+I+G | 1040 | 276ad8fb996a1e06214eedd85c87439a | ND5_pos3, ND4_pos3

23 | HKY+G | 213 | 15994501a9ddd135de599d0ba39dd87c | ND6_pos3

Rob Lanfear

unread,

Apr 27, 2021, 7:08:02 PM4/27/21

to PartitionFinder

Hi Andrea,

There are almost limitless reasons why your BEAST analyses may not be converging. I'd be somewhat surprised if the partitioning scheme was the culprit, though it's possible.

One issue is that PartitionFinder is not built for a Bayesian framework (it's very explicitly built for a frequentist framework). So there's no guarantee that the partitioning scheme you get are appropriate for a Bayesian analysis.

The usual solution to low ESS values is very simple - run the analysis for longer, and/or run a series of independent chains and combine the output. Given long enough runs you will usually get good sampling. More generally, there are lots of ways to tune an MCMC to help sample properly from the posterior distribution, and lots of options to do this in BEAST. That's a topic for the BEAST discussion group though.

Rob

Yu Sun

unread,

May 3, 2021, 10:37:26 PM5/3/21

to partiti...@googlegroups.com

Hi Nino,

I also used the same strategy to get the results and to run the Bayesian, it took me a long time to run, how long have you used it? Thank you.

Best,

Yu

andrea nino <nerak...@gmail.com> 于2021年4月27日周二下午6:03写道：

--
You received this message because you are subscribed to the Google Groups "PartitionFinder" group.
To unsubscribe from this group and stop receiving emails from it, send an email to partitionfind...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/partitionfinder/7b1ecc6e-e2cc-4d63-8ace-a6d9bb429a50n%40googlegroups.com.

andrea nino

unread,

May 6, 2021, 6:09:11 AM5/6/21

to partiti...@googlegroups.com

Dear Rob,

thank you so much for your answer. Yes, I agree that this partition scheme is correct and totally possible to do a Beast analysis. I will try to run a larger N° generation and long runs and run a series of independent chains.

Best,

Andrea

--
You received this message because you are subscribed to a topic in the Google Groups "PartitionFinder" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/partitionfinder/yGpXWIw-2wQ/unsubscribe.
To unsubscribe from this group and all its topics, send an email to partitionfind...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/partitionfinder/7ce67119-ef84-434b-85d7-17d5e42d2f37n%40googlegroups.com.

andrea nino

unread,

May 6, 2021, 6:16:54 AM5/6/21

to partiti...@googlegroups.com

Hi Yu,

I did 6 independent runs with 50 million of generation, sampling log each 1000 gn. Each one takes around 90hr on the cipres server.

Could you indicate to me how long time runs your analisis? and how many generations?

Thank you for the information.

Best

Andrea

To view this discussion on the web visit https://groups.google.com/d/msgid/partitionfinder/CADjeuP0Xc9mQ0E237NsZ9CnTdZ3VZgOW2omEmq%3DAgF4ZFFD48Q%40mail.gmail.com.

Yu Sun

unread,

May 16, 2021, 12:11:28 AM5/16/21

to partiti...@googlegroups.com

I ran it for a long time until my computer crashed... btw, the generations I used 10 million. So, I decided to run it on PhyloBayes.

Yu

andrea nino <nerak...@gmail.com> 于2021年5月6日周四上午5:16写道：

To view this discussion on the web visit https://groups.google.com/d/msgid/partitionfinder/CAJRpOga14QdXOLaiBdmCWoGvKSUN1D4FUhRRSzvyCm7j%3D1fHPw%40mail.gmail.com.

Reply all

Reply to author

Forward