> ra...@googlegroups.com <mailto:ra...@googlegroups.com>.
I am running RaxML v8.2.4 for a big DNA dataset (30000 taxa * 30000 bp, ~28000 patterns),


> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com
> <https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com?utm_medium=email&utm_source=footer>.
--
Alexandros (Alexis) Stamatakis
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
www.exelixis-lab.org
> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com
> <https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com?utm_medium=email&utm_source=footer>.
--
Alexandros (Alexis) Stamatakis
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
www.exelixis-lab.org
> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com
> <https://groups.google.com/d/msgid/raxml/9336b98f-2561-40eb-8fa6-98574a456caa%40googlegroups.com?utm_medium=email&utm_source=footer>.
--
Alexandros (Alexis) Stamatakis
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
www.exelixis-lab.org
> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
I agree that the large size of site patterns here is weird and I would double check carefully on the alignment to get rid of misalignment. Please feel free to suggest me any useful tools to examine alignment quality and find out errors.
Thanks!
> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
In either case, genuine or artefacts, those additional patterns are utterly useless for optimisation and you need to get them out of the matrix before you run any tree inference. E.g. by eliminating all sites that differ in just in a fraction of the accessions.
Dear Guido,In either case, genuine or artefacts, those additional patterns are utterly useless for optimisation and you need to get them out of the matrix before you run any tree inference. E.g. by eliminating all sites that differ in just in a fraction of the accessions.So here you want to suggest is to remove those sites with only a little amount of minor variants within? As those "variants" may actually be artefacts and it is useless for tree optimisation but lead to great additional computational pressure? And this "variants" may lead to oversized pattern count as my data here.



> an email to ra...@googlegroups.com
> <mailto:ra...@googlegroups.com>.
I will also try to remove those "fake" noise in the alignment to reduce "artificial" patterns, but do you think it is meaningful to remove those sites with all identical base which I think it is not phylogenetic-informative.
