yes, please send me the files to my personal email,
alexis
On 13.10.2017 14:24, Benjamin lin wrote:
> Dear Alexis,
>
> I encounter a RAxML bug probably related to long tree labels.
> I use a tree with ~12000 leaves and the corresponding 1.5kb multiple
> alignment.
>
> If i use the modes -f v (EPA) or -f e (optimization of an input tree), i
> get the following output and core dump error (last line):
>
> |
> IMPORTANT WARNING
> Found34sequences that are exactly identical to other sequences inthe
> alignment.
> Normallythey should be excluded fromthe analysis.
> Analignment file withsequence duplicates removed has already
> been printed to file
> ../mmod_LTPs128_SSU_aligned_99.5_trim.hmm_1000.aln.fasta.reduced
> UsingBFGS method to optimize GTR rate parameters,to disable thisspecify
> "--no-bfgs"
> Founda total of 12953taxa intree file ../mmmod_LTPs128_SSU_tree.newick
> ThisisRAxMLversion 8.2.9released byAlexandrosStamatakison July202016.
> Withgreatly appreciated code contributions by:
> AndreAberer(HITS)
> SimonBerger(HITS)
> AlexeyKozlov(HITS)
> KassianKobert(HITS)
> DavidDao(KIT andHITS)
> SarahLutteropp(KIT andHITS)
> NickPattengale(Sandia)
> WaynePfeiffer(SDSC)
> AkifumiS.Tanabe(NRIFS)
> CharlieTaylor(UF)
> Alignmenthas 1979distinct alignment patterns
> Proportionof gaps andcompletely undetermined characters
> inthisalignment:32.50%
> RAxMLlikelihood-based placement algorithm
> Using1distinct models/data partitions withjoint branch length optimization
> Allfree model parameters will be estimated byRAxML
> ML estimate of 25per site rate categories
>
>
> Partition:0
> AlignmentPatterns:1979
> Name:NoNameProvided
> DataType:DNA
> SubstitutionMatrix:GTR
>
>
> RAxMLwas called asfollows:
>
>
> raxmlHPC-SSE3 -f v -G 0.015625-m GTRCAT -n test_1000 -s
> ../1000.aln.fasta -t ../tree.newick
>
>
> raxmlHPC-SSE3:treeIO.c:836:treeFindTipByLabelString:Assertion`!
> tr->nodep[lookup]->back' failed.
> Aborted (core dumped)
> |
>
>
> This seems to appear after the parsing of the alignment (where a few
> identical sequences are detected), and the tree.
> This is true for RAxML 8.2.9 to 8.2.11 (I didn't try earlier versions).
> The bug don't appear in the rapid bootstrapping + ML inference mode (-f a).
> So, the error might be related to the -t option (used in both -f v and e
> modes).
>
> I tested the integrity of the tree and the alignment with other
> software, and everything looks fine.
> Tree labels look as follows:
> - Erwinia_oleae__GU810925__Enterobacteriaceae
> - Erwinia_gerundensis__FJ611848__Enterobacteriaceae
> - Pantoea_intestinalis__KP326384__Enterobacteriaceae
> - Pantoea_theicola__AB907776__Enterobacteriaceae
> - Yersinia_enterocolitica_subsp_enterocolitica__AF366378__Enterobacteriaceae
> - Yersinia_enterocolitica_subsp_palearctica__FR729477__Enterobacteriaceae
>
> I can send you the files if you want to reproduce the error.
> In the meantime i will explore the treeFindTipByLabelsString method to
> try to understand the issue.
>
> Thanks,
>
> --
> You received this message because you are subscribed to the Google
> Groups "raxml" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to
raxml+un...@googlegroups.com
> <mailto:
raxml+un...@googlegroups.com>.
> For more options, visit
https://groups.google.com/d/optout.
--
Alexandros (Alexis) Stamatakis
Research Group Leader, Heidelberg Institute for Theoretical Studies
Full Professor, Dept. of Informatics, Karlsruhe Institute of Technology
www.exelixis-lab.org