phylophlan.fna is empty

80 views
Skip to first unread message

Jiffin Paulose

unread,
Nov 15, 2018, 6:23:26 PM11/15/18
to PhyloPhlAn-users
Hello again!

Thanks to you, I am cranking out trees using proteomes with no problems!  However, I wanted to compare proteome-level and genomic-level analyses, and so am trying to use the supertree_nt.cfg I kept getting an error at the makeblastdb stage, so I went back to our conversations and started searching files.  I found that phylophlan.fna is empty, so I deleted the contents of phylophlan_databases, allowed the program to download the .tar and .md5 file, but the .fna file is still empty.  Any suggestions?  Thanks again!

J.

Francesco Asnicar

unread,
Nov 16, 2018, 7:17:11 AM11/16/18
to phylophl...@googlegroups.com
Hello Jiffin,

I think the config filename could be misleading... Sorry about that! The supertree_nt.cfg is meant to be used for a supertree approach when the marker genes in the database are nucleotides.
So, if you want to perform a phylogenomic analysis using the PhyloPhlAn database (which is amino acids) when inputs can be either/both proteomes and genomes, you simply need to make sure that the [map_dna] section is present in the configuration file.
In the case of the PhyloPhlAn database the configuration there should consider a tool able to do a translated search like, tblastn or diamond.
If you ran the phylophlan_write_default_configs.sh script the supertree_aa.cfg configuration file shuold already have the [map_dna] section.

Please, let me know if something is not clear.

Many thanks,
Francesco

--
You received this message because you are subscribed to the Google Groups "PhyloPhlAn-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to phylophlan-use...@googlegroups.com.
To post to this group, send email to phylophl...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/phylophlan-users/e8e172e5-69a2-4326-8f9b-0673afd9abaa%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Kevin Lee

unread,
Jan 17, 2019, 7:34:57 PM1/17/19
to PhyloPhlAn-users
Dear Francesco,

I've fallen into the same trap as well. So, if I understand correctly, supertree_aa or supermatrix_aa can actually handle nucleotide input data when using an amino acid database (e.g. phylophlan). 
So I've attempted to do:

phylophlan2.py -i input/Arthrobacter -d phylophlan --diversity low -f phylophlan_configs/supermatrix_aa.cfg --verbaes

I got the following error, but that turned out that my version of diamond was out of date (does not have --max-hsps)

[e] Command '['diamond', 'blastx', '--quiet', '--threads', '1', '--outfmt', '6', '--more-sensitive', '--id', '50', '--max-hsps', '35', '-k', '0', '--query', '/home/kevinl/phylophlan/input/Arthrobacter/B4.fna', '--db', 'phylophlan_databases/phylophlan/phylophlan.dmnd', '--out', 'Artobacter_phylophlan/tmp/map_dna/B4.b6o.bkp']' returned non-zero exit status 1

Once upgrading diamond to the newest version, phylophlan2 was able to map the DNA.

Francesco Asnicar

unread,
Jan 19, 2019, 12:27:54 PM1/19/19
to phylophl...@googlegroups.com
Dear Kevin,
Many thanks for reporting this. Can you tell me which version of Diamond was giving you trouble? I can think to add an ad-hoc check for diamond and report an error in the case of old versions.

Many thanks,
Francesco

Kevin Lee

unread,
Feb 3, 2019, 8:41:33 PM2/3/19
to PhyloPhlAn-users
Dear Francesco,

Unfortunately, I just overwritten the the binary (yay to messy single user system). However, according to diamond's changelog:

0.8.31 -
removed --single-domain option and replaced by --max-hsps
0.8.31 (released in 1 Jan 2017) was the first occurrence of the option.

Kind regards,

Kevin

Francesco Asnicar

unread,
Feb 6, 2019, 4:15:28 AM2/6/19
to phylophl...@googlegroups.com
Dear Kevin,

No problem, many thanks for checking the changelog in diamond. I'll add a check about the damond version in the next commit.

Thanks,
Francesco

Reply all
Reply to author
Forward
0 new messages