Metaphlan output

1,096 views
Skip to first unread message

shashank gupta

unread,
Feb 22, 2015, 12:31:48 PM2/22/15
to metaphl...@googlegroups.com
Hi,
i am trying to do MetaPhlAn using Galaxy server, i am using it first time.

As it is mentioned in the tutorial that Metaphlan accepts Fasta sequence which looks like-
>seq1
AAATTGCATTTAGACCAGG
>seq2
ATATATGATAGCGCGATGA
>seq3
CGTGCGCGTATAGTGGGGCTA and so on..

i uploaded fasta file in the GALAXY server, it gets uploaded without any error, then clicked Metaphlan keeping options default and clicked execute, again it got completed without an error.

But when i saw the output it shows something like this

unclassified 100.0

Am i missing something, because when i used the test fasta file available on GALAXY server, it works perfectly.

Best!
Shashank

Nicola Segata

unread,
Feb 22, 2015, 3:39:29 PM2/22/15
to shashank gupta, metaphl...@googlegroups.com
Hi Shashank,
 can it be that the size of the dataset you uploaded is too small (i.e. a small number of reads)? In that case MetaPhlAn will likely give you 100 unclassified, because MetaPhlAn is not a read-classifier but an estimator of species abundances and with just few reads the result cannot be robust.

I'd be able to help you more if you tell me the characteristics of the dataset (number of reads, length of the reads, origin of the sequences).

cheers
Nicola


--
You received this message because you are subscribed to the Google Groups "MetaPhlAn-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to metaphlan-users+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

shashank gupta

unread,
Feb 24, 2015, 1:30:17 AM2/24/15
to metaphl...@googlegroups.com, bioinforma...@gmail.com, nicola...@unitn.it
Hi,
the file i uploaded have 46,317 sequences.
len min/max/avg 361.0/845.0/655.0


thanks
Shashank

Nicola Segata

unread,
Feb 24, 2015, 2:56:08 AM2/24/15
to shashank gupta, metaphl...@googlegroups.com
Hi,
 for MetaPhlAn we are used to use at least half milling shot reads as input. You reads are quite long compared to standard Illumina sequencing, so in MetaPhlAn you should get better results using a local mapping strategy, e.g. "--bt2_ps sensitive-local" rather than the default "  --bt2_ps very-sensitive" (without "-local").
cheers
Nicola

shashank gupta

unread,
Feb 24, 2015, 11:13:30 PM2/24/15
to metaphl...@googlegroups.com, bioinforma...@gmail.com, nicola...@unitn.it
I am using Galaxy server, so how can i use above command ?

George Weingart

unread,
Feb 25, 2015, 1:50:37 AM2/25/15
to shashank gupta, metaphl...@googlegroups.com, nicola...@unitn.it
I believe that in Galaxy, metaphlan2 has the option to select that parm
--
You received this message because you are subscribed to the Google Groups "MetaPhlAn-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to metaphlan-use...@googlegroups.com.

e.m...@warwick.ac.uk

unread,
Jul 29, 2015, 10:17:51 AM7/29/15
to MetaPhlAn-users, bioinforma...@gmail.com, nicola...@unitn.it
Hi all,

I have the same problem, when I run my metagenomic sequencing samples through MetaPhlAn2 I always get 100% unclassified.

We did whole genome sequencing from environmental samples (paired-end) and got about 4,000,000 reads for each samples. Each sequence is about 250bp long.
I tried to use MetaPhlAn2 with joined reads but also with my paired-end reads, I tried sensitive, very-sensitive and very-sensitive-local but the result is always the same - 100 unclassified.

What am I doing wrong?

Cheers,

Eileen

Duy Tin Truong

unread,
Jul 29, 2015, 11:16:32 AM7/29/15
to e.m...@warwick.ac.uk, MetaPhlAn-users, bioinforma...@gmail.com, nicola...@unitn.it
Hi Eileen,

Can you send me the commands that you used? Besides, can you check if metaphlan2 actually received input through the pipe, or if the input file path exists?

Thanks,
Tin

Duy Tin Truong

unread,
Jul 29, 2015, 11:19:03 AM7/29/15
to e.m...@warwick.ac.uk, MetaPhlAn-users, bioinforma...@gmail.com, nicola...@unitn.it
Hi Eileen,

Can you check the output sam file or the bowtie2 output to see if bowtie2 actually works?

Thanks,
Tin

e.m...@warwick.ac.uk

unread,
Jul 29, 2015, 11:35:39 AM7/29/15
to MetaPhlAn-users, bioinforma...@gmail.com, nicola...@unitn.it, duytin...@unitn.it
Hi Tin,

I used the following command, but also tried it with very-sensitive and with my joined data rather than forward and reverse reads

metaphlan2.py 1_S1_L001_R1_001.fastq,1_S1_L001_R2_001.fastq --mpa_pkl db_v20/mpa_v20_m200.pkl --bowtie2db bowtie2db/mpa --bowtie2out metagenome.bowtie2_sensitive.bz2 --nproc 5 --bt2_ps very-sensitive-local --input_type fastq > MSAP1_sensitive.txt

How can I check if methaphlan2 received input through the pipe?

I attached the bowtie2 output file (ziped). I had a look into it but do not know if that looks alright.

Cheers,

Eileen

metagenome.bowtie2_sensitive.bz2

Duy Tin Truong

unread,
Jul 31, 2015, 11:41:33 AM7/31/15
to e.m...@warwick.ac.uk, MetaPhlAn-users, bioinforma...@gmail.com, nicola...@unitn.it
When checking your bowtie2 output, I could not find the marker name in the metaphlan2 database. Besides, looking at the command that you used, I think there was something wrong with bowtie2db option as you are running metaphlan2 with metaphlan1 database, can you try again with this:

cat 1_S1_L001_R1_001.fastq 1_S1_L001_R2_001.fastq | metaphlan2.py  --mpa_pkl db_v20/mpa_v20_m200.pkl --bowtie2db bowtie2db/mpa_v20_m200 --bowtie2out metagenome.bowtie2_sensitive.bz2 --nproc 5 --bt2_ps very-sensitive-local --input_type fastq > MSAP1_sensitive.txt

Thanks,
Tin

Reply all
Reply to author
Forward
0 new messages