Hi Ning,
that's a good question. The two types of analysis (clade_profiles and marker_ab_table) are quite similar and they mostly differ only for the format.
They both report the number of reads per marker normalized (i.e. divided) by the marker length (in nt) and multiplied by 1,000. As said, the differences are in the output format as detailed below.
In the output obtained with "clade_profiles":
* the markers are binned by clade
* values of 0 are explicitly reported for markers in clades having at least one marker with non-null abundance
* can be useful to easily check the consistency of markers within a clade
In the output obtained with "marker_ab_table":
* only markers with non-null abundance are reported
* clade information is not included
* abundances from multiple samples can be merged with the "utils/merge_metaphlan_tables.py" script.
* the marker abundances can be normalized by the total number of reads in the metagenome (specified with --nreads) to make abundances more consistently comparable across samples
We recently added "marker_ab_table" because of the last two advantages listed above (i.e. comparing the marker abundances across samples), and we left "clade_profiles" mostly for retro-compatibility.
let me know if this solves your doubts
many thanks
Nicola
On Monday, February 18, 2013 10:41:01 PM UTC+1, Ning Li wrote:
Hi,I am currently using MetaPhlAn, it is a nice tool for taxonomy classification. I understand the relative abundance in the output as '-t rel_ab' [default]. But I am confused by another two paramenters of '-t ANALYSIS TYPE'
They are :
clade_profiles: normalized marker counts for clades with at least a non-null marker
marker_ab_table: normalized marker counts (only when > 0.0 and normalized by metagenome size if --nreads is specified)
My questions are
1. What are the differences of those two, 'normalized marker counts'?
2. What do the values mean in the output file if I choose them? How do you calculate them?
Thank you so much.
Best
Ning