What is exactly different between the two MPA reference sets? Have you removed a lot of species or they are still supposed to be there?
When I'm looking to Metaphlan1 results, I'm seeing a lot of unclassified species (Bifidobacterium_unclassified for example) while in Metaphlan2, it seems there is a little less but the results are quite different for some bacterial species.
To give you an example, with Metaphlan 1 for Bifidobacterium_unclassified, I'm getting an average of ~15 for relative abundance and it's by far my first top hit (I'm getting 2.50 for Bifidobacterium_adolescentis).
For Metaphlan2, I don't have that Bifidobacterium_unclassified but still have Bifidobacterium_adolescentis with an average abundance of 2.13 accross my samples. For sure there are now more information in the database that will affect that relative abundance but how can I verify that this version works well for me now?
Thanks a lot for your help.
Thanks a lot.
Have a nice day!