Thanks for making this software available. I'm currently using HUMAnN2 v0.7.
I'm using the standard HUMAnN2 pipeline to study gut microbiome bacterial abundance, genes represented, and pathways in a few dozen mice. Despite starting with a large sequence dataset, I've noticed that the number of specific bacterial genus and species is limited (about 40 species detected per sample, 40-70% sequence alignment rate via bowtie2). I suspect that this is due to the taxonomic profiling performed by MetaPhAn2 in the pre-screening process. My questions are:
1. If I'm looking for more accurate bacterial abundance estimates, is it best to use HUMAnN2 with the --bypass-prescreen option, or to use another tool?
2. In your experience, does using --bypass-prescreen significantly change downstream gene / pathway abundance results? If yes, how so. If not, why use the prescreen?
3. Is there a way to extract bacterial abundance from translated alignments (diamond) for bugs not identified in pre-screening?
Best,
Sherman Jia, MD MEng
UCSF MS Center