I am new to Humann2 and looking for an explanation for a similar problem. I am running Humann2 using Metaphlan2 output and my data is a shotgun sequencing analysis of DNA extracted from rat faecal samples. Much like Mohammed, I notice that when running Humann2, after the bowtie2 step I have a high % of unmapped reads, but my samples cannot contain any human DNA. Should I worry about it? Should I use a different database (currently I am using the UniRef90 one). Also, while I have 93% of unmapped reads, at the metaphlan step it's stated that the bugs identified by metaphlan explain the 100% predicted community composition. Does this mean that, even if the bowtie2 step doesn't align a good percentage of reads, I still have all the reads I need to thoroughly analyze the community composition?
Please find below the log and the command I'm running:
humann2 --input 170427_A19_IonXpress_010.fastq --output 170427_A19_fastq --gap-fill on --taxonomic-profile A19_rel.txt
Output files will be written to: /home/bioc1484/Documents/170427_Pilot/170427_A19_fastq
Found g__Parabacteroides.s__Parabacteroides_goldsteinii : 26.36% of mapped reads
Found g__Lactococcus.s__Lactococcus_lactis : 22.97% of mapped reads
Found g__Akkermansia.s__Akkermansia_muciniphila : 20.84% of mapped reads
Found g__Subdoligranulum.s__Subdoligranulum_unclassified : 10.04% of mapped reads
Found g__Bacteroides.s__Bacteroides_uniformis : 7.45% of mapped reads
Found g__Lactobacillus.s__Lactobacillus_murinus : 3.38% of mapped reads
Found g__Peptostreptococcaceae_noname.s__Peptostreptococcaceae_noname_unclassified : 2.85% of mapped reads
Found g__Blautia.s__Ruminococcus_torques : 1.75% of mapped reads
Found g__Clostridium.s__Clostridium_symbiosum : 0.71% of mapped reads
Found g__Dorea.s__Dorea_unclassified : 0.68% of mapped reads
Found g__Butyrivibrio.s__Butyrivibrio_unclassified : 0.58% of mapped reads
Found g__Peptostreptococcaceae_noname.s__Clostridium_glycolicum : 0.54% of mapped reads
Found g__Oscillibacter.s__Oscillibacter_unclassified : 0.53% of mapped reads
Found g__Eggerthella.s__Eggerthella_unclassified : 0.47% of mapped reads
Found g__Clostridium.s__Clostridium_hathewayi : 0.18% of mapped reads
Found g__Adlercreutzia.s__Adlercreutzia_equolifaciens : 0.18% of mapped reads
Found g__Ruminococcus.s__Ruminococcus_flavefaciens : 0.09% of mapped reads
Found g__Anaerotruncus.s__Anaerotruncus_colihominis : 0.07% of mapped reads
Found g__Flavonifractor.s__Flavonifractor_plautii : 0.07% of mapped reads
Found g__Lachnospiraceae_noname.s__Lachnospiraceae_bacterium_3_1_46FAA : 0.07% of mapped reads
Found g__Anaerotruncus.s__Anaerotruncus_unclassified : 0.06% of mapped reads
Found g__Roseburia.s__Roseburia_unclassified : 0.06% of mapped reads
Found g__Clostridium.s__Clostridium_asparagiforme : 0.04% of mapped reads
Found g__Eubacterium.s__Eubacterium_plexicaudatum : 0.02% of mapped reads
Found g__Bifidobacterium.s__Bifidobacterium_pseudolongum : 0.01% of mapped reads
Total species selected from prescreen: 25
Selected species explain 100.00% of predicted community composition
Creating custom ChocoPhlAn database ........
Running bowtie2-build ........
Running bowtie2 ........
Total bugs from nucleotide alignment: 17
g__Akkermansia.s__Akkermansia_muciniphila: 79346 hits
g__Lactococcus.s__Lactococcus_lactis: 120221 hits
g__Blautia.s__Ruminococcus_torques: 5543 hits
g__Parabacteroides.s__Parabacteroides_goldsteinii: 260069 hits
g__Bacteroides.s__Bacteroides_uniformis: 79711 hits
g__Eubacterium.s__Eubacterium_plexicaudatum: 29348 hits
g__Peptostreptococcaceae_noname.s__Clostridium_glycolicum: 4923 hits
g__Anaerotruncus.s__Anaerotruncus_colihominis: 5237 hits
g__Clostridium.s__Clostridium_hathewayi: 4780 hits
g__Clostridium.s__Clostridium_asparagiforme: 1705 hits
g__Lactobacillus.s__Lactobacillus_murinus: 11265 hits
g__Adlercreutzia.s__Adlercreutzia_equolifaciens: 2972 hits
g__Flavonifractor.s__Flavonifractor_plautii: 10271 hits
g__Clostridium.s__Clostridium_symbiosum: 13702 hits
g__Lachnospiraceae_noname.s__Lachnospiraceae_bacterium_3_1_46FAA: 3826 hits
g__Bifidobacterium.s__Bifidobacterium_pseudolongum: 641 hits
g__Ruminococcus.s__Ruminococcus_flavefaciens: 314 hits
Total gene families from nucleotide alignment: 28011
Unaligned reads after nucleotide alignment: 93.3538774989 %