I just had a quick methods based question I wanted to share with the group.
Currently, I utilize the standard HUMAnN2 workflow for meta genome and metatranscriptome processing. To briefly summarize our pipeline, we quality filter/pair with trimmomatic, remove host reads with KneadData, annotate taxonomy with Metaphlan, and conduct uniref90 annotation within Humann2 (using diamond) and subsequently convert identified Uniref90 terms into KEGG orthlogy terms (KOs) which we then summarize into stratified L1-L3 pathways.
I suppose I have 2 questions...if were were to choose to obtain a KEGG database license and utilize that (and thus bypass the UniRef90 to KO conversion):
1) What is the method by which humann can be directed to the LICENSED database?
2) What would a database warrant me? I noted that many of the KOS that currently are obtained after uniref90 conversion are labeled "No_Name"...but actually do have names when searched online. Would a KEGG database license likely only provide those names...or would it likely elevate the number of successful annotations of my filtered sequence data?
I hope this makes sense and is an okay question to ask the forum, just looking for some guidance here.
Thank you,
Justin Wright