I am using humann2 to interpret metagenomes predicted by PICRUSt. I have downloaded the KEGG database and assigned KOs to modules. The scripts work fine; however, I noticed that many modules are missing from the "modulec" data file. Every module with an ID greater than M00377 is missing, so many of the KOs end up as "unintegrated".
As some of the modules I am interested in are also missing, I was thinking of adding them manually to "modulec". This works well for "simple" modules. However, some of my modules have alternative genes for certain module steps. An example is the denitrification module M00529, which has two alternatives each for the nitrate reduction step ((K00370+K00371+K00374) or (K02567+K02568)), the nitrite reduction step (K00368 or K15864), and the nitric oxide reduction step. How do I add a pathway like that to the file? Is it possible to provide multiple options, or should I add separate lines for the possible combinations? If the latter, should I then sum the abundances of the different combinations?
Hopefully somebody can help me with this issue.
Regards,
Katharina
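For the "separate lines for the possible combinations" option asked about above, here is a minimal sketch of how the combinations could be enumerated. The step alternatives are taken from the KEGG definition of M00529 quoted later in this thread. The suffixed IDs (`M00529_1`, ...) and the tab-separated output layout are hypothetical, used only for illustration, and are not a confirmed "modulec" format.

```python
from itertools import product

# Each step is a list of alternatives; each alternative is a tuple of KOs
# that must all be present together (a complex).
steps = [
    [("K00370", "K00371", "K00374"), ("K02567", "K02568")],  # nitrate reduction
    [("K00368",), ("K15864",)],                               # nitrite reduction
    [("K04561", "K02305"), ("K15877",)],                      # nitric oxide reduction
    [("K00376",)],                                            # final step of the definition
]

def expand(steps):
    """Return every flat KO list obtained by picking one alternative per step."""
    return [[ko for alt in choice for ko in alt] for choice in product(*steps)]

combos = expand(steps)

# Hypothetical output layout: a made-up suffixed ID followed by the KOs.
for i, kos in enumerate(combos, 1):
    print(f"M00529_{i}", *kos, sep="\t")
```

With two alternatives at three of the steps, this gives 2 x 2 x 2 = 8 combinations. Whether the abundances of such split entries should be summed afterwards is the open question from the post; the sketch does not settle it.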
Thank you for the answer. I took a look at the modulep file and tried to add my pathway there in the same form in which it is displayed in KEGG: (K00370+K00371+K00374,K02567+K02568) (K00368,K15864) (K04561+K02305,K15877) K00376
However, the pathway is not found, even though most of the KOs are present in my database (K15864 and K15877 are not, but they are alternatives). Did I misinterpret the logic for the entry? Is it somehow different in KEGG and in the modulep file? In the KEGG help a module definition is explained like this:
"The definition of the module as a list of K numbers. Comma separated K numbers indicate alternatives. Plus signs are used to represent a complex and a minus sign denotes a non-essential component in the complex." It does not say anything about the use of parentheses, so I was wondering whether those are needed to define the module.
BR,
Katharina
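To make the logic of the KEGG notation concrete, here is a minimal sketch of an evaluator for such a definition string. It is not how humann2 reads "modulep". It assumes the following reading of the notation: a comma separates alternatives (OR) and binds loosest, a space separates consecutive steps (AND), "+" joins required subunits of a complex, "-" marks an optional subunit, and parentheses group. The function name `module_complete` is made up for this example.

```python
import re

TOKEN = re.compile(r"K\d{5}|[()+,-]|\s+")

def module_complete(definition, present):
    """Return True if the KO set `present` satisfies a KEGG-style definition."""
    tokens = [" " if t.isspace() else t for t in TOKEN.findall(definition.strip())]
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def take():
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of definition")
        tok = tokens[pos]
        pos += 1
        return tok

    def alternatives():      # ',' means OR (assumed lowest precedence)
        result = sequence()
        while peek() == ",":
            take()
            rhs = sequence()  # always parse the right side, then combine
            result = result or rhs
        return result

    def sequence():          # ' ' means consecutive steps, all required
        result = complex_()
        while peek() == " ":
            take()
            rhs = complex_()
            result = result and rhs
        return result

    def complex_():          # '+' required subunit, '-' optional subunit
        result = atom()
        while peek() in ("+", "-"):
            op = take()
            rhs = atom()
            if op == "+":    # an optional ('-') subunit never affects the result
                result = result and rhs
        return result

    def atom():
        tok = take()
        if tok == "(":
            result = alternatives()
            if take() != ")":
                raise ValueError("unbalanced parentheses")
            return result
        if not tok.startswith("K"):
            raise ValueError(f"unexpected token {tok!r}")
        return tok in present

    ok = alternatives()
    if pos != len(tokens):
        raise ValueError("unexpected trailing token")
    return ok

definition = ("(K00370+K00371+K00374,K02567+K02568) (K00368,K15864) "
              "(K04561+K02305,K15877) K00376")
present = {"K00370", "K00371", "K00374", "K00368", "K04561", "K02305", "K00376"}
print(module_complete(definition, present))
```

Under this reading, the parentheses matter: they delimit which KOs the comma-separated alternatives apply to, even though the KEGG help text quoted above does not mention them.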
Yes, all the others are. If I remove the "+", "," and the parentheses (and the two KOs that are not in the database), the pathway is detected. As soon as I put the logical symbols back in, the pathway is no longer detected.
BR,
Katharina
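The workaround described in the last post (dropping the operators and the two absent KOs) can be sketched as follows. The variable names are made up for this example, and the space-separated output is only illustrative, not a confirmed "modulep" layout.

```python
import re

definition = ("(K00370+K00371+K00374,K02567+K02568) (K00368,K15864) "
              "(K04561+K02305,K15877) K00376")

# KOs reported in the post as absent from the database.
absent = {"K15864", "K15877"}

# Keep only the K numbers, in their original order, minus the absent ones.
flat = [ko for ko in re.findall(r"K\d{5}", definition) if ko not in absent]
print(" ".join(flat))
```

Note that the flattened list no longer records which KOs were alternatives to one another, so it is not equivalent to the original definition.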