Hi,
I'm making a phyloseq package from MED output, but I have a couple of questions regarding taxonomy.
To build the phyloseq I have:
- otu table: matrix of sequence nodes (e.g. 5006, 30004... etc) by samples
- sample data: a data frame with my metadata, samples by variables
Now for the taxonomy I have separated the levels from the assignment in RDP for each node resulting in:
- taxonomy table: matrix of sequence nodes by taxonomic ranks (from order to genus), where I added an extra level for nodes.
Building phyloseq works perfect but I'm unsure of whether this is the correct way, as I might be having some issues with taxonomy assignment in downstream analyses.
So my questions would be:
- How to make a taxonomy matrix from MED output that is compatible for phyloseq
- Should unclassified sequences be excluded? (though I think might be informative)
- Any other tip from going from MED to phyloseq is highly appreciated
Thanks
Susana