Hello! I'm running the new picrust2 and would greatly appreciate some help if you have a moment. For my experiments we sequenced v3-v4 and I'm getting the following errors:
Experiment 2: Warning - 4 input sequences aligned poorly to reference sequences
Warning - 157 input sequences aligned poorly to reference sequences
Then for both I get the following:
Warning:
There was only one file for the function: KO
Maybe
that's fine if you used custom traits or there were no sequences matching one
of the domains.
Warning:
There was only one file for the function: EC
Maybe
that's fine if you used custom traits or there were no sequences matching one
of the domains.
My understanding is that the new picrust2 aligns bacteria first and archaea second, so it seems I am having less alignment to reference sequences for archaea, but I am still getting some alignment. However, despite some alignment occurring no final files are generated for (I assume) the archaea domain. Taken together, I have two questions:
1) Is it unusual to have no archaea outputs from picrust2 for v3-v4 sequencing despite having some alignment to reference sequences? I am inexperienced in the archaea realm, but a quick look online suggested that v3-v4 regions should sequence archaea too. My experiment is exploring bacteria specifically, so I'm not concerned about losing archaea if none were sequenced, but I wanted to make sure that this isn't a sign that something else is wrong (especially given that some alignment to reference sequences is indeed occurring).
2) I have quite different alignment outcomes from Experiment 1 and Experiment 2, and I'm wondering if the amount being dropped in Experiment 1 is concerning? I'm planning to go back over my initial filtering to make sure that I have no mitochondria or chloroplasts sneaking in, but I was curious if there is a rule of thumb for how many input sequences to expect to be filtered out.
Thank you so much in advance for your help!!
Zoë