sample-id
|
input
|
primer-removed
|
percentage of input primer-removed
|
filtered
|
percentage of input passed filter
|
denoised
|
non-chimeric
|
percentage of input non-chimeric
|
#q2:types
|
numeric
|
numeric
|
numeric
|
numeric
|
numeric
|
numeric
|
numeric
|
numeric
|
KK1QH167EC6
|
161
|
78
|
48.45
|
31
|
19.25
|
1
|
1
|
0.62
|
KK1791K6395
|
197
|
93
|
47.21
|
38
|
19.29
|
3
|
3
|
Jan-52
|
KK15K0FFPT8
|
157
|
98
|
62.42
|
35
|
22.29
|
9
|
9
|
Mai 73
|
KX1A8LCNE28
|
1560
|
1359
|
87.12
|
1135
|
72.76
|
854
|
854
|
54.74
|
KK1K5R4PCX3
|
7812
|
7136
|
91.35
|
5727
|
73.31
|
3515
|
3498
|
44.78
|
KK1R99NVGR0
|
15044
|
13597
|
90.38
|
10638
|
70.71
|
5190
|
5175
|
34.4
|
10,000 reads is a quite large cutoff, especially for PacBio 16S which tends to be lower depth than Illumina MiSeq.
As for the read loss at the DADA2 step - this is not an uncommon phenomenon (see the author of DADA2 discussing it here: https://github.com/benjjneb/dada2/issues/1164) and many of our clients, and ourselves when process ours+client data, have noticed it too (to be clear, this can happen with Illumina reads as well, depending on community makeup). It has less to do with the quality of the reads, though at extremes that can influence things, but it seems more related to community composition.
In fact, you are seeing the same general phenomenon that we see as well - for some reason, the ITS seems to perform better after DADA2 and then the 16S + 18S are a bit worse...we usually see around 50% of reads left after DADA2 with those latter ones (ie: so up to 50% loss at times).
ANDRÉ M. COMEAU, PhD |
Address for deliveries: |
|
Research Associate (Lab Manager) Morgan
Langille Lab • Dept. of Pharmacology |
||
"Without fantasy, there is no science. Without fact, there is no art." - Nabokov |
|