Dear Dr. Torbjørn,
Thanks for your prompt reply.
We are following the QIIME and UPARSE pipelines in our lab, where OTU binning is performed after QA/QC and Chimera detection. Till date, for smaller number of samples, 20-30 individuals, the usearch61 was used to perform chimera detection which used to take 2-3 hours on single core processor. However, usearch61 was unable to perform Chimera detection for file size more than 4 GB, where we started using vsearch. Right now, we have 100 samples in batches for multiple projects, but the chimera detection part has become a bottleneck in analysis. You suggested performing OTU clustering before chimera detection, both reference based as well as denovo. Will it not run the risk of underestimation of Chimeric reads and effect the diversity estimates like shannon and chao indices when performed only on Representative sequences? Can you forward some papers where you have successfully used this method so that I can discuss with my lab members and PI for justifying our deviation from QIIME pipeline and following the new method as proposed by you? This will be of immense help from your side.
Thank you in advance. Looking forward for your reply.
-Naina