Hello,
I am using VSEARCH to process microbiome sequencing data. I have many different samples, and they are all in different files. I have successfully made OTU tables for each of my samples (referenced against the filtered, dereplicated, de-chimera-ed aggregate OTUs from all samples). However, the sample ID in each of my tables doesn't contain the full sample name, but truncates it at the first "-", resulting in many tables with the same sample label.
For example, this is the top of the OTU table for my first sample. As you can see, the sample is identified as "C1". However, the full name of the sample should be "C1-1-rhizo-ITS1". I have several other samples that start with the same characters (for example, "C1-1-soil-ITS1", "C1-2-rhizo-ITS1", etc.). Based on the VSEARCH documentation, I'm guessing that the sample ID label is being truncated at the first hyphen. Is there something I can do to work around this?
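To make the suspected behaviour concrete, here is a small shell sketch that mimics cutting a label at its first hyphen. It does not call VSEARCH, and `truncate_at_hyphen` is a hypothetical helper written only for this illustration; the three names are the samples mentioned above.

```shell
# Hypothetical helper: keep only the part of a name before the first hyphen.
truncate_at_hyphen() {
    printf '%s\n' "${1%%-*}"
}

# Show what each full sample name collapses to under that rule.
for name in C1-1-rhizo-ITS1 C1-1-soil-ITS1 C1-2-rhizo-ITS1; do
    printf '%s -> %s\n' "$name" "$(truncate_at_hyphen "$name")"
done
```

All three names collapse to the same prefix, which would explain why several tables end up with an identical sample label.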
In case it's helpful, this is the code I used to make the OTU tables:
for i in *_filtered.fasta; do
    vsearch --usearch_global "${i}" --db zotus_nochime.fa --id 0.97 --otutabout "OTU_tables/${i/_*}_otu_counts_its.txt"
done
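One possible workaround, sketched here as an untested assumption rather than a confirmed fix: if VSEARCH picks up a `;sample=NAME;` annotation in the query headers (which is how the manual's description of OTU table output reads to me), the full sample name could be appended to every header before the search. `add_sample_label` and the `relabeled/` directory are hypothetical names introduced for this sketch.

```shell
# Hypothetical helper (not part of VSEARCH): append ";sample=<name>;" to each
# FASTA header line so the full sample name travels with every read.
# A trailing ";" already on the header is dropped first to avoid ";;".
add_sample_label() {
    local sample="$1" fasta="$2"
    awk -v s="$sample" '
        /^>/ { sub(/;$/, ""); print $0 ";sample=" s ";"; next }
             { print }
    ' "$fasta"
}

# Intended use with the loop above (assumes vsearch is installed and that the
# relabeled/ and OTU_tables/ directories exist):
# for i in *_filtered.fasta; do
#     name="${i/_*}"
#     add_sample_label "$name" "$i" > "relabeled/${name}.fasta"
#     vsearch --usearch_global "relabeled/${name}.fasta" --db zotus_nochime.fa \
#         --id 0.97 --otutabout "OTU_tables/${name}_otu_counts_its.txt"
# done
```

Whether the hyphens survive inside the annotation still needs to be checked against the resulting table, since this depends on how VSEARCH parses the `sample=` value.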