align_seqs:template_fp /home/tatyanaz/core_alignment_SILVA128.fna
assign_taxonomy:reference_seqs_fp /home/tatyanaz/97_otus_16S.fasta
assign_taxonomy:id_to_taxonomy_fp /home/tatyanaz/consensus_taxonomy_7_levels.txt
In both cases (with GreenGenes and with SILVA), I had prefilter_percent_id set to 0.0.
For GG,I did not include any parameters and just ran pick_open_reference_otus.py as is :
pick_open_reference_otus.py -i /home/tatyanaz/fastq1_newsplit/seqs.fna -o /home/tatyanaz/fastq1_newpickotu --prefilter_percent_id 0.0
When I compared the overall # OTUS found within the sequences of my samples, I noticed that certain samples appeared to have significantly different results based on whether I had used GreenGenes or SILVA- for instance, one sample (37) using GG had 113554.0 Counts/Sample and with SILVA only had 79936.0 Counts/Sample.
Is this normal? What could be causing this discrepancy? Is it due to the different databases or is there something I am missing in the parameters for either SILVA or GG? Please let me know if you know how to solve this issue or have come across the same issue.
Thanks!