join_paired_ends:perc_max_diff 80
$ pick_open_reference_otus.py -i SplitLib/seqs.fna -o otus
# default database would be greengenes...should I try against another database?
$ cp otus/otu_table_mc2_w_tax_no_pynast_failures.biom output.biom
$ core_diversity_analyses.py -o core -i output.biom -m Map_File.txt -t otus/rep_set.tre -e 10569
Thanks in advance for your help,
Matt
1) At the phylum level (L2), 14% (range 3-29%) of the reads map to "Unassigned;Other"
2) When I look at the species (L6) level it looks like I have lots of otus (44 out of 154 total otus) that don't appear to be "fully mapped" (apologies for poor terminology)
Is there a level of cutoff that these low unknown otus should be excluded?
does this mean I cannot look down to the genus level for these otus?
k__Bacteria;p__Bacteroidetes;c__Bacteroidia;o__Bacteroidales;f__S24-7;g__ (mean 31% of reads map to this OTU)
1) At the phylum level (L2), 14% (range 3-29%) of the reads map to "Unassigned;Other"Because this mouse microbiome is relatively well characterized, this number sounds a little high. Because you are using 'no_pynast_failures' and default settings, we know that all remaining OTUs are > 70% similar to something in greengenes. But only getting assignments to the kingdom level is a little strange.Also, 14% of all your reads, or 14% of your OTUs. Having lots of low quality OTUs without good taxonomy is common. Having lots of unknown reads is more unusual.
My concern is that having a high % of unassigned reads in some samples will 'push down' the relative percentages of other bacteria (the flip side of this is that if I filter out the unassigned reads, then those that had high % unassigned reads may have the other bacteria relative abundance inflated). Worryingly the % of unassigned reads is higher in some groups than others, meaning which option I choose could have a significant impact on results (see bar graph)
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq$ cat > splitlibparam.txt
multiple_split_libraries_fastq:phred_quality_threshold 19
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq$ multiple_split_libraries_fastq.py -i dbdb_paired2 -o REDONE/SplitLib --include_input_dir_path --remove_filepath_in_name -p splitlibparam.txt
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE$ pick_open_reference_otus.py -i SplitLib/seqs.fna -o otus
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE$ cp otus/otu_table_mc2_w_tax_no_pynast_failures.biom output.biom
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE$ biom summarize_table –i output.biom –o tablesummary.txt
# Note: lowest sample count is 10566
# doing core_diversity analysis:
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE$ core_diversity_analyses.py -o core -i output.biom -m Map.txt -t otus/rep_set.tre -e 10566
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq$ multiple_split_libraries_fastq.py -i dbdb_paired2 -o REDONE3/SplitLib --include_input_dir_path --remove_filepath_in_name -p param.txt
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE3$ pick_open_reference_otus.py -i SplitLib/seqs.fna -o otus
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE3$ cp otus/otu_table_mc2_w_tax_no_pynast_failures.biom output.biom
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE3$ biom summarize_table –i output.biom –o tablesummary.txt
# Note: lowest sample count is 10760
# doing core_diversity analysis:
qiime@qiime-190-virtual-box:~/Desktop/dbdb_fastq/REDONE$ core_diversity_analyses.py -o core -i output.biom -m Map.txt -t otus/rep_set.tre -e 10760