Hi Jose,
I re-ran the pick_reference_otus_through_otu_table.py with the lower
threshold (91%). And, below is the output.
qiime@qiime-VirtualBox:~/Desktop$ rm -rf ucrC ;
pick_reference_otus_through_otu_table.py -i '/home/qiime/Desktop/
Shared_Folder/06272012_MiSeq_Analysis/sl_out/seqs.fna' -o ucrC/ -r '/
home/qiime/qiime_software/gg_otus-4feb2011-release/rep_set/
gg_91_otus_4feb2011.fasta' -t '/home/qiime/qiime_software/
gg_otus-4feb2011-release/taxonomies/greengenes_tax.txt' -p '/home/
qiime/Desktop/Shared_Folder/06272012_MiSeq_Analysis/
07032012_91percent_parameters.txt'
qiime@qiime-VirtualBox:~/Desktop$ per_library_stats.py -i '/home/qiime/
Desktop/ucrC/uclust_ref_picked_otus/otu_table.biom' Num samples: 30
Num otus: 989
Num observations (sequences): 35780.0
Seqs/sample summary:
Min: 484.0
Max: 2331.0
Median: 1185.5
Mean: 1192.66666667
Std. dev.: 482.015721274
Median Absolute Deviation: 392.5
Default even sampling depth in
core_qiime_analyses.py (just a suggestion): 484.0
Seqs/sample detail:
DNA.2: 484.0
DNA.11: 503.0
DNA.1: 515.0
DNA.6: 518.0
DNA.5: 656.0
DNA.24: 663.0
DNA.14: 785.0
DNA.9: 840.0
DNA.25: 869.0
DNA.28: 899.0
DNA.23: 901.0
DNA.7: 950.0
DNA.18: 959.0
DNA.29: 1140.0
DNA.20: 1167.0
DNA.26: 1204.0
DNA.3: 1206.0
DNA.17: 1245.0
DNA.16: 1288.0
DNA.4: 1319.0
DNA.13: 1445.0
DNA.12: 1570.0
DNA.22: 1660.0
DNA.19: 1687.0
DNA.27: 1701.0
DNA.21: 1748.0
DNA.8: 1837.0
DNA.30: 1840.0
DNA.10: 1850.0
DNA.15: 2331.0
This was an increase is sequence number per sample over what I
previously posted. So, I ran pick_otus.py using uclust_ref and the 97%
gg_otus .fasta file without suppressing new clusters. I ran the output
of this through assign_taxonomy.py and make_otu_table.py. Then, I put
the .biom file through per_library_stats.py and got the following
output:
qiime@qiime-VirtualBox:~/Desktop$ per_library_stats.py -i '/home/qiime/
Desktop/otu_table.biom'
Num samples: 30
Num otus: 28468
Num observations (sequences): 64972.0
Seqs/sample summary:
Min: 833.0
Max: 4189.0
Median: 2129.0
Mean: 2165.73333333
Std. dev.: 874.043398363
Median Absolute Deviation: 718.0
Default even sampling depth in
core_qiime_analyses.py (just a suggestion): 833.0
Seqs/sample detail:
DNA.2: 833.0
DNA.11: 899.0
DNA.6: 900.0
DNA.1: 970.0
DNA.5: 1157.0
DNA.24: 1269.0
DNA.14: 1423.0
DNA.9: 1570.0
DNA.28: 1594.0
DNA.25: 1630.0
DNA.23: 1654.0
DNA.7: 1692.0
DNA.18: 1760.0
DNA.29: 2009.0
DNA.26: 2118.0
DNA.20: 2140.0
DNA.3: 2143.0
DNA.4: 2354.0
DNA.17: 2390.0
DNA.16: 2422.0
DNA.13: 2642.0
DNA.12: 2859.0
DNA.22: 2985.0
DNA.19: 3025.0
DNA.21: 3114.0
DNA.8: 3255.0
DNA.30: 3295.0
DNA.27: 3311.0
DNA.10: 3370.0
DNA.15: 4189.0
It seems like this will solve the problem of losing sequence data when
generating rarefaction plots. Just to make sure I understand, Is it
that my sequences that did not match the reference dataset but still
passed the quality filtering steps were still 16S (not junk) sequences
since they were picked up when I dropped the identity threshold? They
just didn't happen to match anything in the reference dataset I was
initially using to generate clusters and my rep_set?
Also, I took the output taxonomy assignments generated in
assign_taxonomy.py and used them in make_otu_table.py, along with the
output from pick_otus.py, to generate an OTU table and generate a
heatmap. But, I must be doing something wrong because when I look at
the heatmap, the taxonomy is "None" which is coming from the
pick_otus.py output file (seqs_otus.txt). I thought those would be
replaced with taxonomic assignments provided in -t of
make_otu_table.py, but they are not. I checked, and the taxonomic
assignments file does have taxonomic assignments listed in it. Is
there something I am not specifying correctly? Below is the code I am
using:
pick_otus.py -i '/home/qiime/Desktop/Shared_Folder/
06272012_MiSeq_Analysis/sl_out/seqs.fna' -m uclust_ref -r '/home/qiime/
qiime_software/gg_otus-4feb2011-release/rep_set/
gg_97_otus_4feb2011.fasta'
assign_taxonomy.py -i '/home/qiime/Desktop/Shared_Folder/
06272012_MiSeq_Analysis/sl_out/seqs.fna'
make_otu_table.py -i '/home/qiime/Desktop/
uclust_ref_picked_otus_newclusters/seqs_otus.txt' -o otu_table.biom -t
'/home/qiime/Desktop/rdp22_assigned_taxonomy/
seqs_tax_assignments.txt'
make_otu_heatmap_html.py -i '/home/qiime/Desktop/otu_table.biom' -o
OTU_Heatmap
Thank you again,
Molli
On Jul 2, 10:15 pm, Jose Carlos Clemente <
jose.cleme...@gmail.com>
wrote:
> Hi Molli,
>