Hello,
I am very new to metagenomics and would very much appreciate any advice/help I can get. I have results (fasta file) from an Ion Torrent metagenomics analysis by Ion Reporter. I would like to ask how to do alpha and beta diversity analysis on these results.
The details of the Ion Torrent and Ion reporter software can be read here:
To summarize, the Ion Reporter software generates “fasta” file output for a given sample that gives the species level identification. I want to take this one fasta file that has the species level identification of the reads and perform an alpha and beta diversity analysis using QIIME. Any suggestions on how can one do alpha and beta diversity analysis using this one fasta file?
I am reading the QIIME help. I have installed QIIME and currently I am currently going through the QIIME tutorials. I saw that the scripts required for doing alpha and beta diversity analysis need OTU tables and mapping files. Is it possible to generate OTUs and mapping table for this one fasta file I have? The content of the fasta file format resulted from Ion Reporter looks like the text I am pasting below.
example.fasta output from Ion Reporter. (This is the file I want to use as input to alpha and beta diversity analysis in QIIME):
>MG|11|100.0|Bacteroidetes|Bacteroidia|Bacteroidales|Bacteroidaceae|Bacteroides|vulgatus|
CACGTATCCAACCTGCCGTCTACTCTTGGATAGCCTTCTGAAAGGAAGATTAATACAAGATGGCATCATGAGTCCGCATGTTCACATGATTAAAGGTATTCCGGTAGACGATGGGGATGCGTTCCATTAGATAGTAGGCGGGGTAACGGCCCACCTAGTCTTCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
>MG|13|99.55|Bacteroidetes|Bacteroidia|Bacteroidales|Bacteroidaceae|Bacteroides|vulgatus|
TTGGACCGTGTCTCAGTTCCAATGTGGGGGACCTTCCTCTCAGAACCCCTATCCATCGAAGACTAGGTGGGCCGTTACCCCGCCTACTATCTAATGGAACGCATCCCCATCGTCTACCGGAATACCTTTAATCATGTGAACATGCGGACTCATGATGCCATCTTGTACTTAATCTTCCTTTCAGAAGGCTGTCCAAGAGTAGACGGCAGGTTGGATACGTG
…
I would very much appreciate any advice and suggestions you may give.
Thank you,
Julie
--
---
You received this message because you are subscribed to the Google Groups "Qiime Forum" group.
To unsubscribe from this group and stop receiving emails from it, send an email to qiime-forum...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
---
You received this message because you are subscribed to a topic in the Google Groups "Qiime Forum" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/qiime-forum/f_xBCbRnjdc/unsubscribe.
To unsubscribe from this group and all its topics, send an email to qiime-forum...@googlegroups.com.
Hi Adam,
Thank you very much again for your quick reply and all your help to me. I followed the steps that you suggested in your last email. Here is a summary of what I did so far:
Step 1. First, I generated a map file as below where I named the sample name as S1:
vi map.txt:
SampleID BarcodeSequence LinkerPrimerSequence Treatment DOB Description
S1 S1-Demo NA NA
Step 2) Then, I named my fasta file above as S1 and I put it under a fasta directory named "fasta_dir. I ran to command below to generate fasta file with QIIME labels:
add_qiime_labels.py -i fasta_dir/ -m map.txt -c SampleID
Step 3) The command "add_qiime_labels.py" in Step2 above, generated a file named "combined_seqs.fna" which is the QIIME version of fasta file with labels added to it in the fasta header.
Step 4) Then, I took the fna file from step3 above named "combined_seqs.fna" and then I ran the command "pick_open_reference_otus.py" using the with the new_refseqs.fna file that I found under the illumina tutorial files. (I couldn't find the another file named "refseqs.fna".) The command below worked with "new_refseqs.fna" file:
pick_open_reference_otus.py -i /home/QIIMETEST/combined_seqs.fna -r /home/QIIME_tutorial/moving_pictures_tutorial-1.9.0/illumina/precomputed-output/otus/new_refseqs.fna -o /home/output --suppress_taxonomy_assignment --suppress_align_and_tree
Step 5) The command in step4 generated a BIOM file named otu_table_mc2.biom. I took this file as my OTU biom table. Then, I ran the alpha diversity script as below.
alpha_diversity.py -i /home/otu_table_mc2.biom -m chao1 -o outputAlphaDiversity.txt
The alpha diversity script above generates a file with a value of 62.
vi outputAlphaDiversity.txt:
chao1
S1 62
How do I interpret this number of 62? Does this mean that there are 62 different species in the sample fasta and my alpha diversity prediction is 62 by QIIME? However, my input sample contained 20 species. So, why does QIIME generate a value of 62? If you could please give me some insight on this when you have a chance, I would really appreciate it.
Thank you.
Best,
Julie
ps: I also noticed that my OTU biom file contains 62 rows. Does the alpha diversity number always equal to the number of elements in the OTU table? Is this expected? Thank you.