python forBo.py seqs_sorted.fasta assignments.txt GammaproteobacteriaTraceback (most recent call last):
File "o-trim", line 21, in <module>
trim_from = int(sys.argv[2])
IndexError: list index out of range
Swathi
o-trim FASTA_FILE 0 200
o-trim FASTA_FILE 20
I decided to start a discussion topic, since the title is quite a common question.
This is my recipe to analyze a dataset with the oligotyping pipeline.
- Get all reads for all samples in a project into one FASTA file, and format the deflines to meet the standard explained here.
- Use RDP or GAST to assign taxonomy to each read individually. If you are using QIIME and exporting your FASTA files for a given taxon via QIIME, be very careful: QIIME assigns taxonomy to representative sequences of OTUs, but you definitely don't want that, because generally OTUs are phylogenetically mixed units. Anyone who wants do a robust oligotyping analysis should assign taxonomy to their reads individually before selecting a group of reads.
- Filter reads that are identified as the genus of interest and put them into one file.
- If you are working with 454 or Use PyNAST to align these reads against GreenGenes template. If you don't want to spend too much time, extract templates from the GreenGenes alignment for a given genus or family using this script (it is under the Scripts directory in your copy of the codebase). If you are working with Illumina HiSeq or MiSeq reads, skip the alignment step, but run this script on your FASTA file to accommodate for any length variation.
- Remove common gaps from the resulting alignment file using this script (it is also under the Scripts directory in your copy of the codebase). Don't forget to check the number of reads in the failed alignments file to make sure there aren't too many reads failed the alignment step.
- Run the entropy-analysis and start oligotyping.
- Repeat the last three steps for each genera you wish to oligotype from the dataset.
--
You received this message because you are subscribed to the Google Groups "Oligotyping and MED" group.
To unsubscribe from this group and stop receiving emails from it, send an email to oligotyping...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/oligotyping/2f5ba5bf-1abc-4d20-8269-abf49abda947%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.