Hi Greg,
Sure, here are the commands I gave, broken down into a two-stage process: 1) building a plant reference library, and 2) finding those plant sequences in herbivore dung. (Note: the chloroplast sequence we amplified was the trnL intron, which has a sequence length ranging from 10 to 150 bp).
1) Building plant reference library
split_libraries.py -m PlantMapping.txt -f Plant454Reads.fna -q Plant454Reads.qual -l 10 -z truncate_only
pick_otus.py -i plant_seqs.fna -m uclust -o plant_otus --min_aligned_percent 0.5 -s 1.0 -e 1e-10
pick_rep_set.py -i plant_otus.txt -f plant_seqs.fna -m most_abundant -o plant_rep_set.fasta
--------------------------------------------------------------
2) Finding plant seqs in fecal samples
split_libraries.py -m FecalMapping.txt -f Fecal454Read.fna -q Fecal454Read.qual -l 10 -z truncate_only
pick_otus.py -i fecal_seqs.fna -m blast -o fecal_otus -r plant_rep_set.fasta --min_aligned_percent 0.5 -s 0.98 -e 1e-10
pick_rep_set.py -i fecal_otus.txt -f fecal_seqs.fna -m most_abundant -o fecal_rep_set.fasta -r plant_rep_set.fasta
assign_taxonomy.py -i fecal_rep_set.fasta -t plant_taxonomy.txt -r plant_rep_set.fasta -m blast -e 0.001 -o fecal_taxonomy
make_otu_table.py -i fecal_otus.txt -o fecal_taxon_98.biom -t plant_taxonomy.txt
convert_biom.py -i fecal_taxon_98.biom -o fecal_taxon_98.txt -b --header_key taxonomy --output_metadata_id "ConsensusLineage"
--------------------------------------------------------------
Well, that's it. Please let me know what went wrong, or at least, what could be improved.
Thanks very much,
Gaddy