Groups keyboard shortcuts have been updated
Dismiss
See shortcuts

Chimera checking, analysis of Sanger sequencing data in qiime

209 views
Skip to first unread message

Masanobu Hiraoka

unread,
Nov 5, 2021, 3:38:47 PM11/5/21
to Qiime 1 Forum
Hello, everyone,

I would like to analyze microbiome at human upper respiratory way with qiime pipeline, especially ANCOM that was read by Sanger sequencing(clone library method).

To make data more reliable, I am thinking of excluding chimera sequences.
When I do chimera checking, what and when is the best way? ChimeraSlayer, uchime or usearch? Before clustering(qiime1) or after clustering(qiime2)?

I was suggested some processing in this Forum before(post split fasta file import), and succeeded in converting data into qiime2 pipeline.

After I imported data to qiime2, I did the clustering(q2-vsearch) at 99.6% identity, exclude singleton, mitochondria and chloroplast. And further statistical analyses were performed with rarefied artifacts(commands are not shown).

Thank you
//Masanobu

commands;

Qiime1;

add_qiime_labels.py -i fasta_dir -m example_mapping4.txt -c InputFileName -o combined_fasta

validate_mapping_file.py -m example_mapping4.txt -o validate_mapping_file_output  

Qiime2;

qiime tools import \
 --input-path Raw_Files/combined_seqs.fna \
 --output-path Raw_Files/seqs.qza \
 --type 'SampleData[Sequences]'

qiime vsearch dereplicate-sequences \
  --i-sequences Raw_Files/seqs.qza \
  --o-dereplicated-table Raw_Files/table.qza \
  --o-dereplicated-sequences Raw_Files/rep-seqs.qza

qiime vsearch cluster-features-de-novo \
  --i-table Raw_Files/table.qza \
  --i-sequences Raw_Files/rep-seqs.qza \
  --p-perc-identity 0.996 \
  --o-clustered-table Raw_Files/table-dn-996.qza \
  --o-clustered-sequences Raw_Files/rep-seqs-dn-996.qza

qiime phylogeny align-to-tree-mafft-fasttree \
  --i-sequences Raw_Files/rep-seqs-dn-996.qza \
  --o-alignment Raw_Files/aligned-rep-seqs.qza \
  --o-masked-alignment Raw_Files/masked-aligned-rep-seqs.qza \
  --o-tree Raw_Files/unrooted-tree.qza \
  --o-rooted-tree Raw_Files/rooted-tree.qza

#Removing reads that don't appear in at least two samples

qiime feature-table filter-features \
  --i-table Raw_Files/table-dn-996.qza \
  --p-min-samples 2 \
  --o-filtered-table Results_Files/table-dn-996.qza

#Removing the mitochondria and chloroplasts;silva138.1full

qiime taxa filter-table \
  --i-table Raw_Files/table-dn-996.qza  \
  --i-taxonomy Raw_Files/full_taxonomy.qza  \
  --p-exclude mitochondria,chloroplast  \
  --o-filtered-table Results_Files/full_table-dn-996_Removed.qza
  

ref;
This is my history of commands list.


TonyWalters

unread,
Nov 5, 2021, 6:11:24 PM11/5/21
to Qiime 1 Forum
Hello,

Although I haven't looked at the benchmarks recently, Chimeraslayer hasn't been updated in quite some time, so I would go with the vsearch option that's built into QIIME2.

See this tutorial:
https://docs.qiime2.org/2021.8/tutorials/chimera/

It looks like you would be able to do the chimera checking after your  qiime vsearch cluster-features-de-novo command (and you'd want to modify the commands after this to use the chimera filtered sequence artifact file).

-Tony

Masanobu Hiraoka

unread,
Nov 5, 2021, 7:37:39 PM11/5/21
to Qiime 1 Forum
Thank you for replying!
I missed the tutorial of chimera checking in qiime2!

OK, I will do the command "qiime vsearch uchime-denovo" after clustering.
If I have any questions, I'll ask them on qiime2.

// Masanobu
Reply all
Reply to author
Forward
0 new messages