Hello, this is my first time using the QIIME denoising scripts to denoise my 454 data. I am following the instructions here (http://qiime.org/tutorials/denoising_454_data.html).
1. I ran denoise_wrapper.py, for example:
denoise_wrapper.py -v -i run1.sff.txt -f run1/seqs.fna -o run1/denoised/ -m run1_mapping.txt
The output gives me three FASTA files: centroids.fasta, singletons.fasta, and denoised_seqs.fasta.
2. According to the workflow, I need to run inflate_denoiser_output.py (e.g., inflate_denoiser_output.py -c centroids.fna -s singletons.fna -f seqs.fna -d denoiser_mapping.txt -o denoised_seqs.fna)
I have several questions about running inflate_denoiser_output.py.
First, this script seems to put the singletons and centroids together. Why do we need to run inflate_denoiser_output.py? Can I use denoised_seqs.fasta, which comes from denoise_wrapper.py, directly to build the OTU table? What is the function of this file? Is this file equal to centroids.fasta + singletons.fasta?
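A minimal sketch of how the last part of this question could be checked by hand, by comparing the sequence IDs in the three files. The helper names fasta_ids and is_union are made up for illustration, and the sketch assumes that the ID is the first whitespace-delimited token after ">" in each header and that IDs are written the same way in all three files; that may not hold for the real denoiser output.

```python
import sys


def fasta_ids(path):
    """Return the set of record IDs in a FASTA file.

    Assumption: the ID is the first whitespace-delimited token after '>'.
    """
    ids = set()
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                fields = line[1:].split()
                if fields:
                    ids.add(fields[0])
    return ids


def is_union(denoised, centroids, singletons):
    """True if the IDs in `denoised` are exactly centroids plus singletons."""
    return fasta_ids(denoised) == fasta_ids(centroids) | fasta_ids(singletons)


if __name__ == "__main__" and len(sys.argv) == 4:
    # Hypothetical usage:
    #   python check_union.py denoised_seqs.fasta centroids.fasta singletons.fasta
    print(is_union(sys.argv[1], sys.argv[2], sys.argv[3]))
```

This only compares IDs, not the sequences themselves, so a True result shows that the same records are present, not that the sequences are identical.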
Second, after running inflate_denoiser_output.py, I checked the number of sequences in the output. I was surprised that the number of sequences in the output FASTA file is the same as the number of sequences in the FASTA file before denoising (seqs.fna). I don't know why this would happen. I used to use AmpliconNoise to denoise, and with it the read counts before and after denoising are quite different.
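For anyone who wants to reproduce the comparison, here is a minimal sketch of counting the sequences in each file by counting header lines. The function name count_fasta_records is made up for illustration, and the file names in the usage comment are the ones from the commands above.

```python
import sys


def count_fasta_records(path):
    """Count FASTA records by counting lines that start with '>'."""
    with open(path) as handle:
        return sum(1 for line in handle if line.startswith(">"))


if __name__ == "__main__":
    # Hypothetical usage:
    #   python count_seqs.py seqs.fna denoised_seqs.fna
    for fasta_path in sys.argv[1:]:
        print(fasta_path, count_fasta_records(fasta_path))
```

The count is of records (header lines), so it is unaffected by how the sequence lines are wrapped.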
Third, the instructions suggest picking OTUs from the output of inflate_denoiser_output.py with the --optimal option (http://qiime.org/tutorials/denoising_454_data.html). Can anyone tell me the difference between running with and without this option? I have never used it before. Does this mean I must use --optimal if my input files come from the QIIME denoising workflow (i.e. denoise_wrapper.py + inflate_denoiser_output.py)?
Thanks,
Ben