Hi there, new QIIME user here following the Amplicon SOP.
Like
many others, I'm running into issues when assigning taxonomy and I
believe it's memory related. The job gets killed after 5 minutes and
I've reduced --p-reads-per-batch --p-n-jobs. There is 16303708 kB of RAM on this linux box. I'm not sure what else I
can do. Should I decrease --p-reads-per-batch 1000 even more? What would
be a reasonable batch size?
qiime feature-classifier classify-sklearn
--i-classifier /home/Downloads/silva-138-99-nb-classifier.qza
--i-reads /home/Rocks/outputs/qza_intermediates/rocks16S_rep_seqs.qza
--o-classification rocks16S_taxonomy.qza
--p-reads-per-batch 1000
--p-n-jobs 1
--p-confidence 1
--verbose &> 16S_classify_verbose.log & disown
I downloaded the silva-138-99-nb-classifier.qza from the QIIME data resources page but I saw that on your Virtual Box Amplicon SOP (
https://github.com/LangilleLab/microbiome_helper/wiki/Amplicon-SOP-v2-(qiime2-2022.11)) there is a trained classifier on the 16S V4/V5 region (
classifier_silva_132_99_16S_V4.V5_515F_926R.qza), which is what we sequenced. Is it possible to download this externally without going through the virtual box?
Thank you.