Hi,
I have been trying to use the identify chimera script to remove chimeras. Chimera Slayer is not working - the txt file was empty after running it- I was reaching out to confirm if I was using the correct input files (-i rep_set_aligned.fasta (pynast aligned folder) and -a rep_set.fna)?
I also just tried running it prior to clustering with USearch like you recommended. I was able to get it to work without a reference set but when I tried to run it with a reference set it did not work- fatal error due to not having enough memory (see below):
$ identify_chimeric_seqs.py -i seqs.fna -m usearch61 -o usearch_checked_chimeras_Silva99ref/ -r /Users/Sara_Jeanne/Desktop/QIIME/SILVA123_QIIME_release/rep_set/rep_set_all/99/99_otus.fasta
Traceback (most recent call last):
File "/macqiime/anaconda/bin/identify_chimeric_seqs.py", line 354, in <module>
main()
File "/macqiime/anaconda/bin/identify_chimeric_seqs.py", line 350, in main
threads=threads)
File "/macqiime/anaconda/lib/python2.7/site-packages/qiime/identify_chimeric_seqs.py", line 774, in usearch61_chimera_check
log_lines, verbose, threads)
File "/macqiime/anaconda/lib/python2.7/site-packages/qiime/identify_chimeric_seqs.py", line 961, in identify_chimeras_usearch61
HALT_EXEC=HALT_EXEC)
File "/macqiime/anaconda/lib/python2.7/site-packages/bfillings/usearch.py", line 2411, in usearch61_chimera_check_ref
app_result = app()
File "/macqiime/anaconda/lib/python2.7/site-packages/burrito/util.py", line 285, in __call__
'StdErr:\n%s\n' % open(errfile).read())
burrito.util.ApplicationError: Unacceptable application exit status: 1
Command:
cd "/Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/"; usearch61 --mindiffs 3 --uchime_ref "/Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_consensus_fixed.fasta" --minh 0.28 --xn 8.0 --minseqlength 64 --threads 0.5 --mindiv 0.8 --uchimeout "/Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_chimeras_ref.uchime" --dn 1.4 --strand plus --db "/Users/Sara_Jeanne/Desktop/QIIME/SILVA123_QIIME_release/rep_set/rep_set_all/99/99_otus.fasta" --log "/Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_chimeras_ref.log" > "/tmp/tmp1ayFh13eW2nmNUfGRkAR.txt" 2> "/tmp/tmpHaRsZdDNwDW9wE96bAYa.txt"
StdOut:
usearch_i86osx32 v6.1.544, 4.0Gb RAM (8.6Gb total), 2 cores
(C) Copyright 2010-12 Robert C. Edgar, all rights reserved.
http://drive5.com/usearchStdErr:
00:00 966.7kb Reading /Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_consensus_fixed.fasta, 10Mb
00:00 11Mb 28993 (29.0k) seqs, min 184, avg 309, max 493nt
00:01 11Mb Reading /Users/Sara_Jeanne/Desktop/QIIME/SILVA123_QIIME_release/rep_set/rep_set_all/99/99_otus.fasta, 781Mb
00:39 782Mb 537233 (537.2k) seqs, min 900, avg 1436, max 3674nt
01:23 788Mb 100.0% Masking
02:00 789Mb 100.0% Word stats
usearch61(59351,0xa10771d4) malloc: *** mach_vm_map(size=8388608) failed (error code=3)
*** error: can't allocate region securely
*** set a breakpoint in malloc_error_break to debug
Out of memory mymalloc(10952), curr 1.04e+09 bytes
usearch61 --mindiffs 3 --uchime_ref /Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_consensus_fixed.fasta --minh 0.28 --xn 8.0 --minseqlength 64 --threads 0.5 --mindiv 0.8 --uchimeout /Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_chimeras_ref.uchime --dn 1.4 --strand plus --db /Users/Sara_Jeanne/Desktop/QIIME/SILVA123_QIIME_release/rep_set/rep_set_all/99/99_otus.fasta --log /Users/Sara_Jeanne/Desktop/QIIME/Split_Lib_FullReads_attachedBC_q20/usearch_checked_chimeras_Silva99ref/seqs.fna_chimeras_ref.log
---Fatal error---
Out of memory, mymalloc(10952), curr 1.04e+09 bytes
Is there any way to work around this so I can do this on my local machine? I was able to get it to work with the gg set but I prefer to use a more updated reference set.
thank you,
Sara