Hi all,
I'm stumped on an error, which seems to indicate that Emirge cant open the fasta index file;
emirge.py . -1 forward.fastq -2 reverse.fastq -f SSURef_111_candidate_db_fixed.fasta -b SSURef_111_candidate_dbindex. --mapping test -l 100 -i 110 -s 30 --phred33
If you use EMIRGE in your work, please cite these manuscripts, as appropriate.
imported _emirge C functions from: /usr/local/lib/python2.7/dist-packages/_emirge.so
Command:
/usr/local/bin/emirge.py . -1 forward.fastq -2 reverse.fastq -f SSURef_111_candidate_db_fixed.fasta -b SSURef_111_candidate_dbindex. --mapping test -l 100 -i 110 -s 30 --phred33
EMIRGE started at Sun May 17 19:04:17 2015
Beginning initialization at Sun May 17 19:04:17 2015...
Reading bam file /home/james/Documents/EMIRGE/test at Sun May 17 19:04:17 2015...
DONE Reading bam file /home/james/Documents/EMIRGE/test at Sun May 17 19:04:25 2015 [0:00:08.102563]...
DONE with initialization at Sun May 17 19:04:25 2015...
Starting iteration 0 at Sun May 17 19:04:25 2015...
Reading bam file /home/james/Documents/EMIRGE/test at Sun May 17 19:04:25 2015...
DONE Reading bam file /home/james/Documents/EMIRGE/test at Sun May 17 19:04:30 2015 [0:00:04.764734]...
Calculating likelihood (66, 271902) for iteration 0 at Sun May 17 19:04:30 2015...
Calculating Pr(N=n) for iteration 0 at Sun May 17 19:04:30 2015...
DONE calculating Pr(N=n) for iteration 0 at Sun May 17 19:04:32 2015 [0:00:01.941272]...
DONE Calculating likelihood for iteration 0 at Sun May 17 19:04:34 2015 [0:00:03.939788]...
Calculating posteriors for iteration 0 at Sun May 17 19:04:34 2015...
DONE Calculating posteriors for iteration 0 at Sun May 17 19:04:34 2015 [0.045 seconds]...
Writing consensus for iteration 0 at Sun May 17 19:04:34 2015...
snp_minor_prob_thresh = 0.100
snp_percentage_thresh = 0.040
splitting sequence 0 (GU366042.1.1200) to 66 (GU366042.1.1200_m01)...
Split out 1 new minor strain sequences.
Average time for split sequence: [0.233058 seconds]
Average time for posterior update: [0.231215 seconds]
Average time for non-split sequences: [0.000004 seconds]
Culled 65 sequences
DONE Writing consensus for iteration 0 at Sun May 17 19:04:35 2015 [0:00:00.432150]...
Clustering sequences for iteration 0 at Sun May 17 19:04:35 2015...
cluster threshold = 0.970
[fai_load] build FASTA index.
[fai_load] build FASTA index.
usearch command was:
usearch -search_global /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fasta.tmp.fasta --db /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fasta.tmp.fasta --id 0.800 -quicksort -query_cov 0.5 -target_cov 0.5 -strand plus --userout /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fasta.tmp.fasta.us.txt --userfields query+target+id+caln+qlo+qhi+tlo+thi -threads 1 -fulldp
usearch v8.0.1517_i86linux32, 4.0Gb RAM (11.5Gb total), 1 cores
(C) Copyright 2013-15 Robert C. Edgar, all rights reserved.
http://drive5.com/usearchLicensed to:
bsp...@bangor.ac.uk00:00 36Mb 0.1% Reading /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fa00:00 40Mb 100.0% Reading /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fasta.tmp.fasta
00:00 6.1Mb 0.1% Masking 00:00 6.1Mb 100.0% Masking
00:00 42Mb 100.0% Searching iter.00.cons.fasta.tmp.fasta, 100.0% matched
WARNING: Option -quicksort ignored
Writing new fasta file for iteration 0 at Sun May 17 19:04:35 2015...
removed 0 sequences after merging
sequences remaining for iteration 00: 2
DONE Clustering sequences for iteration 0 at Sun May 17 19:04:35 2015 [0:00:00.130133]...
Writing priors and probN to disk for iteration 0 at Sun May 17 19:04:35 2015...
DONE Writing priors and probN to disk for iteration 0 at Sun May 17 19:04:35 2015...
Starting read mapping for iteration 0 at Sun May 17 19:04:35 2015...
bowtie-build command:
bowtie-build -o 3 /home/james/Documents/EMIRGE/iter.00/iter.00.cons.fasta /home/james/Documents/EMIRGE/iter.00/bowtie.index.iter.00 > /home/james/Documents/EMIRGE/iter.00/bowtie.iter.00.log 2>&1
bowtie command:
cat /home/james/Documents/EMIRGE/forward.fastq | bowtie --phred33-quals -t -p 1 -n 3 -l 20 -e 300 --best --strata --all --sam --chunkmbs 128 --minins 100 --maxins 200 /home/james/Documents/EMIRGE/iter.00/bowtie.index.iter.00 -1 - -2 /home/james/Documents/EMIRGE/reverse.fastq | samtools view -b -S -F 0x0004 - > /home/james/Documents/EMIRGE/iter.00/bowtie.iter.00.PE.bam 2> /home/james/Documents/EMIRGE/iter.00/bowtie.iter.00.log
Time loading reference: 00:00:00
Time loading forward index: 00:00:00
Time loading mirror index: 00:00:00
Seeded quality full-index search: 00:00:17
# reads processed: 135951
# reads with at least one reported alignment: 2 (0.00%)
# reads that failed to align: 135949 (100.00%)
Reported 4 paired-end alignments to 1 output stream(s)
Time searching: 00:00:17
Overall time: 00:00:17
Finished Bowtie for iteration 00 at Sun May 17 19:04:52 2015:
DONE with read mapping for iteration 0 at Sun May 17 19:04:52 2015...
Finished iteration 0 at Sun May 17 19:04:52 2015...
Total time for iteration 0: 0:00:26.137206
Starting iteration 1 at Sun May 17 19:04:52 2015...
Reading bam file /home/james/Documents/EMIRGE/iter.00/bowtie.iter.00.PE.bam at Sun May 17 19:04:52 2015...
[fai_load] build FASTA index.
DONE Reading bam file /home/james/Documents/EMIRGE/iter.00/bowtie.iter.00.PE.bam at Sun May 17 19:04:52 2015 [0:00:00.195924]...
Calculating likelihood (67, 271902) for iteration 1 at Sun May 17 19:04:52 2015...
Calculating Pr(N=n) for iteration 1 at Sun May 17 19:04:52 2015...
DONE calculating Pr(N=n) for iteration 1 at Sun May 17 19:04:52 2015 [0:00:00.002210]...
DONE Calculating likelihood for iteration 1 at Sun May 17 19:04:52 2015 [0:00:00.002647]...
Calculating posteriors for iteration 1 at Sun May 17 19:04:52 2015...
DONE Calculating posteriors for iteration 1 at Sun May 17 19:04:52 2015 [0.008 seconds]...
Writing consensus for iteration 1 at Sun May 17 19:04:52 2015...
snp_minor_prob_thresh = 0.100
snp_percentage_thresh = 0.040
Split out 0 new minor strain sequences.
Average time for non-split sequences: [0.000000 seconds]
Culled 2 sequences
DONE Writing consensus for iteration 1 at Sun May 17 19:04:52 2015 [0:00:00.005879]...
Clustering sequences for iteration 1 at Sun May 17 19:04:52 2015...
cluster threshold = 0.970
[fai_load] build FASTA index.
[fai_load] fail to open FASTA index.
Traceback (most recent call last):
File "/usr/local/bin/emirge.py", line 1697, in <module>
main()
File "/usr/local/bin/emirge.py", line 1688, in main
do_iterations(em, max_iter = options.iterations, save_every = options.save_every)
File "/usr/local/bin/emirge.py", line 1444, in do_iterations
os.path.join(subdir, "iter.%02d.cons.fasta"%(em.iteration_i)))
File "/usr/local/bin/emirge.py", line 500, in do_iteration
self.cluster_sequences(consensus_filename) # merges sequences that have evolved to be the same (USEARCH)
File "/usr/local/bin/emirge.py", line 807, in cluster_sequences
return self.cluster_sequences2(fastafilename)
File "/usr/local/bin/emirge.py", line 829, in cluster_sequences2
num_seqs = self.write_consensus_with_mask(fastafilename, tmp_fastafilename, mask="soft")
File "/usr/local/bin/emirge.py", line 769, in write_consensus_with_mask
reference_fastafile = pysam.Fastafile(reference_fastafilename)
File "cfaidx.pyx", line 81, in pysam.cfaidx.FastaFile.__cinit__ (pysam/cfaidx.c:1701)
File "cfaidx.pyx", line 105, in pysam.cfaidx.FastaFile._open (pysam/cfaidx.c:2272)
IOError: could not open file `/home/james/Documents/EMIRGE/iter.01/iter.01.cons.fasta`
Thanks,
James