At one point during the assembly, the reads are aligned back to an
intermediate assembly before proceeding with the paired-end assembly.
The default aligner is KAligner, which is included with ABySS. The
abyss-bwa script is a small wrapper script to facilitate using BWA to
align the reads to the contigs.
To align contigs to a reference you do not want to use the abyss-bwa
script. Instead use `bwa bwasw'.
k=25 is quite small. What is your read length? Have you tried other
values of k? Higher coverage data sets require larger values of k.
Cheers,
Shaun
To answer your second question, you can use BAM (or SAM) files as input to ABySS. A use-case for this might be something like "try to assemble all the reads that didn't align to my genome reference".
In general though, we usually prefer to work from the unmodified qseq or fastq files.
Cheers,
Rod
With 500x coverage the optimal k will be pretty close to your read
length. I'd try assembling every 8th k from 64 to 96 and compare the N50
of those assemblies. If you find a peek in the N50, you can try a few
more assemblies around that value of k.
Cheers,
Shaun
Hi, Shaun.
I've been labouring under the misapprehension that we should always use
odd kmer values for ABySS to avoid palindromes - Is that wrong?
I tried a monotonically increasing sequence of kmer values with ABySS at
first, but the even ones clearly belong to a different set than the odd
ones. After some reading around, I discovered the problem of hash keys
with palindromes using even kmer values...
Bye,
Tony.
Hi Tony,
Even values of k should not be a problem for ABySS. I'd be happy to see
a counterexample if you have one.
Cheers,
Shaun