I've been using GMAP, BLAT or exonerate. I'll be interested to hear
which tools others are using.
Which tools are folk using to align the reads to the assembled contigs?
I've been using bowtie, BWA or GSNAP.
Cheers,
Shaun
I'm using exonerate for transcript assemblies and lastz for genomic assemblies. I'm happy with exonerate for the transcripts, especially as I can use a few parsers to get properly formatted GFF3 files for use in GBrowse.
I'm not happy with my genomic solution. I wanted to use mummer but it produces an error message when I try to run all contigs against all chromosomes (the same error Art posted to the mummer mailing list).
What about browsers? So far I haven't found a browser for viewing contigs aligned to a reference that I like at all. The browers I've tried tend to have one or two pros and many cons. Things are OK for bacterial assemblies (or in my case the chloroplast) but scaling up to larger multi-chromsome genomes becomes horrible (at least in my opinion).
For aligning reads to contigs I'm using bowtie or bwa - and I can't decide which to settle on. Again, I haven't found a good browser.
We were recently talking about what metrics we want post aligning reads to contigs and from the assembler output. For example, a list of contigs with lower or higher than average coverage, contigs with higher than average number of graph connection, per base coverage etc. If anyone wants to join our discussions and our effects to script producing these summary stats we would welcome the help - just drop me an email. We are also thinking to try and make this as easy to compare across assemblers as possible to help us get past a purely N50 based comparison.
Nat
About aligning and viewing genomes, what we use here is ABACAS and ACT:
http://sourceforge.net/projects/abacas/
http://www.sanger.ac.uk/resources/software/act/
ABACAS will align your contigs (using nucmer) to a reference that should
be single molecule (line) FASTA file. The output can be loaded along
with the contigs and the reference and you can visualize the matches.
Also, if you BLAST of BLAT your query contigs against the reference, the
m8 format can be loaded in ACT as well.
Cheers.
--
The Wellcome Trust Sanger Institute is operated by Genome Research
Limited, a charity registered in England with number 1021457 and a
company registered in England with number 2742969, whose registered
office is 215 Euston Road, London, NW1 2BE.
For either de novo contig re-alignments and/or reference (distant spp.) alignments we've tried bowtie, bwa, RMAP, SSHA2 and others. For browsers TABLET and commercial products.
Keep me in the loop on the "Metrics" discussion drops off the ABYSS list.
Durrell