maker_opts.ctl supplies the opportunity to specify the gmhmm, but I
don't think that genemark-es supplies a pre-built hmm (being a
self-training algo).
I've supplied the location of gmhmme3 in maker_exe.ctl, and set
unmask=1 in maker_opts.ctl, but the gene models generated look to come
soley from snap (snap hmm was supplied).
Is it possible to add evidence from genemark-es to the generated gene
models? Is the best option to simply run genemark-es independantly,
and then supply the gff as pred_gff in maker_opts.ctl?
-r
Rob Syme
PhD Student
Curtin University
Western Australia
_______________________________________________
maker-devel mailing list
maker...@box290.bluehost.com
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
At the end, there will be a mod directory created after running
Genemark-ES. Take the es.mod file in that directory. That is your final
HMM. Provide that to gmhmm in MAKER.
Depending on what organism you are annotating, you may want to consider
other predictors as well. Genemark works well on fungi and Oomycetes
(organisms with short introns and long exons). It performs poorly
relative to other predictors as introns become longer and exons become
shorter (like on mammals or many species of insects). For those genomes
Augustus tends to produce better results, it is harder to train though.
Thanks,
Carson
Fungal transcriptome assemblies often yield a number of erroneously
joined transcripts, due to high gene densities and overlapping UTRs in
fungi. Passing these fusion transcripts to MAKER as EST evidence
sometimes leads to prediction of merged genes, which are actually two.
Is there a way to compensate for this problem in MAKER? I understand
that newly trained predictors like SNAP rely on est2genome. But could a
correction possibly inferred from blast evidence, where two adjacent
similarity regions in one gene model actually originate from two
separate proteins?
Thanks
Olaf
--Carson