Dear Sir/Madam,
I have watched the webpage(http://genomewiki.ucsc.edu/index.php/Whole_genome_alignment_howto) in UCSC wiki about the masking before alignment, and it reads “MASKING: Both genomes have to be repeatmasked and masked Tandem Repeat Finder (trf) first (thanks to Hiram for pointing this out)”.
And then I have two questions about the masking:
1. Why just use the repeat annotation of repeatmasker and trf ?How to determin the criteria that what kind of repeat annotations should we use? Can I use the repeat annotation consist of repeatmodeler,repeatmasker and repeatmodeler?
2. Which masking method should we use? Hardmask or softmask? Will softmask cause the false negative problem?
Thank you very much!
Best!
Duo
Dear Duo,
Thank you for using the UCSC Genome Browser and your question about masking before alignment.
We use RepeatMasker and TRF because they work well for human and other vertebrates. NCBI uses WindowMasker. You may want to experiment with different tools to figure out which tool works best for your genomes and analysis needs. In general, less masking means more sensitivity, but can also lead to a huge amount of output for repetitive or low-complexity regions.
For your second question, we use softmasked sequences. Using lastz does not begin an alignment in a softmasked sequence, but it can extend an alignment through softmasked sequence if doing so increases the score of the alignment.
Thank you again for your inquiry and using the UCSC Genome Browser. If you have any further public questions, please reply to gen...@soe.ucsc.edu. All messages sent to that address are archived on a publicly-accessible forum. If your question includes sensitive data, you may send it instead to genom...@soe.ucsc.edu.
All the best,
Brian Lee
UC Santa Cruz Genomics Institute
--
--- You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To post to this group, send email to gen...@soe.ucsc.edu.
Visit this group at https://groups.google.com/a/soe.ucsc.edu/group/genome/.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/794f4522-a836-06f2-90be-ba4ca38df128%40soe.ucsc.edu.
Dear Brian and Hiram,
Thank you very much for your quick reply and it is very helpful!
Best!
Duo