memory usage of BLAT

206 views
Skip to first unread message

Yuan Jian

unread,
Jun 10, 2015, 11:48:05 AM6/10/15
to UCSC
Hello,
i would like to use BLAT to query sequence in human genome. could you tell me how much memory is needed to use BLAT? from the BLAT paper, i realized the whole index is loaded into memory. it seems a lot of memory is needed.
thanks
Jian Yu


Jonathan Casper

unread,
Jun 11, 2015, 6:13:49 PM6/11/15
to Yuan Jian, UCSC

Hello Jian Yu,

Thank you for your question about the memory requirements to run BLAT. One of our engineers suggests that the memory usage for BLAT is usually in the 2-4GB range for eukaryote assemblies. That can be reduced substantially by using BLAT to only search against one chromosome at a time, instead of an entire assembly.

If you will be running commandline standalone blat and would like to split up the genome database into smaller pieces (like one run per chromosome), then you can get greater genome-wide masking consistency by creating an ooc file.

To create the ooc, run the following commands:

blat database.fa /dev/null /dev/null -makeOoc=11.ooc -repMatch=1024

Use overused tile file N.ooc. N should correspond to the tileSize. database.fa can also be in 2bit format instead of fa; it should contain all of your chromosomes.

Then to use it on chr1:

blat chr1.fa query.fa chr1.psl -ooc=11.ooc

You can repeat this for each chromosome. More information on using the -ooc option for BLAT can be found in our FAQ at http://genome.ucsc.edu/FAQ/FAQblat.html#blat6. More detailed memory usage information is provided in the documentation at http://genome.ucsc.edu/goldenPath/help/blatSpec.html.

I hope this is helpful. If you have any further questions, please reply to gen...@soe.ucsc.edu or genome...@soe.ucsc.edu. Questions sent to those addresses will be archived in publicly-accessible forums for the benefit of other users. If your question contains sensitive data, you may send it instead to genom...@soe.ucsc.edu.

--
Jonathan Casper
UCSC Genome Bioinformatics Group


--


Reply all
Reply to author
Forward
0 new messages