Good Morning Assa:
Yes, you will need to break up your NR sequence into manageable chunks
that will fit into a 2bit file.
You would need to do this anyway for your analysis: even if you could get
all the sequence into one file, that file would be too large to run blat
against efficiently.
Partition both your query and your target sequences into chunks, choosing
the chunk size and count so that a blat of one target chunk against one
query chunk finishes in a reasonable run time. Run all possible
query-to-target combinations on a compute cluster, then filter the psl
results to your desired match criteria.
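
As a rough illustration of the steps above, here is a minimal Python sketch. It is only one way to do it, under stated assumptions: the chunk size, the file names (target0000.2bit, query0000.fa, the out/ directory) and the min_matches cutoff are all made up for the example. It also assumes you write each chunk to its own fasta file and convert the target chunks with faToTwoBit before running the jobs, which the sketch does not do for you. The command lines use the basic "blat database query output.psl" form, and the filter relies only on the first psl column being the match count.

```python
from itertools import product


def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, parts = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(parts)
            header = line[1:].split()
            name, parts = (header[0] if header else ""), []
        else:
            parts.append(line)
    if name is not None:
        yield name, "".join(parts)


def chunk_records(records, max_size):
    """Group whole records into chunks of at most max_size letters.

    A single record longer than max_size still gets a chunk of its own.
    """
    chunk, size = [], 0
    for name, seq in records:
        if chunk and size + len(seq) > max_size:
            yield chunk
            chunk, size = [], 0
        chunk.append((name, seq))
        size += len(seq)
    if chunk:
        yield chunk


def blat_jobs(n_targets, n_queries):
    """Return one blat command line per target/query chunk pair.

    File names here are hypothetical placeholders.
    """
    return [
        f"blat target{t:04d}.2bit query{q:04d}.fa out/t{t:04d}.q{q:04d}.psl"
        for t, q in product(range(n_targets), range(n_queries))
    ]


def keep_psl(line, min_matches):
    """True if a psl line is an alignment with at least min_matches matches.

    Header lines are skipped because their first field is not a number.
    """
    first = line.rstrip("\n").split("\t")[0]
    return first.isdigit() and int(first) >= min_matches
```

Each line returned by blat_jobs can go into whatever job list your cluster scheduler accepts. With T target chunks and Q query chunks you get T x Q jobs, so the chunk size is the knob that trades run time per job against the number of jobs.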
--Hiram