Hi Shin,
Yes, that is correct that these files are generated from the blastz
alignment using those programs. Here are the steps:
1) lavToAxt is used to convert the blastz lav file to an axt file
lavToAxt in.lav tNibDir qNibDir out.axt
where the tNibDir is a directory containing the chrom sequence files in
nib format for the target and qNibDir contains those for the query.
Alternatively, a 2bit format file can specified for the sequences
instead of these directories. If you are looking at chimp alignments on the
human browser, then chimp is the query and human is the target.
2) Next axtChain is run to create the chains.
axtChain in.axt tNibDir qNibDir out.chain
Sometimes the -minScore option is used to filter out low scoring
alignments and -scoreScheme is used if the scoring matrix used for blastz
is not the default.
tNibDir and qNibDir are the same as for step 1.
Chains are then sorted using chainMergeSort:
chainMergeSort file(s)
so a list of chain files is given as input to the program. If you see
a file such as mm6.rn3.all.chain.gz in the downloads then this *.all.chain
file was created in this way.
3) To create the nets from the chains,
i) the chainPreNet program is used:
chainPreNet in.chain target.sizes query.sizes out.chain
in.chain can be the all.chain produced by the previous step.
target.sizes and query.sizes are files with a list of the chromosomes and
their sizes seperated by tabs
e.g.
chr1 195109612
chr2 181764313
etc.
This can be obtained by downloading the chromInfo table for the relevant
assembly through our Downloads site:
http://hgdownload.cse.ucsc.edu/downloads.html
Pick the relevant species and assembly and follow the link to Annotation
Database where you will find the dumps from our database tables.
ii)Then add synteny information to the net:
netSyntenic
in.net out.net
iii) Add classification information using the database tables:
netClass
in.net tDb qDb
out.net
If lineage-specific repeats were used for the Blastz alignment then
the -tNewR and -qNewR options must be used.
-tNewR=dir - Dir of chrN.out.spec files, with RepeatMasker .out format
lines describing lineage specific repeats in target
-qNewR=dir - Dir of chrN.out.spec files for query
The
out.net file is the gzipped net file that you see in our Downloads
area.
4) The axtNet files are created by converting the net to axt format.
This is done using the netToAxt program:
netToAxt
in.net in.chain tNibDir qNibDir out.axt
see step 1 for definition of tNibDir and qNibDir.
I hope that this helps. Please let me know if you need further details or
have any more questions.
Rachel
> _______________________________________________
> Genome maillist - Genome at
soe.ucsc.edu
>
http://www.soe.ucsc.edu/mailman/listinfo/genome
>
--
UCSC Genome Bioinformatics Group
http://genome.ucsc.edu