Hi all,
In short:
I wonder why the query (proCap1) coordinates are not the same in net alignments and in chain alignments (looking at the same chainId).
In more details:
I tried to find the homologous genomic sequence of one gene (SMCHD1) in the rock hyrax (procavia Capensis). Unfortunately several hyrax contig are aligned to the mouse region (chr17:71344493-71475343) and I would like to retrieve a (hypothetical) continuous sequence.
Doing my request in the mysql database, I noticed that coordinates of fragments in proCap1 weren't the same, for a same "chainId" between the chain and net tables.
Here are my requests:
use mm10;
# Chain
SELECT tName,tStart,tEnd,qStrand,qName,qStart,qEnd,score,id FROM chainProCap1
WHERE (tName="chr17"
AND (71344493 <= tStart AND tStart <= 71475343
OR 71344493 <= tEnd AND tEnd <= 71475343))
ORDER BY tStart;
# Net
SELECT level, tName,tStart,tEnd,strand,qName,qStart,qEnd,score,chainId FROM netProCap1
WHERE (tName="chr17" AND type="top"
AND (71344493 <= tStart AND tStart <= 71475343
OR 71344493 <= tEnd AND tEnd <= 71475343))
ORDER BY tStart;
I am not going to show the entire output here (18 and 19 rows), just the first row of each:
# Chain:
tName | tStart | tEnd | qStrand | qName | qStart | qEnd | score | id
chr17 | 71344307 | 71345537 | - | scaffold_115863 | 4129 | 5390 | 33044 | 98261
# Net
level | tName | tStart | tEnd | strand | qName | qStart | qEnd | score | chainId
1 | chr17 | 71344307 | 71345537 | - | scaffold_115863 | 270 | 1531 | 33044 | 98261
This is the same piece of alignment in both cases, why are *qStart* and *qEnd* not matching?
Thanks in advance for your help. I realize I should probably use the .axt alignment, but now these different coordinates are puzzling me.
Best regards,
Guillaume