Good morning,
I am comparing two genomes using the UCSC pipeline: Lastz + axtChain + chainNet. I would like to detect the duplications specific to each genome. Do you have a pipeline or an available script to do that?
I am trying to do it myself but have run into several problems. My strategy is (for one direction, i.e. one genome as reference and the other as query):
- scanning the net, I retrieve all regions in the query that are used twice (or at least twice, if I want to detect duplications with more than 2 copies)
- I create a BED file with these regions
- I run liftOver with the -multiple flag to get all possible corresponding regions in the second genome
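The first step can be sketched as follows. This is only a minimal illustration, assuming the query intervals have already been extracted from the net file as (sequence, start, end) tuples in BED-style coordinates; the function name multi_covered and the example intervals are hypothetical.

```python
from collections import defaultdict


def multi_covered(intervals, min_copies=2):
    """Return (seq, start, end) regions covered by at least min_copies intervals.

    intervals: iterable of (seq, start, end), 0-based half-open (BED style).
    Assumption: these were extracted from the net beforehand.
    """
    events = defaultdict(list)
    for seq, start, end in intervals:
        events[seq].append((start, 1))   # an interval opens
        events[seq].append((end, -1))    # an interval closes
    regions = []
    for seq in sorted(events):
        depth = 0
        region_start = None
        # At equal positions, process openings before closings so that a
        # multi-covered region is not split; zero-length hits are dropped below.
        for pos, delta in sorted(events[seq], key=lambda e: (e[0], -e[1])):
            new_depth = depth + delta
            if depth < min_copies <= new_depth:
                region_start = pos
            elif new_depth < min_copies <= depth:
                if pos > region_start:
                    regions.append((seq, region_start, pos))
                region_start = None
            depth = new_depth
    return regions


# Hypothetical intervals, for illustration only.
example = [("q", 0, 100), ("q", 50, 150), ("q", 120, 200),
           ("r", 0, 10), ("r", 10, 20)]
for seq, start, end in multi_covered(example):
    print(seq, start, end, sep="\t")
```

Raising min_copies to 3 or more would restrict the output to regions used at least that many times.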
However, I do not understand why liftOver does not give me all possible regions in the second genome.
Here is an example:
1/ working well
bed :
SFRU_RICE_008595 5379 5468 reg_3 #scaffold/start/end/id
chains (I have omitted the gap information for clarity) containing the scaffold SFRU_RICE_008595:
chain 84620 SFRU_RICE_008595 14293 + 317 1485 scaffold_28689 2593 + 3 1182 1265
chain 327568 SFRU_RICE_008595 14293 + 9740 14236 scaffold_18603 4868 - 0 4326 507
chain 960741 SFRU_RICE_008595 14293 + 0 14236 scaffold_425 146919 - 105838 121010 141
chain 40715 SFRU_RICE_008595 14293 + 2978 3621 scaffold_425 146919 - 108358 108971 1631
chain 37350 SFRU_RICE_008595 14293 + 6207 6750 scaffold_425 146919 - 111664 112184 1674
chain 32066 SFRU_RICE_008595 14293 + 6780 7165 scaffold_425 146919 - 112410 112791 1796
chain 9207 SFRU_RICE_008595 14293 + 5192 5478 scaffold_7072 14142 + 3268 3659 30806
The third and seventh chains (ids 141 and 30806) include the region written in the BED file.
lift file :
scaffold_425 36065 36154 reg_3 1
scaffold_7072 3557 3649 reg_3 2
Two regions in the second genome are returned.
2/ not giving the expected regions
bed :
SFRU_RICE_008595 317 1484 reg_1
chains : same as above
The first and third chains (ids 1265 and 141) include the region written in the BED file.
lift file :
scaffold_28689 3 1181 reg_1 1
Only the region corresponding to the first chain is returned. Can you explain why no region from scaffold_425 is reported for this BED entry?
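For reference, this is roughly how the chains spanning a BED interval can be listed from the header lines. It is a minimal sketch that looks at the chain header lines only (the standard 13-field chain header) and ignores the gap lines omitted above, so a chain reported here may still have a gap over the region. The function names are hypothetical.

```python
HEADERS = [
    "chain 84620 SFRU_RICE_008595 14293 + 317 1485 scaffold_28689 2593 + 3 1182 1265",
    "chain 327568 SFRU_RICE_008595 14293 + 9740 14236 scaffold_18603 4868 - 0 4326 507",
    "chain 960741 SFRU_RICE_008595 14293 + 0 14236 scaffold_425 146919 - 105838 121010 141",
    "chain 40715 SFRU_RICE_008595 14293 + 2978 3621 scaffold_425 146919 - 108358 108971 1631",
    "chain 37350 SFRU_RICE_008595 14293 + 6207 6750 scaffold_425 146919 - 111664 112184 1674",
    "chain 32066 SFRU_RICE_008595 14293 + 6780 7165 scaffold_425 146919 - 112410 112791 1796",
    "chain 9207 SFRU_RICE_008595 14293 + 5192 5478 scaffold_7072 14142 + 3268 3659 30806",
]


def parse_chain_header(line):
    """Parse one chain header line into a dict.

    Fields: chain score tName tSize tStrand tStart tEnd
            qName qSize qStrand qStart qEnd id
    """
    f = line.split()
    if len(f) != 13 or f[0] != "chain":
        raise ValueError("not a chain header: %r" % line)
    return {
        "score": int(f[1]),
        "t_name": f[2], "t_size": int(f[3]), "t_strand": f[4],
        "t_start": int(f[5]), "t_end": int(f[6]),
        "q_name": f[7], "q_size": int(f[8]), "q_strand": f[9],
        "q_start": int(f[10]), "q_end": int(f[11]),
        "id": int(f[12]),
    }


def chains_spanning(headers, seq, start, end):
    """Return chains whose header span fully contains [start, end) on seq.

    Header span only: gaps inside a chain are not checked here.
    """
    hits = []
    for line in headers:
        c = parse_chain_header(line)
        if c["t_name"] == seq and c["t_start"] <= start and end <= c["t_end"]:
            hits.append(c)
    return hits


for name, start, end in [("reg_3", 5379, 5468), ("reg_1", 317, 1484)]:
    for c in chains_spanning(HEADERS, "SFRU_RICE_008595", start, end):
        print(name, c["id"], c["q_name"], c["q_strand"], sep="\t")
```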
Thank you very much in advance for your help.
Anaïs