Dear UCSC Team,
I am trying to liftover some RNAseq data from hg38 to hg19.
However in the output I get contains a number of overlapping regions, which is first confusing me and second does not allow me to convert .bedGraph to .bigwig to upload the data to UCSC browser and compare with other data.
The typical example is below, from sorted liftOver output .bedGraph file:
Line 2 here tells that the whole fragment from 10531 to 10564 is 0, but then line 3 that 10533 to 34 is 3, 10534 to 10535 is 4 etc
chr1 10485 10531 1
chr1 10531 10564 0
chr1 10533 10534 3
chr1 10534 10535 4
chr1 10535 10539 5
chr1 10539 10541 6
chr1 10541 10582 7
chr1 10564 10614 1
chr1 10614 13488 0
chr1 13085 13090 3
chr1 13090 13092 2
In unsorted output file sorting is a bit strange, but maybe it is supposed to be like that.
There is almost the whole chr1, ordered, then random piece from chr1 beginning, then random part of chrY, then chr2, chr1 again...
chr1 10485 10531 1
chr1 10531 10564 0
chr1 10564 10614 1
chr1 10614 13488 0
------[all the chr1, ordered]------
chr1 174426 174444 1 [last chr1 fragment]
chr1 10533 10534 3 [chr1 beginning, overlapping regions here]
chr1 10534 10535 4 [overlapping regions here]
chr1 10535 10539 5 [overlapping regions here]
chr1 10539 10541 6 [overlapping regions here]
chr1 10541 10582 7 [overlapping regions here]
chrY 59361617 59361649 1
chr2 114358172 114358221 0
chr2 114358113 114358121 0
chr2 114358112 114358113 1
chr2 114358064 114358112 2
chr2 114358062 114358064 1
chr2 114357993 114358062 0
chr2 114357974 114357993 1
chr2 114357946 114357974 2
chr2 114357943 114357946 3
chr1 13085 13090 3
chr1 13090 13092 2
chr1 13092 13116 3
chr1 13116 13118 4
chr15 102518034 102518051 3
chr15 102518027 102518034 2
chr2 114357759 114357849 0
chr2 114357733 114357759 1
Scripts I used were:
May be you have some advice how to avoid these overlapping elements arising in the output or how to delete/merge them?
Thank you,
Inna