Hello, Aude.
Thank you for your interest in the Genome Browser.
While liftOver cannot directly convert a file in that format, you can reformat the file, covert it, and recover it as you suggest.
The liftOver utility can be found in our download directory (http://hgdownload.soe.ucsc.edu/admin/exe/). You will want to download the utility corresponding to your operating system, for example linux is found here (http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/liftOver):
You can run the utility on its own to see a help message, e.x.
You can start with a file as you suggest, separated by tabs (or spaces), note that you will want to comment out the header line, or remove it:
You will then want to create two separate files, one with the start coordinates and one with the end coordinates. We are also saving the entire record as the 4th column in order to be able to accurately recombine the file:
If you kept the header line, you will want to comment it out in the created endPos.hg38.bed file now.
Both of these new files are bed 4+1 files, with the additional field being the weight column seen in your example. You will now want to download the chain file from hg38 to hg19:
At this point you can run the liftOver command on both files:
You will now have four new files. startPos.hg19.bed and endPos.hg19.bed will contain the successfully lifted coordinates for each file, and the two Unmapped files will contain any records that failed to lift.
You can now combine the two files and order the results by position:
The file will be recombined based on the initial coordinates, which can still be found in the first column of this final output. Keep in mind that there may be unmapped records.
Let us know if you have any questions on this process, or any other Genome Browser features or utilities.
I hope this is helpful. Please include gen...@soe.ucsc.edu in any replies to ensure visibility by the team. All messages sent to that address are archived on our public forum. If your question includes sensitive information, you may send it instead to genom...@soe.ucsc.edu.
Lou Nassar
UCSC Genomics Institute
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/VI1PR0701MB2830F7CAF7776570E6F0CE6AD8B59%40VI1PR0701MB2830.eurprd07.prod.outlook.com.