So I have been exploring this project for few days now in order to write a proposal describing my approach and I have a few concerns.
My first thought about converting a file format to another was that I would gain better insight on this if I can find a same genome file in all 3 formats. Searching on google with the following didn’t get me anywhere.
-> ".hgvs"
-> filetype:hgvs
I am struggling to find even one such trio, if you can help me with this that’d be great.
So my most prior hurdle is that I am finding the terms in hgvs format (
http://varnomen.hgvs.org) incomparable to VCF or MAF format for the following reasons :
-Most of the terms used in hgvs specification are not present at all in the VCF format specification pdf. (eg : frame shift, protein, nucleotide)