Can the h19 UCSC human reference genome sequence be updated to use the current NC_012920.1 mitochondrial sequence (same as http://www.ncbi.nlm.nih.gov/nuccore/J01415.2 from MitoMap) which has been in the GRCh37 reference genome build since way back in patch release 2?
Currently hg19 contains the outdated NC_00180 human mitochondrial sequence instead, which causes many issues for people using the hg19 sequence as the reference sequence for aligning and calling variants from NGS DNA resequencing data. Specifically since those mitochondrial sequences differ in length and likewise mitochondrial gene positions, to correctly annotate mitochondrial variants, researchers need to know more than whether they used the current human reference genome – they need to know if they used hg19 or GRCh37 which is a detail many scientists who are analyzing variant data do not know.
The GRCh38 build next year of the reference genome will include NC_012920.1 too, so could the default hg19 download be updated to include the current mitochondrial sequence now to improve accuracy and consistency of human genome variant interpretation? If hg19 must remain fixed, could at least a warning be displayed to alert folks that the mitochondrial sequence is out of date and provide them with a download URL to hg19 with the current sequence – many folks do not realize they have outdated alignment data until long after they’ve aligned data relative to hg19 and discover alignment issues.
Sincerely,
Dan Richards