Hi,
We are having trouble regenerating the 100way phastcons mod file (https://hgdownload.cse.ucsc.edu/goldenPath/hg38/phastCons100way/hg38.phastCons100way.mod) and are hoping to get some advice on how to do so.
We are following the methods section from https://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&g=cons100way and are running the following commands using the chr22 maf file available
on UCSC (http://hgdownload.cse.ucsc.edu/goldenPath/hg38/multiz100way/maf/chr22.maf.gz):
Our input gff3 file is from RefSeq Select and only contains protein-coding genes. We have also tried using gff3 from Gencode filtered for protein-coding genes but gotten similar results (that are not similar to the downloadable hg38.phastCons100way.mod).
Are we using the wrong gff3 file? We noticed that the methods mention filtering for "single-coverage long transcripts" ... what exactly does that mean? Is it important to compile the mod files across all chromosomes (we are only doing chr22 right now for
testing)? Is there something else that we may be missing?
Thanks!
Best,
Janet
--
---
You received this message because you are subscribed to the Google Groups "UCSC Genome Browser Public Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome+un...@soe.ucsc.edu.
To view this discussion on the web visit https://groups.google.com/a/soe.ucsc.edu/d/msgid/genome/1645543398782.54408%40childrens.harvard.edu.