Dear colleagues and team,
I have a question about the discrepancy I have encountered after I tried to get conservation score (phastCons score) from UCSC genome browser with two approaches: query from online table browser and search in downloaded .wigFix data. Details are appended at the end of this mail.
Here are my questions, and I would really appreciate your help for me to understand them:
The chromosome I used for the test is chr17, for which I defined the region as ”chr17 0 10’.
In the table browser, I chose
Clade: Mammal; Genome: Human; assembly: hg19; group: Comparative Genomics; track: Conservation; table: phasCons100way
Following is the output
variableStep chrom=chr17 span=1
1 0.0991496
2 0.0929528
3 0.0557717
4 0.0495748
5 0.0185906
6 0.0123937
7 0.00619685
8 0.00619685
9 0
10 0
For the downloaded chr17.phastCons46way.placental.wigFix
$ head chr17.phastCons46way.placental.wigFix
fixedStep chrom=chr17 start=1 step=1
0.096
0.089
0.071
0.060
0.023
0.016
0.010
0.010
0.010
0.010
Thank you very much.
Ju
$ head -11 chr17.phastCons100way.wigFix
fixedStep chrom=chr17 start=1 step=1
0.102
0.093
0.059
0.050
0.021
0.013
0.011
0.008
0.003
0.002
--
I hope this helps clarify things.(paragraph 2):Hello again, Ju.A colleague of mine pointed out something on the help page I referred to you:
For speed and efficiency, wiggle data is compressed and stored internally in 128 unique bins. This compression means that there is a minor loss of precision when data is exported from a wiggle track (i.e., with output format "data points" or "bed format" within the table browser). The bedGraph format should be used if it is important to retain exact data when exporting.
- - -
Luvina Guruvadoo
UCSC Genome Bioinformatics Group
--