On 13/06/2011, joey tribbani <jo.tr...@gmail.com> wrote:
> *Hi All,
>
> I would like to understand the significance of -10*log10(pvalue)in MACS
> output. Why MACS report -10*log10(pvalue) value and not pvalue?
Because in this way you have positive numbers; pvalue is by definition
between 0 and 1 and negative, logarithm will make a linear transformation,
-10 reverse it and give you numbers which are easier to manage ..
What should
> be the criteria to filter the peaks on the basis of -10*log10(pvalue)? Which
> peak is better (i) One having lower -10*log10(pvalue) or (ii) one having
> higher -10*log10(pvalue)- I guess this one?
The second - see above. The higher this value the lower the pvalue
>
> For following 3 peaks which is more significant?
>
>
> chr start end length summit tags -10*log10(pvalue)
> fold_enrichment FDR(%)
> chr1 14125 15560 1436 711 96 84.82
> 3.81 15.52
> chr1 18727 20390 1664 1178 88 109.55
> 3.92 15.81
> chr1 136335 137188 854 504 42 134.15
> 7.65 17.97
As a rule of thumb I vote for the peaks with a lower FDR and the
higher number of tags.
Hence my personal filtering preference will be the first. But they are
very similar. I could
suppose you could merge the first two .. which kind of experiment is this one ?
> Is there any document which explains details formula's for calculating
> -10*log10(pvalue), fold_enrichment and FDR?
MACS documentation and papers.
> Would also like to know if pvalue can be calculated from the
> -10*log10(pvalue)...
Er .. this is very simple maths indeed...
HTH,
A
Typically, math in a computer is limited to a certain number of
significant digits. Converting very small decimals (probabilities,
for example) to log space allows one to simply add numbers rather than
having to multiply very small numbers to get even smaller numbers,
thereby losing precision.
> For following 3 peaks which is more significant?
>
Higher is better.
> chr start end length summit tags -10*log10(pvalue)
> fold_enrichment FDR(%)
> chr1 14125 15560 1436 711 96 84.82
> 3.81 15.52
> chr1 18727 20390 1664 1178 88 109.55
> 3.92 15.81
> chr1 136335 137188 854 504 42 134.15
> 7.65 17.97
>
> Is there any document which explains details formula's for calculating
> -10*log10(pvalue), fold_enrichment and FDR?
>
> Would also like to know if pvalue can be calculated from the
> -10*log10(pvalue)...
P = 10 ^ (-Q/10)
where Q is the -10*log10(pvalue)
Sean
--
You received this message because you are subscribed to the Google Groups "MACS announcement" group.
To post to this group, send email to macs-ann...@googlegroups.com.
To unsubscribe from this group, send email to macs-announcem...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/macs-announcement?hl=en.
for example:
(1) P = 10 ^ (-84.82 /10) = 3.2961E-09
Good luck,
Jim
We can certainly answer your questions in email, but I think you might
benefit from getting someone local (a colleague in your lab or even
your local math department) to go through this with you.
Alternatively, start up Excel, make a column with p-values of various
values and put in the formula (-10) * log10(pvalue) in a second
column. That will allow you to experiment and begin to answer these
questions yourself. You will have to justify your decisions when you
write up your results, so it pays to understand the analysis yourself.
Sean