Hi everyone,I have some questions about cluster plots.After runing 'difine_clusters_by_cutting_tree.pl', it generates cluster plots. My question is Y-axis, why its parameter is 'centered log2(fpkm+1)'? I understand the values are log1-transformed, but I don't know it is centered and (fpkm+1).The followings are my questions1) centered??According to Trinity manual, each transcript's expression values are centered by the median value, so what does it mean median of? How to calculate it?
2) fpkm+1Why not using just fpkm? Why is fpkm added by 1?
3) In a cluster plots, there are gray lines and a blue line. I think gray lines mean each trasscript's patterns, but what is the blue line? Is it average value or median?
--Thank you,Suejin
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
Hi,Responses below:
On Thu, May 7, 2015 at 10:44 AM, Suejin Park <sjpar...@gmail.com> wrote:Hi everyone,I have some questions about cluster plots.After runing 'difine_clusters_by_cutting_tree.pl', it generates cluster plots. My question is Y-axis, why its parameter is 'centered log2(fpkm+1)'? I understand the values are log1-transformed, but I don't know it is centered and (fpkm+1).The followings are my questions1) centered??According to Trinity manual, each transcript's expression values are centered by the median value, so what does it mean median of? How to calculate it?Each gene's set of expression values (in log2(fpkm+1)) is centered by subtracting the median. This enables genes to be analyzed according to their relative expression across the different samples, rather than taking account the relative intensity within each sample.2) fpkm+1Why not using just fpkm? Why is fpkm added by 1?This is because some fpkm values are zero and you can't take log(0). log2(fpkm+1) is a rather common thing to do.3) In a cluster plots, there are gray lines and a blue line. I think gray lines mean each trasscript's patterns, but what is the blue line? Is it average value or median?The blue line is the average across all genes in that cluster.
Thank you,Suejin
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.