Hi all,
So for those who couldn't make it to the meeting (baaaaaaaaad... ^_^)
- These plots are showing the log2cpm(counts) for a given gene in different treatment groups at (0, ) 2, ,6, 24 and 48 hours post-treatment.
- Each treatment group contains 10 biological replicates at each timepoint (timepoints marked by the black dotted lines).
- Control group has an additional time point at 0 hours. Obviously, it doesn't make sense having a time zero for treated samples.
- At each timepoint in each group, one biological replicate is an aliquot of alveolar macrophages from one cow. Alveolar macrophages were collected from ten different cows. An aliquot from each cow was exposed to each of the three treatments
Now, my bad. I wrote "linear regression" which is definitely not what's there!
It is a
loess regression where each treatment group is represented by a line (the mean) and a colored area (the confidence interval at level 0.99). The loess regression is in fact a
non-linear regression which combines local regression around the timepoints. (
http://en.wikipedia.org/wiki/Local_regression)
Indeed, the point of my initial email was that the graphs I presented at the opensequencing meeting (the ones on the left side of the slides) included a different number of time points for the treatment groups. More specifically, the control group had 5 timepoints (0,2,6,24,48) while MB and TB only had 4 timepoints (2,6,24,48).
The software I am using is R, more specifically the "ggplot2" library.
The function is ggplot() + geom_smooth(aes(..., level=0.99))
The default level is 0.95 but I boldly went where a bunch of bioinformaticians went before.
I think I addressed all the questions now, otherwise let me know ! :)
Best,
Kevin