More confusion caused by the diverse origins of statistics.
ANCOVA, ANOVA, OLS regression are all the same model in matrix terms
Y = XB + e
the various X can be interval, or categorical or dichotomous, or whatever.
The ANOVA/ANCOVA stuff comes from agriculture, where they compared plots of land.
The regression stuff comes from astronomy and geography, where they measured distances
But it's all the same stuff.
Peter
Peter L. Flom, PhD
Statistical Consultant
Website: www DOT peterflomconsulting DOT com
Writing; http://www.associatedcontent.com/user/582880/peter_flom.html
Twitter: @peterflom
Well ... maybe *I'm* confused. Or maybe the terms are used differently in different fields. In my experience, in the social sciences and (a little less experience) in medicine, the terms are used as follows:
ANOVA - all the IVs are categorical
ANCOVA - some IVs are continuous, some categorical but there's not necessarily the distinctions Ted makes, above.
Regression - EITHER all the IVs are continuous OR there's a mix.
It's certainly true, in any field, that we may be interested in some variables only as covariates, but these variables need not be categorical. For example, in medicine, age is often a covariate (in this sense) in survival analysis. Everyone knows age is strongly related to death, so we better remove its effects. But, AFAIK, this can be done by simply including it in the regression.
Indeed, in Ted's example, it seems to me that the covariates are continuous, while the independent variable is categorical.
Very confusing!
And not made clearer by different people using terms in different ways.
_____________________________________________________
Doug Altman
Professor of Statistics in Medicine
Centre for Statistics in Medicine
University of Oxford
Wolfson College Annexe
Linton Road
Oxford OX2 6UD
email: doug....@csm.ox.ac.uk
Tel: 01865 284400 (direct line 01865 284401)
Fax: 01865 284424
www: http://www.csm-oxford.org.uk/
EQUATOR Network - resources for reporting research
www: http://www.equator-network.org/
My comment is that in my field, psychology, there is a question about when to use ANOVA vs. ANCOVA. Ostensibly, ANOVA tests the differences in means among groups. ANCOVA includes a covariate in this analysis for statistical control. Because when we gather data, we often gather demographic variables along with our predictor and outcome variables. If the hypothesis is to test the difference among means among three groups, and the selected analysis is ANOVA, the question is raised, why not use an ANCOVA and include the covariates for statistical control? If that is the case, then that would make every analysis where the intent is to compare group means, an ANCOVA because why wouldn't you want to include covariates for statistical control? In other words, assuming a sufficient sample size, why not conduct an ANCOVA every time instead of an ANOVA if you have covariates, especially demographics, where you can enter the covariates for statistical control? That would mean we would conduct ANCOVA’s every chance we get, i.e., whenever we have covariates. I would rather have the decision to use ANOVA vs. ANCOVA be based on conceptual and/or statistical grounds, but I can't seem to find such a justification for using ANOVA vs. ANCOVA.
AFter reading this thread below, it seems that the terms ANOVA and ANCOVA are (a) relics from a past when hand computations were necessary, (b) ANOVA and ANCOVA are as I have come to realize, extensions of the OLS, similar to multiple regression, and (c) it is more important to specify the model rather than referring to the analysis as an ANOVA or ANCOVA.
To answer my own question then, use ANCOVA rather than perseverate about the difference between ANOVA and ANCOVA, and focus attention on specifying the model.
Thanks for your comments, and any articles or website references are appreciated.
Peter
AFter reading this thread below, it seems that the terms ANOVA and ANCOVA are (a) relics from a past when hand computations were necessary, (b) ANOVA and ANCOVA are as I have come to realize, extensions of the OLS, similar to multiple regression, and (c) it is more important to specify the model rather than referring to the analysis as an ANOVA or ANCOVA.To answer my own question then, use ANCOVA rather than perseverate about the difference between ANOVA and ANCOVA, and focus attention on specifying the model.
On Monday, November 2, 2009 at 3:44:04 PM UTC-6, plf515 (Peter Flom) wrote:...
More confusion caused by the diverse origins of statistics.
ANCOVA, ANOVA, OLS regression are all the same model in matrix terms
Y = XB + e
the various X can be interval, or categorical or dichotomous, or whatever.
The ANOVA/ANCOVA stuff comes from agriculture, where they compared plots of land.
The regression stuff comes from astronomy and geography, where they measured distances
ANOVA, ANCOVA and linear regression are all the same model. They are all
Y = b_0 + b_1x_1 + b_2x_2 + ….. b_px_p + e
It took a while for people to realize that all these models were the same because they developed in different fields and the people in those fields didn’t talk to each other.
In ANOVA the x variables are al categorical. In ANCOVA, some are categorical and some continuous. But the linear model can hand x variables of any sort (although ordinal ones are tricky).
Also, the traditional output (from either hand calculations or computer programs) of ANOVA and regression look different but mean the same thing.
These days, in SAS, for example, PROC GLM is used a lot more than PROC ANOVA. And, in R, the lm function handles all three types of analyses.
Peter
--
--
To post a new thread to MedStats, send email to MedS...@googlegroups.com .
MedStats' home page is http://groups.google.com/group/MedStats .
Rules: http://groups.google.com/group/MedStats/web/medstats-rules
---
You received this message because you are subscribed to the Google Groups "MedStats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to medstats+u...@googlegroups.com.
To view this discussion on the web, visit https://groups.google.com/d/msgid/medstats/e34e3e07-214e-4e20-a3b2-a49d48eeb63d%40googlegroups.com.
Concise
Encyclopedia
of Biostatistics for
Medical Professionals
Martin P. Holt