So far so good ....
>According to what I have read, I have to choose a selection of
>variables backwards method by which I will obtain a linear model.
Not good at all. Backwards methods (and other automatic variable selection methods)
are not good. They are commonly used, but they are wrong.
The
>problem is that I have no experience on how to perform this. Even
>though, I have tried to do it using SPSS software and after running
>the regression, at the results window I get a series of data such us
>tables (descriptive statistics, correlation, included/deleted
>variables, a summary model, ANOVA, analysis of colinearity, excluded
>variables), and Graphics. What is the right way to run the multiple
>regression? How can I get the model from these data? Which data must
>be included in the equation? Thanks in advance for your help.
You might try asking on an SPSS list, for details of how to do things in SPSS,
but which variables you should use is not dependent on software. If you
are trying to replicate previous results, you should use the same variables.
With 500 newborns, you could use all 13 variables - unless there are collinearity problems.
Or you might want to use something like principal component regression, or partial least squares;
you might be concerned with possible nonlinear effects; there are other possibilities as well.
Peter
Peter L. Flom, PhD
Statistical Consultant
www DOT peterflomconsulting DOT com
Collinearity is very likely.
Start with a correlation matrix of all 13 measurements [Statistics =>
Correlate => Bivariate...]. Correlation coefficiants above, say, 0.80
usually show that the inclusion of both variables is not necessary or is
even counterproductive.
Regards,
Christian
Peter Flom schrieb:
You need to request collinearity diagnostics in linear regression. Then, examine the condition indexes. Identify any that are large, ie, >30 (or even 20). Then, examine the associated variance-decomposition proportions for those large condition indexes. Large VDP (>.50) will identify those variables that are involved in the near dependency.
Scott R Millis, PhD, ABPP (CN,CL,RP), CStat, CSci
Professor & Director of Research
Dept of Physical Medicine & Rehabilitation
Dept of Emergency Medicine
Wayne State University School of Medicine
261 Mack Blvd
Detroit, MI 48201
Email: smi...@med.wayne.edu
Tel: 313-993-8085
Fax: 313-966-7682
--- On Thu, 7/2/09, Christian Lerch <t....@gmx.net> wrote:
> > With 500 newborns, you could use all 13 variables - unless there are
>collinearity problems.
Christian Lerch <t....@gmx.net> replied
>
>Collinearity is very likely.
>
>Start with a correlation matrix of all 13 measurements [Statistics =>
>Correlate => Bivariate...]. Correlation coefficiants above, say, 0.80
>usually show that the inclusion of both variables is not necessary or is
>even counterproductive.
>
Actually, correlations are neither necessary nor sufficient for collinearity.
Much better to use condition indexes