> I'm analysing some data on 30 individuals all with a medical
> condition. I have scores on a test that they performed. I also have
> a large age and sex matched normative data set with about 2000
> 'healthy' individuals with scores for the same test.
>
> I would like to show that people with the medical condition perform
> worse on the test than would be expected. Can I legitimately conduct
> a test for a difference in mean scores between the 2 groups?
This is a common concern that people raise, but it is a false concern
for the most part. I've written about this at my old website:
* http://www.childrensmercy.org/stats/weblog2004/UnequalSampleSizes.asp
* http://www.childrensmercy.org/stats/ask/unequal.asp
You should be careful anytime you have unequal sample sizes in the two
groups, but there is nothing that disallows an analysis when the sample
sizes are unequal, even grossly unequal. If you have unequal variances
in the two groups, then the unequal sample sizes will exacerbate that
problem. But if everything else is fine, the only difficulty caused by
unequal sample sizes is that you have to use slightly more complicated
formulas.
> Should I use a nonparametric test (Wilcoxin rank sign test) because
> of the huge discrepancy in sample sizes?
The unequal sample sizes, by itself, is no reason to use a
non-parametric test. Furthermore, the most troublesome potential
violation of assumptions, unequal variances, would cause just as many
problems with a rank-based nonparametric test. Nonparametric is
synonymous with fewer assumptions, but even nonparametric tests are not
assumption-free.
> Is it silly to calculate the effect size change (Cohen's d) using a
> a pooled estimate of the standard deviation in the scores i.e.
> Cohen's d= mean difference/pooled SD.
I would argue that it is always silly to calculate the effect size
change, but there is nothing inherent in unequal sample sizes that makes
an effect size calculation more problematic.
Bottom line: don't let unequal sample sizes ruin your day.
--
Steve Simon, Standard Disclaimer
Sign up for The Monthly Mean, the newsletter that
dares to call itself "average" at www.pmean.com/news
-- --------------------------------------------------------------------- Karl Schlag Professor Tel: +43-1-4277-374-37 Department of Economics Fax: +43-1-4277-9374 University of Vienna email: karl....@univie.ac.at Hohenstaufengasse 9 Room: 505 1010 Vienna, Austria http://homepage.univie.ac.at/karl.schlag/
--
To post a new thread to MedStats, send email to MedS...@googlegroups.com .
MedStats' home page is http://groups.google.com/group/MedStats .
Rules: http://groups.google.com/group/MedStats/web/medstats-rules
I addition to Karl......i think you can still use the two sample t-test even if the variances are unequal but you then need to use Satterthwaite�s or Welch's approximation formula to approximate the degrees of freedom of the test. Note this is an approximate to the "true" df!!!
Have you plotted the data and compared the 2 groups that way? (I have heard this referred to as the interocular concussion test).
Would you be interested in differences in the variances? We often treat the variance as a nuisance parameter that we have to deal with, but in many cases (and this looks like one of them to me) a difference in variance between 2 groups would be of interest whether the means also differ or not.
Are you able/willing to explain what the Wilcoxin test is doing? If you want a non-parametric test with a simpler interpretation then you might want to consider a permutation test (the Wilcoxin is one particular case of a permutation test).
--
Gregory (Greg) L. Snow Ph.D.
Statistical Data Center
Intermountain Healthcare
greg...@imail.org
801.408.8111
Sent using BlackBerry® from Orange
I addition to Karl......i think you can still use the two sample t-test even if the variances are unequal but you then need to use Satterthwaite’s or Welch's approximation formula to approximate the degrees of freedom of the test. Note this is an approximate to the "true" df!!!
--
To post a new thread to MedStats, send email to MedS...@googlegroups.com .
MedStats' home page is http://groups.google.com/group/MedStats .
Rules: http://groups.google.com/group/MedStats/web/medstats-rules