I have done a sample size calculation for a continuous outcome between 2 groups (based on t-test). However the outcome will be heavily skewed so I would like to inflate the sample size to account for this. 2 questions: (1) do you think this is necessary? (2) have you any suggestions as to the inflation factor and a reference to back it up? I have currently used 15% as I've seen this used in a protocol but unable to find any firmer foundation. Thanks for your help.
I have done a sample size calculation for a continuous outcome between 2 groups (based on t-test). However the outcome will be heavily skewed so I would like to inflate the sample size to account for this. 2 questions: (1) do you think this is necessary? (2) have you any suggestions as to the inflation factor and a reference to back it up? I have currently used 15% as I've seen this used in a protocol but unable to find any firmer foundation. Thanks for your help.
--
--
To post a new thread to MedStats, send email to MedS...@googlegroups.com .
MedStats' home page is http://groups.google.com/group/MedStats .
Rules: http://groups.google.com/group/MedStats/web/medstats-rules
---
You received this message because you are subscribed to the Google Groups "MedStats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to medstats+u...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
This is a straightforward paper in the Journal: Optometry today, 2010 (July)
Prajapati, Bhavna; Dunne, Mark C.M. and Armstrong, Richard A. “Sample size estimation and statistical power analyses”
They use the context of the free software GPower3. On p4, Table 2 shows the asymptotic relative efficiency (ARE) of parametric/non-parametric tests.
There is a 95.5% efficiency between many of the common non-parametric tests, as compared to the parametric tests.
“The sample size required for a non-parametric test is
determined by multiplying the sample size calculated for an equivalent
parametric test by a correction factor. This correction factor is referred
to as the asymptotic relative efficiency (ARE) and as first described by
Pitman[13]”. Of course, AER is not the same as power. So the paper recommends an inflation factor of 1/0.955 of the t-test sample size for a study using the non-parametric equivalent (i.e. Mann-Whitney U).
If you were going to use a non-parametric test instead, then the sample size would need inflating & here is a reference I have seen used. .... They use the context of the free software GPower3. On p4, Table 2 shows the asymptotic relative efficiency (ARE) of parametric/non-parametric tests. ... There is a 95.5% efficiency between many of the common non-parametric tests, as compared to the parametric tests. .... "The sample size required for a non-parametric test is determined by multiplying the sample size calculated for an equivalent parametric test by a correction factor. This correction factor is referred to as the asymptotic relative efficiency (ARE) and as first described by Pitman[13]. Of course, AER is not the same as power. So the paper recommends an inflation factor of 1/0.955 of the t-test sample size for a study using the non-parametric equivalent (i.e. Mann-Whitney U).
The Wilcoxon-Mann-Whitney (WMW) test is often used to compare the means or medians of two independent, possibly nonnormal distributions. For this problem, the true significance level of the large sample approximate version of the WMW test is known to be sensitive to differences in the shapes of the distributions. Based on a wide ranging simulation study, our paper shows that the problem of lack of robustness of this test is more serious than is thought to be the case. In particular, small differences in variances and moderate degrees of skewness can produce large deviations from the nominal type I error rate. This is further exacerbated when the two distributions have different degrees of skewness. Other rank-based methods like the Fligner-Policello (FP) test and the Brunner-Munzel (BM) test perform similarly, although the BM test is generally better. By considering the WMW test as a two-sample T test on ranks, we explain the results by noting some undesirable properties of the rank transformation. In practice, the ranked samples should be examined and found to sufficiently satisfy reasonable symmetry and variance homogeneity before the test results are interpreted.
Copyright (c) 2009 John Wiley & Sons, Ltd.
Thank you all VERY much for all your advice which was extremely useful. I appreciate you taking the time to answer.
Thank you all VERY much for all your advice which was extremely useful. I appreciate you taking the time to answer.
--