On Fri, 24 May 2013 13:55:05 -0700 (PDT), Ray Koopman <
koo...@sfu.ca>
wrote:
oops! I see that I wasn't being very clear-sighted here.
There is no problem in telling apart these uniform and beta
data, based on raw data points, with even a few dozen points.
And, if the problem is one of restricting the test to a test
on kurtosis -- using the raw points -- it still would take an
N of only a few hundred for very high confidence.
>
>I should have been more explicit. On each of 100 trials, I compared
>the skewness of 20,000 means of 50 Betas to the skewness of 20,000
>means of 50 Uniforms. On 98 of those trials, the Beta skewness was
>algebraically less than the Uniform skewness. Later I repeated the
>experiment (with a different random number generator) using 1000
>trials. On 997 of those trials, Beta skew < Uniform skew.
Okay. This is an illustration of the power of testing using
a "50% test": Which is larger? Also, it is confirmation of the
validity of the slight approximation that achieved a point
estimate of the expected value of 2.7 for the overall test.
The p-value of z=2.7 is 0.0069, which suggests the 95% CI
results of (0 to 3) missed tests out of 100, and about (2 to 13)
missed out of 1000.
The result of 3-in-1000, inverting, implies a point estimate for z
of 2.97; the result of 2-in-100 similarly suggests z= 2.05.
>
>>
>> Given "98 out of 100," I imagine that every one of the samples
>> based on beta had skew in the same direction, and that you have
>> at least moderate power for testing "non-uniform" when testing
>> any one of them.
>
>I took the OP's question to be: using using only two sets of 20,000
>means of 50, one set from each of the two distributions, could we say
>which set used which distribution? My conclusion is that we could be
>reasonably confident that the set with the algebraically lower skew
>used Beta.
Thanks for spelling out in detail.
--
Rich Ulrich