Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

Q how to maintain the same characteristic of validation unit

5 views
Skip to first unread message

Cosine

unread,
Apr 17, 2023, 6:50:36 AM4/17/23
to
Hi:

When collecting a new set of samples for validation is expensive, one alternative is to use the k-fold validation to mimic the situation in that we have k sets of samples for validation.

However, we have to make sure these k units have identical characteristics. For example, if we have the numbers of normal patients and diseased ones to be 2:3 in the original set of samples, we ought to maintain this ratio for each of the k units. Suppose the original set has two types of patients 2*n1 and 3*n2. We have to choose the k to be a common factor of n1 and n2 so that the two types of patients in each of the k units are 2*m1 and 3*m2, where n1 = k*m1 and n2 = k*m2.

What if both n1 and n2 are prime numbers? More generally, what if n1 and n2 do not have a common factor?



David Jones

unread,
Apr 17, 2023, 2:19:07 PM4/17/23
to
THe obvious solution here is to do the analyses in terms of quantities
whose interpretation is not sample-size dependent.
0 new messages