Spearman's rank correlation is not so good for ties as Kendall's (so
Kendall said and he did study this subject).
Collapsing the interval scale is a waste of time. Never throw away
information.
Ordinal regression is very complicated for such a simple question.
Kruskal Wallis ignores the ordering of the categories and so throws away
information.
Martin
Liz wrote:
--
***************************************************
J. Martin Bland
Prof. of Health Statistics
Dept. of Health Sciences
Seebohm Rowntree Building Area 2
University of York
Heslington
York YO10 5DD
Email: mb...@york.ac.uk
Phone: 01904 321334
Fax: 01904 321382
Web site: http://www-users.york.ac.uk/~mb55/
***************************************************
I'd just offer one more alternative--the polyserial correlation
coefficient.
Advantage
Adjusts for censoring/discontinuity/ties in the ordered-category
variable
Potential Limitation
Assumes that the latent (unobserved), prediscretized version of the
ordered-category variable has a normal or near-normal distribution.
(Actually the assumption is that this latent variable, and your second,
interval-level measure are jointly distributed as bivariate normal).
The function, polyserial, available for R, will calculate the
polyserial correlation coefficient.
HTH
--
John Uebersax, PhD
I have a similar problem.
I am dealing with polytomous response data from a treatment adherence study
in cluster randomized clinical trial setup.
Some responses are coded as 1, 2, 3, 4, and 5.
there are some responses, where the patients give their socres on their
level of agreement on a certain question in a scale like
0 - 10- 20 - 30 - 40 ..... upto 100
every values are spaced by 10 units and the patients are asked to circle any
of these 11 values in the scale.
The clinicians are interested in the correlation coefficients between these
two types of response.
What would be the best approach - please suggest.
Regards.
Sanjoy
Dr. Sanjoy K. Paul
Senior Medical Statistician
University of Oxford
Diabetes Trial Unit
OCDEM, Churchill Hospital
Old Road
Headington
Oxford, OX3 7LJ
Tel: +44 (0)1865 857283
Fax: +44 (0)1865 857260
Email: sanjo...@dtu.ox.ac.uk
sambh...@hotmail.com
I'm going back to the original posting, since I think that subsequent
replies may have not quite addressed the real question.
The ordinal scale 0-3 (call it X) is clearly ranked in 0,1,2,3 order.
The very fact of seeking a "correlation" between this and the
"interval level measurement" (call it Y) -- i.e. ordinary numbers,
as I understand it -- implies that Liz is contemplating a monotonic
relationship (increasing or decreasing) between Y and X -- as X
moves upwards through 0,1,2,3 so the level of Y should move
upwards (or downwards), to within random scatter.
I say this straight out, since if the relationship is not
monotonic, then the notion of correlation makes less sense,
or even none.
So we should be looking for a measure of monotonic relationship
between the values of Y and the ordered categories of X. (I can
see no indication, and will make no asusmption about, whether
the values 0,1,2,3 of the ordinal scale have a quantitative or
"interval" meaning, or are merely ordered labels).
Now recall an earlier discussion about the "Williams test",
namely the ANOVA test devised by D.A. Williams to test a
null hypothesis of no difference between ordered groups
(labelled by X here), versus an alternative hypothesis that
the means differ, such that the power of the test is focussed
on differences which are monotonic with respect to X.
This thread on MedStats began with Jim Groenveld's posting
on 2 March 2006, and ended on 6 March. I quote in particular
from my reply on 2 March 19:15 --
"... the Williams test I'm aware of is designed to be applied
to the case where treatment groups receiving different dosages
are compared with a control or placebo group, AND it is expected
that the response will be monotonic in the dose (i.e. the higher
the dose, the greater the expected response.
"The basis of the test is to construct a series of contrasts
combining the group means in a specific order, and to apply
the test to the largest of these. The purpose of the test is
to target the power on alternatives which are of the 'monotonic'
type with respect to the group identities (identified according
to increasing dose).
"In contrast, the "usual" AOV methods treat all groups on the
same footing, without regard to order. Hence the Williams test
would be superior for monotonic alternatives, but likely to
be less powerful when the groups are "permutable" (unless by
chance the order of increasing effect in the groups was the
same as the order of the groups in the Williams test).
"The original Williams papers are in
Williams DA (1971). A test for differences between treatment
means when several dose levels are compared with a zero
dose control. Biometrics 27, 103-117.
Williams DA (1972). The comparison of several dose levels
with a zero dose control. Biometrics 28, 519-531.
"An outline of what goes on can be found at
http://www.stat.fi/isi99/proceedings/arkisto/varasto/brow0281.pdf
So I suggest that a similar approach would be suitable for
Liz's query, using the magnitude of the test statistic as
a measure of correlation. NOTE that the test depends essentially
on a regression of Y against *categorical* X, using contrasts
which reflect the targeted monotonicity.
If it would be of interest, I can try to work out details of
this and post them to the list.
As a final comment: Martin Bland suggested using a rank correlation
approach. This -- contrary to his stern (and comendable) injunction
"Never throw away information" -- throws away the information in the
values of Y (over and above the information in their ranks). It may
be that the distribution of the Y-values -- overall or within the
categories of X -- is so intangible that one is driven to resort
to a distribution-free method such as a rank-correlation, but that
is a matter of fact which depends on the context of the query and
can only be addressed in terms of the context and of the data on Y.
Best wishes to all,
Ted.
--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.H...@nessie.mcc.ac.uk>
Fax-to-email: +44 (0)870 094 0861
Date: 04-Aug-06 Time: 13:38:46
------------------------------ XFMail ------------------------------
Does the Williams test have advantages over using an ordinary
regression approach, keeping the ordinal levels as a 0-3 score?
Polynomial alternatives to the linear score could also be constructed
and compared to a factored (3 indicator variables) treatment which
would be the order "thrown away" comparison model.
An additional reminder that the Jonckheere-Terpstra test would be a
rank (permutation) alternative if the assumption of equal variances of
residuals across levels of the score variable were violated.
--
David Winsemius
It has the property that it need only depend on the ordering of the
categories (despite the fact that it was originally applied in
a context where the levels were ordered by a numerical variable,
namely the dose level).
> An additional reminder that the Jonckheere-Terpstra test would be a
> rank (permutation) alternative if the assumption of equal variances
> of residuals across levels of the score variable were violated.
Yes, that's also a possibility! (With its own properties, of course).
Best wishes,
Ted.
--------------------------------------------------------------------
E-Mail: (Ted Harding) <Ted.H...@nessie.mcc.ac.uk>
Fax-to-email: +44 (0)870 094 0861
Date: 04-Aug-06 Time: 15:28:33
------------------------------ XFMail ------------------------------
Finally: there is nothing wrong with using an ordinal logistic
regression model in this case if you know what you are doing, and
understand the underlying assumptions behind the model and can to
interpret the parameter values that are reported. In this respect the
linear models have an advantage: they are easier to understand and to
interpret.
Valter Sundh
Dept. of Community Medicine and Public Health
Sahlgrenska Academy,
Göteborg, Sweden