I keep reading that IRT (at least the one parameter model) is sample independent.I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.So I will not need to worry about representative samples to the same extent as I had to wit CTT.
My experience however is that whenever I create a mirt model from a sample (no matter how skewed it is) the sample ability mean will always be very close to 0.This suggests to me that if I oversampled clever people when creating the model I will still get a higher estimate for theta=0,and if I used that model I would still underestimate the ability of the population just like it used to be with CTT.I guess my bottomline question is: do I still need to have a representative sample if I used a simple Rasch or a 2PL model, or is an IRT model really sample independent?If it is sample independent, what does that mean and how could I show that?
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Straight out of wikipedia:"The one-parameter model (1PL) assumes that guessing is a part of the ability and that all items that fit the model have equivalent discriminations, so that items are only described by a single parameter ( {\displaystyle b_{i}} b_{i}). This results in one-parameter models having the property of specific objectivity, meaning that the rank of the item difficulty is the same for all respondents independent of ability, and that the rank of the person ability is the same for items independently of difficulty. "Of course, this is only true asymptotically. In practice you don't have the true intercepts parameters, so the rank-ordering of the individuals is only guaranteed to be true in larger samples (particularly when items with similar difficulty values exist, because they are likely to be exchanged due to sampling error).On Tue, Oct 18, 2016 at 9:40 AM, Balazs Klein <balazs...@gmail.com> wrote:I keep reading that IRT (at least the one parameter model) is sample independent.I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.So I will not need to worry about representative samples to the same extent as I had to wit CTT.Oh my no, that's never true. You ALWAYS need a representative sample for the population with which you are trying to make inferences to. Rasch/1PL are not magical, they are just models that need to be calibrated.
My experience however is that whenever I create a mirt model from a sample (no matter how skewed it is) the sample ability mean will always be very close to 0.This suggests to me that if I oversampled clever people when creating the model I will still get a higher estimate for theta=0,and if I used that model I would still underestimate the ability of the population just like it used to be with CTT.I guess my bottomline question is: do I still need to have a representative sample if I used a simple Rasch or a 2PL model, or is an IRT model really sample independent?If it is sample independent, what does that mean and how could I show that?You could only show it as a didactic example, because sample data will never show this property (how could it? You only ever have estimates of the parameters, not the parameters themselves).
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package...@googlegroups.com.
Phil Chalmers於 2016年10月19日星期三 UTC+8上午11時08分28秒寫道:Straight out of wikipedia:"The one-parameter model (1PL) assumes that guessing is a part of the ability and that all items that fit the model have equivalent discriminations, so that items are only described by a single parameter ( {\displaystyle b_{i}} b_{i}). This results in one-parameter models having the property of specific objectivity, meaning that the rank of the item difficulty is the same for all respondents independent of ability, and that the rank of the person ability is the same for items independently of difficulty. "Of course, this is only true asymptotically. In practice you don't have the true intercepts parameters, so the rank-ordering of the individuals is only guaranteed to be true in larger samples (particularly when items with similar difficulty values exist, because they are likely to be exchanged due to sampling error).On Tue, Oct 18, 2016 at 9:40 AM, Balazs Klein <balazs...@gmail.com> wrote:I keep reading that IRT (at least the one parameter model) is sample independent.I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.So I will not need to worry about representative samples to the same extent as I had to wit CTT.Oh my no, that's never true. You ALWAYS need a representative sample for the population with which you are trying to make inferences to. Rasch/1PL are not magical, they are just models that need to be calibrated.Using MML requires the assumption of person distribution, but not the case for conditional maximum likelihood method specifically for the Rasch model. See eRm package.
My experience however is that whenever I create a mirt model from a sample (no matter how skewed it is) the sample ability mean will always be very close to 0.This suggests to me that if I oversampled clever people when creating the model I will still get a higher estimate for theta=0,and if I used that model I would still underestimate the ability of the population just like it used to be with CTT.I guess my bottomline question is: do I still need to have a representative sample if I used a simple Rasch or a 2PL model, or is an IRT model really sample independent?If it is sample independent, what does that mean and how could I show that?You could only show it as a didactic example, because sample data will never show this property (how could it? You only ever have estimates of the parameters, not the parameters themselves).I would say "no model" can exactly be confirmed by limited sample data. You can, however, find out the "better" one among model candidates. Using item fit to check whether the Rasch model could approximate the data (generated by unknown model) good enough is a practical way. But criteria of "good enough" is somewhat subjective.
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package+unsubscribe@googlegroups.com.
On Wed, Oct 19, 2016 at 6:34 AM, Keith Lau <genw...@gmail.com> wrote:
Phil Chalmers於 2016年10月19日星期三 UTC+8上午11時08分28秒寫道:Straight out of wikipedia:"The one-parameter model (1PL) assumes that guessing is a part of the ability and that all items that fit the model have equivalent discriminations, so that items are only described by a single parameter ( {\displaystyle b_{i}} b_{i}). This results in one-parameter models having the property of specific objectivity, meaning that the rank of the item difficulty is the same for all respondents independent of ability, and that the rank of the person ability is the same for items independently of difficulty. "Of course, this is only true asymptotically. In practice you don't have the true intercepts parameters, so the rank-ordering of the individuals is only guaranteed to be true in larger samples (particularly when items with similar difficulty values exist, because they are likely to be exchanged due to sampling error).On Tue, Oct 18, 2016 at 9:40 AM, Balazs Klein <balazs...@gmail.com> wrote:I keep reading that IRT (at least the one parameter model) is sample independent.I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.So I will not need to worry about representative samples to the same extent as I had to wit CTT.Oh my no, that's never true. You ALWAYS need a representative sample for the population with which you are trying to make inferences to. Rasch/1PL are not magical, they are just models that need to be calibrated.Using MML requires the assumption of person distribution, but not the case for conditional maximum likelihood method specifically for the Rasch model. See eRm package.This is actually a common misinterpretation of CML, so I think I'll help clear it up here. CML is just a method to obtain ML estimates for item parameters through conditioning on a sufficient statistic supplied by the data. MML achieves the same goal, but specifies a latent distribution (i.e., a range of uncertainty) over which each individual could be located. However, when choosing the Gaussian distribution (and the model is just identified), maximal uncertainty about the location of \theta is defined, and MML parameter estimates become identical to CML.
Phil Chalmers於 2016年10月19日星期三 UTC+8下午10時58分40秒寫道:On Wed, Oct 19, 2016 at 6:34 AM, Keith Lau <genw...@gmail.com> wrote:
Phil Chalmers於 2016年10月19日星期三 UTC+8上午11時08分28秒寫道:Straight out of wikipedia:"The one-parameter model (1PL) assumes that guessing is a part of the ability and that all items that fit the model have equivalent discriminations, so that items are only described by a single parameter ( {\displaystyle b_{i}} b_{i}). This results in one-parameter models having the property of specific objectivity, meaning that the rank of the item difficulty is the same for all respondents independent of ability, and that the rank of the person ability is the same for items independently of difficulty. "Of course, this is only true asymptotically. In practice you don't have the true intercepts parameters, so the rank-ordering of the individuals is only guaranteed to be true in larger samples (particularly when items with similar difficulty values exist, because they are likely to be exchanged due to sampling error).On Tue, Oct 18, 2016 at 9:40 AM, Balazs Klein <balazs...@gmail.com> wrote:I keep reading that IRT (at least the one parameter model) is sample independent.I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.So I will not need to worry about representative samples to the same extent as I had to wit CTT.Oh my no, that's never true. You ALWAYS need a representative sample for the population with which you are trying to make inferences to. Rasch/1PL are not magical, they are just models that need to be calibrated.Using MML requires the assumption of person distribution, but not the case for conditional maximum likelihood method specifically for the Rasch model. See eRm package.This is actually a common misinterpretation of CML, so I think I'll help clear it up here. CML is just a method to obtain ML estimates for item parameters through conditioning on a sufficient statistic supplied by the data. MML achieves the same goal, but specifies a latent distribution (i.e., a range of uncertainty) over which each individual could be located. However, when choosing the Gaussian distribution (and the model is just identified), maximal uncertainty about the location of \theta is defined, and MML parameter estimates become identical to CML.
Thanks for the illustrations of simulations. I agree CML is a sort of ML estimation, but the algorithm of CML indeed does not require specifying prior distribution of persons, which does not mean the parameter estimators would be necessarily different from those of MML. In your simulation, I feel it is demonstrating the robustness of using a normal distribution but violated in reality. Somewhat similar to the Bayesian inference that we need to specify a prior distribution, although now we are talking about in the frequentist inference. We cannot say much about whether the prior distribution is correct, but sensitivity analysis could help here, as you demonstrated.
But I am still curious why so some studies claim the effect of nonnormal distribution on the parameter estimation. Do they contradict your simulation results? See
http://epm.sagepub.com/content/76/4/662
https://www.ncbi.nlm.nih.gov/pubmed/16953704
http://epm.sagepub.com/content/74/2/343.abstract
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package+unsubscribe@googlegroups.com.
Imposing and estimating prior distribution are different things.
Nonparametric IRT like Rasmay method (as I know) uses sum scores to estimate person distribution. It does not mean CML needs a prior distribution assumption. I will thank if you can indicate which steps of CML algorithm require that assumption where I might misunderstand.
And sorry for this long off the track, Phil.
robi...@ipn.uni-kiel.de於 2016年10月23日星期日 UTC+8下午4時07分26秒寫道:
> Just to put it in a different way. It is often shown in the literature that conditional maximum likelihood estimation is equivalent to a marginal maximum likelihood estimation with a nonparametrically estmated distribution (e.g. Formann or Verhelst). So, the statement that CML do not need distributional assumption contrary to MML is not correct. Furthermore, the true (nonparametric) theta distribution can be only identified asymptotically (number of items tending to infinity) for both (CML and MML) estimation approaches.
>
> Am Dienstag, 18. Oktober 2016 15:40:17 UTC+2 schrieb Balazs Klein:
> I keep reading that IRT (at least the one parameter model) is sample independent.
> I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.
> So I will not need to worry about representative samples to the same extent as I had to wit CTT.
>
>
> My experience however is that whenever I create a mirt model from a sample (no matter how skewed it is) the sample ability mean will always be very close to 0.
>
> This suggests to me that if I oversampled clever people when creating the model I will still get a higher estimate for theta=0,
> and if I used that model I would still underestimate the ability of the population just like it used to be with CTT.
>
>
> I guess my bottomline question is: do I still need to have a representative sample if I used a simple Rasch or a 2PL model, or is an IRT model really sample independent?
> If it is sample independent, what does that mean and how could I show that?
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package+unsubscribe@googlegroups.com.
On Sun, Oct 23, 2016 at 10:53 AM, Keith Lau <genw...@gmail.com> wrote:Imposing and estimating prior distribution are different things.Right. What Alexander is referring to is a kind of empirical Bayes approach to the problem. Either way, from my perspective such an issue is largely trivial. I was merely showing the MML and CML are basically identical in all practical respects, though I tend to favour MML for other more obvious reasons (the assumption of latent distributions never bothered me so much as it's not really of consequence with enough data....priors tend to disappear asymptotically, like they tend to in SEM and other latent variable models).
Nonparametric IRT like Rasmay method (as I know) uses sum scores to estimate person distribution. It does not mean CML needs a prior distribution assumption. I will thank if you can indicate which steps of CML algorithm require that assumption where I might misunderstand.I don't think Ramsay ever claimed the leave-one-out total score to be sufficient in any absolute sense, more that it's a reasonable and efficient mechanism to condition on to obtain kernal smoothed curves for exploratory/functional analyses. But yes I know what you are saying.Although the total score is a sufficient statistic (in the sense of a mathematical fact) for CML estimation it is still only a stand-in to the true ability and requires the assumption that it is actually sufficient (as a philosophical/validity fact). E.g, a test with only 3 dichotomous items will have 4 total possible scores, but this hardly means that the unobserved abilities are literally sufficient to captured by the empirical information about the distribution of the abilities (surely there are more than 4 possible abilities in the population!). So, statistical sufficiency becomes an assumption about estimability of intercepts assuming the total score is "good enough", but not about validity sufficiency for representation of the underlying abilities (which requires nitems to tend to infty). Hence, why I call it just a trick, which requires an assumption much the same as MML. In the end, something has to be assumed for these models to work: either the assumption that the (potentially weighted) total score is sufficient to capture the ability parameters, or we assume something about the underlying distribution directly and attempt to average over this distribution.
And sorry for this long off the track, Phil.Not at all, it's why this forum is here. Cheers.Phil
robi...@ipn.uni-kiel.de於 2016年10月23日星期日 UTC+8下午4時07分26秒寫道:
> Just to put it in a different way. It is often shown in the literature that conditional maximum likelihood estimation is equivalent to a marginal maximum likelihood estimation with a nonparametrically estmated distribution (e.g. Formann or Verhelst). So, the statement that CML do not need distributional assumption contrary to MML is not correct. Furthermore, the true (nonparametric) theta distribution can be only identified asymptotically (number of items tending to infinity) for both (CML and MML) estimation approaches.
>
> Am Dienstag, 18. Oktober 2016 15:40:17 UTC+2 schrieb Balazs Klein:
> I keep reading that IRT (at least the one parameter model) is sample independent.
> I thought it means that even if I had a non-representative, biased sample the model would reflect the properties of the whole population.
> So I will not need to worry about representative samples to the same extent as I had to wit CTT.
>
>
> My experience however is that whenever I create a mirt model from a sample (no matter how skewed it is) the sample ability mean will always be very close to 0.
>
> This suggests to me that if I oversampled clever people when creating the model I will still get a higher estimate for theta=0,
> and if I used that model I would still underestimate the ability of the population just like it used to be with CTT.
>
>
> I guess my bottomline question is: do I still need to have a representative sample if I used a simple Rasch or a 2PL model, or is an IRT model really sample independent?
> If it is sample independent, what does that mean and how could I show that?
--
You received this message because you are subscribed to the Google Groups "mirt-package" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mirt-package...@googlegroups.com.