JMLE is inconsistent!

Reviewers may reject Winsteps and its estimation method, JMLE, because they are "inconsistent", quoting perhaps

Christensen (2012) "“Estimation of the item parameters using the joint likelihood function leads to inconsistent item parameter estimates because the number of parameters increases with the number of persons”, and recommending CMLE or MMLE

 

Response: It is true that JMLE (UMLE, UCON) is statistically inconsistent for infinite data. For finite data  this is seen as "estimation bias". During many discussions/arguments in the 1970s and 1980s it was discovered that the JMLE estimation bias is inconsequential for most datasets and can easily be corrected where it is consequential . See, for instance, www.rasch.org/memo45.htm which appeared in Applied Psychological Measurement 12 (3) pp. 315-318, September 1988. and, more recently, "Overall, the differences between the results produced with the three estimation methods [CMLE, JMLE, MMLE] were negligible, and the discrepancies observed between datasets were attributable to the software choice as opposed to the estimation method." in Nicklin C., Vitta J.P. (2022) Assessing Rasch measurement estimation methods across R packages with yes/no vocabulary test data. Language Testing.

 

Of course, MMLE is estimation-biased if the person distribution mismatches the assumed person theta distribution (which it always does, empirical data never matches a theoretical distribution, - as the reviewer implicitly admits). CMLE, as usually implemented, is estimation-biased for the person measures (thetas) - see my note in Rasch Measurement Transactions - www.rasch.org/rmt/rmt331.pdf "CMLE – a Problem, its Solution and a Useful Approximation". Further depending on the nature of your data and analysis, CMLE and MMLE may be impossible to implement.

 

In my experience over 40 years JMLE estimation bias is only consequential for pairwise data, so Winsteps has a special command for this unique situation: "PAIREDdata=Yes". In other situations, I recommend against bias correction, STBIAScorrection=Yes in Winsteps, because of its side-effects. For instance, only for uncorrected JMLE can we complete this loop: raw person and item scores with original data -> item estimates and person estimates -> original person and item raw scores. This is important if we intend to predict person raw scores and item p-values for future data from the current estimates.

 


 

Every estimation method has advantages and disadvantages. A somewhat obscure disadvantage, and almost irrelevant for practical purposes, is the lack of "statistical consistency" of JMLE estimates. Here is the idea. Suppose we had an infinite amount of data, and used JMLE to estimate the parameter values from it. Would those parameter values be the "true" values of the parameters? The answer is no! The worst case is a test of only two dichotomous items. If the true difference between the difficulties of the two items is one logit. JMLE would report two logits difference! This indicates that JMLE is "statistical inconsistent". However, this two-to-one "estimation bias" is easy to correct. We simply divide all the estimates by two! In Winsteps, this is done automatically with the instruction "PAIRED=Yes". With longer tests, the correction is (number of items-1)/(number of items), corrected by STBIAS=YES. For longer tests, this correction quickly becomes meaninglessly small.

 

In fact, even for short tests, the correction for estimation bias is unnecessary unless we need to make very exact inferences about small logit differences. However, we discover that the correction is smaller than the standard errors of the estimates. We are in the situation that Quality-Control guru, W. E. Deming, pointed out is over-correction when applied to industrial machinery. We are trying to adjust the estimation process within its area of uncertainty. The result, Deming demonstrated, is that the outcome of the process becomes worse, not better!

 

Estimation bias, the practical consequence of statistical inconsistency, does not change the hierarchy of item difficulties and person abilities. So, if you are applying a user-friendly rescaling (USCALE=) to the logit values, the estimation bias is entirely inconsequential.

 

On the other hand, estimation methods that are statically consistent (with infinite data), such as CMLE, MMLE, PMLE, can also have estimation bias (with finite data) depending on the characteristics of those data. For instance, MMLE imposes a theoretical distribution of person abilities on the person parameters. This may or may not be a good match to the actual distribution of those parameters. PMLE uses the data in an uneven way during the estimation process resulting in estimation bias. CMLE, MMLE and PMLE are all asymmetric in the way they estimate the item and person parameters. This is fine if we think of a fixed set of items and a potentially infinite sample of persons, but in other applications of Rasch measurement, both the "items" and "persons" are fixed, or both the "items" and "persons" are potentially infinite. JMLE is symmetric in its estimates. Transpose the data matrix, and the absolute differences between the estimates for specific items and/or specific persons do not change. For those other estimation methods, those differences do change.


Help for Winsteps Rasch Measurement and Rasch Analysis Software: www.winsteps.com. Author: John Michael Linacre

Facets Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Minifac download
Winsteps Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Ministep download

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn, 2024 George Engelhard, Jr. & Jue Wang Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes Statistical Analyses for Language Testers (Facets), Rita Green Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind Rasch Measurement: Applications, Khine Winsteps Tutorials - free
Facets Tutorials - free
Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse Rasch Measurement Theory Analysis in R, Wind, Hua Applying the Rasch Model in Social Sciences Using R, Lamprianou El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Rasch Models for Measurement, David Andrich Constructing Measures, Mark Wilson Best Test Design - free, Wright & Stone
Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias Diseño de Mejores Pruebas - free, Spanish Best Test Design A Course in Rasch Measurement Theory, Andrich, Marais Rasch Models in Health, Christensen, Kreiner, Mesba Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen
As an Amazon Associate I earn from qualifying purchases. This does not change what you pay.

facebook Forum: Rasch Measurement Forum to discuss any Rasch-related topic

To receive News Emails about Winsteps and Facets by subscribing to the Winsteps.com email list,
enter your email address here:

I want to Subscribe: & click below
I want to Unsubscribe: & click below

Please set your SPAM filter to accept emails from Winsteps.com
The Winsteps.com email list is only used to email information about Winsteps, Facets and associated Rasch Measurement activities. Your email address is not shared with third-parties. Every email sent from the list includes the option to unsubscribe.

Questions, Suggestions? Want to update Winsteps or Facets? Please email Mike Linacre, author of Winsteps mike@winsteps.com


State-of-the-art : single-user and site licenses : free student/evaluation versions : download immediately : instructional PDFs : user forum : assistance by email : bugs fixed fast : free update eligibility : backwards compatible : money back if not satisfied
 
Rasch, Winsteps, Facets online Tutorials


 

 
Coming Rasch-related Events: Winsteps and Facets
Oct 21 - 22 2024, Mon.-Tues. In person workshop: Facets and Winsteps in expert judgement test validity - UNAM (México) y Universidad Católica de Colombia. capardo@ucatolica.edu.co, benildegar@gmail.com
Oct. 4 - Nov. 8, 2024, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Jan. 17 - Feb. 21, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
May 16 - June 20, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

 

 

Our current URL is www.winsteps.com

Winsteps® is a registered trademark