Table 44.1 Global statistics

 

Global Statistics:

Active KID: 35, non-extreme: 34

Active TAP: 18, non-extreme: 14

Active datapoints: 630

Missing datapoints: none

Non-extreme datapoints: 476, ln(476) = 6.1654

Standardized residuals N(0,1): mean: -.009 P.SD: .827 count: 476.00

 

Fit Indicators:

Equal-item-discriminations test of Parallel ICCs/IRFs: 7.4177 with 13 d.f., probability = .9174

van den Wollenberg Q1 test of Parallel ICCs/IRFs: 64.5188 with 117 d.f., probability = 1.0000

Estimated Free Parameters = Lesser of (Non-extreme KID or Non-extreme TAP) - 1 + Sum(Thresholds - 1) = 17

Analysis Degrees of freedom (d.f.) by counting = 476 - 17 = 459

Log-Likelihood chi-squared: 221.6118 with 459 d.f., probability = 1.0000

Pearson Global chi-squared: 325.2167 with 459 d.f., probability = 1.0000

Akaike Information Criterion, AIC = (2 * parameters) + chi-squared = 255.6118

Schwarz Bayesian Information Criterion, BIC = (parameters * ln(non-extreme datapoints)) + chi-squared = 326.4239

Global Root-Mean-Square Residual: .2318 with expected value: .2344 count: 630.00

Capped Binomial Deviance: .0802 with expected value: .0858 count: 630.00 dichotomies

 

Global statistics:

Statistics based on the currently-selected data, omitting permanently and temporarily deleted, deselected and dropped items and persons

Active person, weighted, non-extreme

Persons currently active in this analysis. Weighted count only if active persons are weighted. Non-extreme persons do not have extreme scores (0%, 100% scores)

Active item, weighted, non-extreme

Items currently active in this analysis. Weighted count only if active items are weighted.

Active datapoints, weighted, non-extreme

Observations/responses currently active in this analysis (includes extreme scores).  Weighted count only if active items or persons are weighted. Non-extreme items do not have extreme scores (0%, 100% scores)

Missing datapoints, % of Active+Missing datapoints

Observations/responses not active in this analysis, usually because they do not have scored values. If any potentially active data are missing, then percent that are active

Non-extreme datapoints, weighted, non-extreme

Observations/responses currently active in this analysis (excludes extreme scores). Weighted count if any active persons or items are weighted.

Standardized residuals N(0,1):

mean: P.SD: count:

Standardized Residuals are modeled to have a unit normal distribution. Large departures from mean of 0.0 and standard deviation of 1.0 indicate that the data do not conform to the basic Rasch model specification that randomness in the data be normally distributed do that standardized residuals to be close to mean 0.0, P.SD 1.0. The count is of relevant observations.

Fit Indicators:


Equal-item-discriminations test of Parallel ICCs/IRFs: .... with .... d.f., probability = ....

Parallel ICCs are tested using a fixed-effects chi-squared test of the item discriminations, DISCRIM=Yes in Table 14, etc. - The degrees of freedom are (number of active items  - 1).

van den Wollenberg Q1 test of Parallel ICCs/IRFs: .... with .... d.f., probability = ....

The Q1 test (1982, 1995 - below) for parallel ICCs is modified to include polytomous ICCs and missing data by stratifying the person sample by ability (theta) instead of raw score.

Estimated Free Parameters = Lesser of (Non-extreme persons or Non-extreme items) - 1 + Sum(Thresholds - 1)

Count of free parameters. Anchoring is ignored. Item and Person weights are included

Analysis Degrees of freedom (d.f.) by counting

Weighted count of datapoints in non-extreme response strings less the count of free parameters.

Log-Likelihood chi-squared: .... with ... d.f., probability = ..

The chi-square value asymptotically = -2 * log-likelihood of the active datapoints. It is based on the currently-reported estimates which may depart noticeably from the "true" maximum likelihood estimates for these datapoints. The probability that these data fit the Rasch model globally. Despite good global fit, there can be considerable local misfit in Tables 6 and 10.

Pearson Global chi-squared: ... with ... d.f., probability = ....

This chi-squared is calculated using the squared standardized residuals, which approximate unit-normal deviates.

Cressie, N., & Read, T. R. (1989). Pearson's X2 and the log-likelihood ratio statistic G2: a comparative review. International Statistical Review, 57, 19-43.

Akaike Information Criterion, AIC

AIC = 2*k - 2*ln(L) where k = parameters, and -2*ln(L) is the log-likelihood chi-square.

Used for model comparison: higher values indicate worse adjusted fit

Schwarz Bayesian Information Criterion, BIC

BIC = ln(n)*k - 2*ln(L) where n is the number of Non-extreme datapoints. Used for model comparison: higher values indicate worse adjusted fit. ln() = natural logarithm of count of datapoints

Global (Weighted) Root-Mean-Square Residual (RMSR) with expected value:

This is √(∑(X-E)²) where the sum is across X, each of the observations, and E, the expectation of each observation according to the Rasch model. Weighting is applied to the data if IWEIGHT= or PWEIGHT= are specified. The expected value of the RMSR according to the Rasch model. RMSR values smaller than the expected value indicate better fit (or overfit) to the Rasch model.

Capped (Weighted) Binomial Deviance (CBD) = ... with expected value ... for ... dichotomies

This is the average of -[X*LOG10(E) + (1-X)*LOG10(1-E)] for all dichotomous observations where X=0,1 is the observation and E is its Rasch-model expectation. E is limited to the range 0.01 to 0.99. Weighting is applied to the data if IWEIGHT= or PWEIGHT= are specified.

Glickman, Mark E. "Parameter estimation in large dynamic paired comparison experiments." Journal of the Royal Statistical Society: Series C (Applied Statistics) 48.3 (1999): 377-394.

The expected value of the CBD according to the Rasch model. CBD values smaller than the expected value indicate better fit (or overfit) to the Rasch model.

 

Example: Rating Scale Model (RSM) and Partial Credit Model (PCM) of the same dataset. When the models are nested (as they are with RSM and PCM), then we have:

RSM chi-square and RSM d.f.

PCM chi-square (which should be smaller) and PCM d.f. (which will be smaller)

Then the model choice could be based on: (RSM LL chi-square - PCM LL chi-square)  with (RSM-PCM) d.f., However, global fit statistics obtained by analyzing your data with conditional Rasch estimation, CMLE, or log-linear models (e.g., in SPSS) will be more exact than those produced by Winsteps. For the more information about the choice between RSM and PCM, see www.rasch.org/rmt/rmt143k.htm.

 

if global fit statistics are the decisive evidence for choice of analytical model, then Rasch analysis may not be appropriate. In the statistical philosophy underlying Rasch measurement, the decisive evidence for choice of model is "which set of measures is more meaningful and useful" (a practical decision), not "which set of measures fit the model better" (a statistical decision).

 

However, local fit and misfit are more important that global fit for decision-making: "Evaluating Restrictive Models in Educational and Behavioral Research: Local Misfit Overrides Model Tenability" (2020) Tenko Raykov, Christine DiStefano. Educational and Psychological Measurement.

 

Glas C.A.W., Verhelst N.D. (1995) Testing the Rasch Model. In: Fischer G.H., Molenaar I.W. (eds) Rasch Models. Springer, New York, NY.

van den Wollenberg, A. L. (1982). Two new test statistics for the Rasch model. Psychometrika 47: 123–140.

 


 

Question: Is it possible in Winsteps to obtain the p-value corresponding to the chi-square test on the difference in model fit between the Rating Scale Model and Partial Credit Model?

 

Answer: Yes, do the two analyses. Output Table 44. Compare the two log-likelihood chi-squares. However, the big challenge is the d.f. of the difference between the chi-squares. You can estimate these as "number of PCM Andrich thresholds - number of RSM Andrich thresholds" which is probably different from the difference in the "d.f. by simulation" of the two analyses.


Help for Winsteps Rasch Measurement and Rasch Analysis Software: www.winsteps.com. Author: John Michael Linacre

Facets Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Minifac download
Winsteps Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Ministep download

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn, 2024 George Engelhard, Jr. & Jue Wang Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes Statistical Analyses for Language Testers (Facets), Rita Green Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind Rasch Measurement: Applications, Khine Winsteps Tutorials - free
Facets Tutorials - free
Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse Rasch Measurement Theory Analysis in R, Wind, Hua Applying the Rasch Model in Social Sciences Using R, Lamprianou El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Rasch Models for Measurement, David Andrich Constructing Measures, Mark Wilson Best Test Design - free, Wright & Stone
Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias Diseño de Mejores Pruebas - free, Spanish Best Test Design A Course in Rasch Measurement Theory, Andrich, Marais Rasch Models in Health, Christensen, Kreiner, Mesba Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen
As an Amazon Associate I earn from qualifying purchases. This does not change what you pay.

facebook Forum: Rasch Measurement Forum to discuss any Rasch-related topic

To receive News Emails about Winsteps and Facets by subscribing to the Winsteps.com email list,
enter your email address here:

I want to Subscribe: & click below
I want to Unsubscribe: & click below

Please set your SPAM filter to accept emails from Winsteps.com
The Winsteps.com email list is only used to email information about Winsteps, Facets and associated Rasch Measurement activities. Your email address is not shared with third-parties. Every email sent from the list includes the option to unsubscribe.

Questions, Suggestions? Want to update Winsteps or Facets? Please email Mike Linacre, author of Winsteps mike@winsteps.com


State-of-the-art : single-user and site licenses : free student/evaluation versions : download immediately : instructional PDFs : user forum : assistance by email : bugs fixed fast : free update eligibility : backwards compatible : money back if not satisfied
 
Rasch, Winsteps, Facets online Tutorials


 

 
Coming Rasch-related Events
Jan. 17 - Feb. 21, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Feb. - June, 2025 On-line course: Introduction to Classical Test and Rasch Measurement Theories (D. Andrich, I. Marais, RUMM2030), University of Western Australia
Feb. - June, 2025 On-line course: Advanced Course in Rasch Measurement Theory (D. Andrich, I. Marais, RUMM2030), University of Western Australia
Apr. 21 - 22, 2025, Mon.-Tue. International Objective Measurement Workshop (IOMW) - Boulder, CO, www.iomw.net
May 16 - June 20, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

 

 

Our current URL is www.winsteps.com

Winsteps® is a registered trademark