Glossary

Glossary - Dictionary - Lexicon of Rasch Measurement Terminology

Glosario Español www.rasch.org/rmt/glosario.htm

Ability

the level of successful performance of the objects of measurement (persons) on the latent variable. Each person's location on the unidimensional variable measured in "additive Rasch units", usually logits.

Additive scale

Scale of measurement in which the units have the properties of simple addition, so that "one more unit = the same amount extra regardless of the amount you already have". Typical measuring devices such as tape measures and thermometers have additive scales. Rasch additive scales are usually delineated in logits.

Agent of Measurement

the tool (items, questions, etc.) used to define a latent variable, and to position objects of measurement (persons etc.) along that variable.

Analytic rating

a rating of a specific aspect of a performance (cf. Holistic rating)

Anchor

the process of using anchor values to insure that different analyses produce directly comparable results.

Anchor Value

a pre-set logit value assigned to a particular object, agent or step to be used as a reference value for determining the measurements or calibrations of other objects, agents or steps.

Anchor Table

the table of Anchor Values used during Rasch analysis of an Input Grid and so included in the Results Table produced. The Anchor Table has the same format as the Results Table.

Anchoring

the process of using anchor values to insure that different analyses produce directly comparable results.

Best Test Design

Wright, B.D. & Stone, M.H., Best Test Design: Rasch Measurement. Chicago: Mesa Press, 1979

Bias

A change in logit values based on the particular agents or objects measured.

BOTTOM

The value shown in the Results Table for an agent on which all objects were successful, (so it was of bottom difficulty), or for an object which had no success on any agent (so it was of bottom ability)

Bottom Category

the response category at which no level of successful performance has been manifested.

Calibration

a difficulty measure in logits used to position the agents of measurement (usually test items) along the latent variable.

CAT  Test

Computer-Adaptive Test. A test administered by computer in which the display of the next item depends on the response to the previous item.

Categories

CATS

qualitative levels of performance on an observational or response format, e.g., a rating scale.

Cell

Location of data in the spreadsheet, given by a column letter designation and row number designation e.g. B7

Classical Test Theory

Item analysis in which the raw scores are treated as additive numbers.

Common Scale

a scale of measurement on which all agents and objects can be represented.

Column

Vertical line of data in the Spreadsheet data, usually representing in an Input Grid all responses to a particular item, or in a Results Table, all statistics measuring the same attribute of agents or objects.

Comment

A semi-colon ; followed by text. This is ignored by Winsteps and Facets.

Complete data

Data in which every persons responds to every item. It makes a completely-filled rectangular data matrix. There are no missing data.

Computer-Adaptive Test

CAT Test. A test administered by computer in which the display of the next item depends on the response to the previous item.

Construct validity

The correlation between the item difficulties and the latent trait as intended by the test constructor. "Is the test measuring what it is intended to measure?"

Content

the subject area evoked and defined by an agent.

Continuation line

A separate line of text which Winsteps analyses as appended to the end of the previous line. These are shown with "+".

Contrast component

In the principal components analysis of residuals, a principal component (factor) which is interpreted by contrasting the items (or persons) with opposite loadings (correlations) on the component.

Control file

A DOS-text file on your disk drive containing the Winsteps control variables.

Control variable

In Winsteps, "control variable = value", is an instruction for controlling the computer program, e.g., "ITEM1 = 4".

Convergence

the point at which further improvement of the item and person estimates makes no useful difference in the results. Rasch calculation ends at this point.

Correlation

the relationship between two variables

CTT

Classical Test Theory

Data file

Winsteps: file containing the person labels and the responses to the items. It is part of the Control file if DATA= or MFORMS= are not used.

Demographics

Information about the person included the person label, e.g., "F" for female or "M" for male

Deterministic

Exactly predictable without any uncertainty. This contrasts with Probabilistic.

Dichotomous Response

a response format of two categories such as correct-incorrect, yes-no, agree-disagree.

DIF Differential item functioning

Change of item difficulty depending on which person classification-group is responding to the item, also called "item bias"

Difficulty

the level of resistance to successful performance of the agents of measurement on the latent variable. An item with high difficulty has a low marginal score. The Rasch item difficulty is the location on the unidimensional latent variable, measured in additive Rasch units, usually logits. Item difficulty measures are the locations on the latent variable (Rasch dimension) where the highest and lowest categories of the item are equally probable, regardless of the number of categories the item has.

Dimension

a latent variable which is influencing the data values.

Discrepancy

one or more unexpected responses.

Distractor

Incorrect answer to a multiple-choice question, which is intended to distract the examinee away from the correct option. Sometimes all the options, correct and incorrect, are called "distractors".

Disturbance

one or more unexpected responses.

Diverging

the estimated calibrations at the end of an iteration are further from convergence than at the end of the previous iteration.

Easiness

the level of susceptibility to successful performance of the agents of measurement on the latent variable. An item with high easiness has a high marginal score.

Eigenvalue

The value of a characteristic root of a matrix, the numerical "size" of the matrix

Element

Individual in a facet, e.g., a person, an item, a judge, a task, which participates in producing an observation.

Empirical

Based on observation or experiment

Empirical data

data derived from observation or experimentation

END LABELS

END NAMES

Winsteps: the end of the list of item identifying labels. This is usually followed by the data.

Entry number

Sequence number of the person or item in the dataset.

Person: Entry number 1 is the top row of the response-level data.

Item: Entry number 1 is the left-hand column of item-response data.

Equating

Putting the measures from two tests in the same frame of reference

Estimate

A value obtained from the data. It is intended to approximate the exactly true, but unknowable value.

EXP

Expected value

Value predicted for this situation based on the measures

Expected Response

the predicted response by an object to an agent, according to the Rasch model analysis.

EXP()

Exponential

Mathematical function used in estimating the Rasch measures

Exponential form

The Rasch model written in terms of exponentials, the form most convenient for computing response probabilities.

Extreme item

An item with an extreme score. Either everyone in the sample scored in the top category on the item, or everyone scored in the bottom category. An extreme measure is estimated for this item, and it fits the Rasch model perfectly, so it is omitted from fit reports.

Extreme person

A person with an extreme score. This person scored in the top category on the every item, or in the bottom category on every item. An extreme measure is estimated for this person, who fits the Rasch model perfectly, so is omitted from fit reports.

Facet

The components conceptualized to combine to produce the data, e.g., persons, items, judges, tasks.

Fit Statistic

a summary of the discrepancies between what is observed and what we expect to observe.

Focal group

The person classification-group which is the focus of a differential-item-functioning investigation

Frame of reference

The measurement system within which measures are directly comparable

Fundamental Measurement

1. Measurement which is not derived from other measurements.

2. Measurement which is produced by an additive (or equivalent) measurement operation.

Guttman

Louis Guttman (1916-1987) organized data into Scalograms intending that the observed response by any person to any items could be predicted deterministically from its position in the Scalogram.

Guttman pattern

Success on all the easier items. Failure on all the more difficulty items.

Heading

an identifier or title for use on tables, maps and plots.

Holistic rating

One rating which captures all aspects of the performance (cf. Analytic rating)

Hypothesis test

Fit statistics report on a hypothesis test. Usually the null hypothesis to be tested is something like "the data fit the model", "the means are the same", "these is no DIF". The null hypothesis is rejected if the results of the fit test are significant (p≤.05) or highly significant (p≤.01). The opposite of the null hypothesis is the alternate hypothesis.

Imputed data

Data generated by the analyst or assumed by the analytical process instead of being observed.

Independent

Not dependent on which particular agents and objects are included in the analysis. Rasch analysis is independent of agent or object population as long as the measures are used to compare objects or agents which are of a reasonably similar nature.

Infit

an information-weighted or inlier-sensitive fit statistic that focuses on the overall performance of an item or person, i.e., the information-weighted average of the squared standardized deviation of observed performance from expected performance. The statistic plotted and tabled by Rasch is this mean square normalized.

Interval scale

Scale of measurement on which equal intervals represent equal amounts of the variable being measured. Rasch analysis constructs interval scales with additive properties.

Item

agent of measurement (prompt, probe, "rating scale"), not necessarily a test question, e.g., a product rating. The items define the intended latent trait.

Item bank

Database of items including the item text, scoring key, difficulty measure and relevant statistics, used for test construction or CAT tests

Iteration

one run through the data by the Rasch calculation program, done to improve estimates by minimizing residuals.

Knox Cube Test

a tapping pattern test requiring the application of visual attention and short term memory.

Latent Trait

The idea of what we want to measure. A latent trait is defined by the items or agents of measurement used to elicit its manifestations or responses.

Link

Relating the measures derived from one test with those from another test, so that the measures can be directly compared.

LN()

Logarithm

Natural or Napierian logarithm. A logarithm to the base e, where e = 2.718... This contrasts with logarithms to the base 10.

Local origin

Zero point we have selected for measurement, such as sea-level for measuring mountains, or freezing-point for Celsius temperature. The zero point is chosen for convenience (similarly to a "setting-out point"). In Rasch measurement, it is often the average difficulty of the items.

Log-odds

The natural logarithm of the ratio of two probabilities (their odds).

Logit

"Log-odds unit": the unit of measure used by Rasch for calibrating items and measuring persons on the latent variable. A logarithmic transformation of the ratio of the probabilities of a correct and incorrect response, or of the probabilities of adjacent categories on a rating scale.

Logistic curve-fitting

an estimation method in which the improved value of an estimate is obtained by incrementing along a logistic ogive from its current value, based on the size of the current raw-score residual.

Logistic ogive

the relationship between additive measures and the probabilities of dichotomous outcomes.

Logit-linear

The Rasch model written in terms of log-odds, so that the measures are seen to form a linear, additive combination

Map

a bar chart showing the frequency and spread of agents and objects along the latent variable.

Matrix

a rectangle of responses with rows (or columns) defined by objects and columns (or rows) defined by agents.

MCQ

Multiple-Choice Question.

This is an item format often used in educational testing where the examinee selects the letter corresponding to the answer.

Mean-square

MnSq

Also called the relative chi-square and the normed chi-square. A mean-square fit statistic is a chi-square statistic divided by its degrees of freedom (d.f.). Its expectation is 1.0. Values below 1.0 indicate that the data are too predictable = overly predictable = overfit of the data to the model. Values above 1.0 indicate the data too unpredictable = underfit of the data to the model

Measure

Measurement

the location (usually in logits) on the latent variable. The Rasch measure for persons is the person ability. The Rasch measure for items is the item difficulty.

Menu bar

This is at the top of a program's window, and shows a list of standard program operations

Misfit

Any difference between the data the model predictions. Misfit usually refers to "underfit". The data are too unpredictable.

Missing data

Data which are not responses to the items. They can be items which the examinees did not answer (usually score as "wrong") or items which were not administered to the examinee (usually ignored in the analysis).

Model

Mathematical conceptualization of a relationship

Muted

Overfit to the Rasch model. The data are too predictable. The opposite is underfit, excessive noise.

Newton-Raphson iteration

A general method for finding the solution of non-linear equations

Noise

1. Randomness in the data predicted by the Rasch model.

2. Underfit: excessive unpredictability in the data, perhaps due to excessive randomness or multidimensionality.

Normal

a random distribution, graphically represented as a "bell" curve which has a mean value of 0 and a standard deviation of 1.

Normalized

1. the transformation of the actual statistics obtained so that they are theoretically part of a unit-normal distribution. "Normalized" means "transformed into a unit-normal distribution". We do this so we can interpret the values as "unit-normal deviates", the x-values of the normal distribution. Important ones are ±1.96, the points on the x-axis for which 5% of the distribution is outside the points, and 95% of the distribution is between the points.

2. linearly adjusting the values so they sum to a predetermined amount. For instance, probabilities always sum to 1.0.

Not administered

an item which the person does not see. For instance, all the items in an item bank which are not part of a computer-adaptive test.

Object of Measurement

person, product, site, to be measured or positioned along the latent variable.

OBS

Observed

Value derived from the data

Observation

Observed Response

the actual response by an object to an agent.

Odds ratio

Ratio of two probabilities, e.g., "odds against" is the ratio of the probability of losing (or not happening) to the probability of winning (or happening).

Outfit

an outlier-sensitive fit statistic that picks up rare events that have occurred in an unexpected way. It is the average of the squared standardized deviations of the observed performance from the expected performance. Rasch plots and tables use the normalized unweighted mean squares so that the graphs are symmetrically centered on zero.

Outliers

unexpected responses usually produced by agents and objects far from one another in location along the latent variable.

Overfit

The data are too predictable. There is not enough randomness in the data. This may be caused by dependency or other constraints.

Perfect score

Every response "correct" or the maximum possible score. Every observed response in the highest category.

Person

the object of measurement, not necessarily human, e.g., a product.

Plot

an x-y graph used by Rasch to show the fit statistics for agents and objects.

Point Labels

the placing on plots of the identifier for each point next to the point as it is displayed.

Point-measure correlation

PT-MEASURE, PTMEA

The correlation between the observations in the data and the measures of the items or persons producing them.

Poisson Counting

a method of scoring tests based on the number of occurrences or non-occurrences of an event, e.g. spelling mistakes in a piece of dictation.

Polarity

The direction of the responses on the latent variable. If higher responses correspond to more of the latent variable, then the polarity is positive. Otherwise the polarity is negative.

Polytomous response

responses in more than two ordered categories, such as Likert rating-scales.

Population

Every person (or every item) with the characteristics we are looking for. A sample of persons or items is usually assumed to be a random sample from the population.

Predictive validity

This is the amount of agreement between results obtained by the evaluated instrument and results obtained from more directly, e.g., the correlation between success level on a test of carpentry skill and success level making furniture for customers. "Do the person measures correspond to more and less of what we are looking for?"

Probabilistic

Predictable to some level of probability, not exactly. This contrasts with Deterministic.

Process

the psychological quality, i.e., the ability, skill, attitude, etc., being measured by an item.

PROX

the "Normal Approximation" estimation algorithm (Cohen, 1979). used to obtain initial estimates for the iterative estimation process.

Rack

Placing the responses to two tests in adjacent columns for each person, as though the items were being placed on a rack, c.f., stack.

Rasch, Georg

Danish Mathematician (1906-1980), who first propounded the application of the statistical approach used by Rasch.

Rasch measure

linear, additive value on an additive scale representing the latent variable

Rasch Model

a mathematical formula for the relationship between the probability of success (P) and the difference between an individual's ability (B) and an item's difficulty (D). P=exp(B-D)/(1+exp(B-D)) or log [P/(1-P)] = B - D

Rasch-Andrich Threshold

Step calibration. Location on the latent variable (relative to the center of the rating scale) where adjacent categories are equally probable.

Rating Scale

A format for observing responses wherein the categories increase in the level of the variable they define, and this increase is uniform for all agents of measurement.

Rating Scale Analysis

Wright, B.D. & Masters, G.N., Rating Scale Analysis: Rasch Measurement. Chicago: Mesa Press, 1982.

Raw score

the marginal score; the sum of the scored observations for a person, item or other element.

Reference group

The person classification-group which provides the baseline item difficulty in a differential-item-functioning investigation

Reliability

Reliability (reproducibility) = True Variance / Observed Variance (Spearman, 1904, etc.). It is the ratio of sample or test variance, corrected for estimation error, to the total variance observed.

Residuals

the difference between data observed and values expected.

Response

The value of an observation or data-point indicating the degree of success by an object (person) on an agent (item)

Response set

Choosing the same response on every item, such as always selecting option "C" on a multiple-choice test, or always selecting "Agree" on an attitude survey.

Results Table

a report of Rasch calculations.

Rigidity

when agents, objects and steps are all anchored, this is the logit inconsistency between the anchoring values, and is reported on the Iteration Screen and Results Table. 0 represents no inconsistency.

Row

a horizontal line of data on a Spreadsheet, usually used, in the Input Grid, to represent all responses by a particular object. The top row of each spreadsheet is reserved for Rasch control information.

Rule-of-thumb

A tentative suggestion that is not a requirement nor a scientific formula, but is based on experience and inference from similar situations. Originally, the use of the thumb as a unit of measurement.

Sample

the persons (or items) included in this analysis

Scale

the quantitative representation of a latent variable.

Scalogram

Picture of the data in which the persons (rows) and items (columns)  are arranged by marginal raw scores.

Score points

the numerical values assigned to responses when summed to produce a score for an agent or object.

Scoring key

The list of correct responses to multiple-choice (MCQ) items.

Scree plot

Plot showing the fraction of total variance in the data in each variance component.

Separation

the ratio of sample or test standard deviation, corrected for estimation error, to the average estimation error.

This is the number of statistically different levels of performance that can be distinguished in a normal distribution with the same "true" S.D. as the current sample. Separation = 2: high measures are statistically different from low measures.

Specification

A Winsteps control-variable and its value, e.g., "Name1=17"

Stack

Analyzing the responses of the same person to multiple administrations of the same test as though they were made by separate persons, by "stacking" the person records in one long data file, c.f., "rack"

Standard Deviation: P.SD, S.SD

The root mean square of the differences between the sample of values and their mean value. In Winsteps, all standard deviations are "population standard deviations" (the sample is the entire population) = P.SD. For the larger "sample standard deviation" (the sample is a random selection from the population) = S.SD, please multiply the Winsteps standard deviation by square-root (sample-size / (sample size - 1)).

Standard Error

An estimated quantity which, when added to and subtracted from a logit measure or calibration, gives the least distance required before a difference becomes meaningful.

Step calibration

Step difficulty

Rasch-Andrich threshold. Location on the latent variable (relative to the center of the rating scale) where adjacent categories are equally probable.

Steps

the transitions between adjacent categories as ordered by the definition of the latent variable.

Strata

= (4*Separation+1)/3 This is the number of statistically different levels of performance that can be distinguished in a normal distribution with the same "true" S.D. as the current sample, when the tales of the normal distribution are due to "true" measures, not measurement error. Strata=3: very high, middle, and very low measures can be statistically distinguished.

Sufficient statistic

A statistic (a number) which contains all the information in the data from which to estimate the value of a parameter.

Suffix

The letters added to a file name which specify the file format, e.g., ".txt" means "text file". If you do not see the suffix letters, instruct Windows to display them. See the Lesson 1 Appendix.

Table

Lists of words and numbers, arrange in columns, usually surrounded by "|".

Targeted

when the item difficulty is close to the person ability, so that he probability of success on a dichotomous item is near to 50%, or the expected rating is near to the center of the rating scale.

Targeting

Choosing items with difficulty equal to the person ability.

Task bar

This shows the Windows programs at the bottom of your computer screen

Template

a specially formatted input file.

Test length

The number of items in the test

Test reliability

The reliability (reproducibility) of the measure (or raw score) hierarchy of sample like this sample for this test. The reported reliability is an estimate of (true variance)/(observed variance), as also are Cronbach Alpha and KR-20.

TOP

The value shown in the Results Table for an agent on which no objects were successful, (so it was of top difficulty), or for an object which succeeded on every agent (so it was of top ability)

Top Category

the response category at which maximum performance is manifested.

UCON

the unconditional (or "joint" JMLE) maximum likelihood estimation formula, used by some Rasch programs for the second part of the iteration process.

Underfit

The data are too unpredictable. The data underfit the model. This may be because of excessive guessing, or contradictory dimensions in the data.

UNSURE

Rasch was unable to calibrate this data and treated it as missing.

Unweighted

the situation in which all residuals are given equal significance in fit analysis, regardless of the amount of the information contained in them.

Variable

a quantity or quality which can change its value

Weighted

the adjustment of a residual for fit analysis, according to the amount of information contained in it.

Zero score

Every response "incorrect" or the minimum possible score. Every observed response in the lowest category.

ZSTD

Probability of a mean-square statistic expressed as a z-statistic, i.e., a unit-normal deviate. For p≤.05 (double-sided), ZSTD>|1.96|.

&END

The end of the list of Winsteps control variables

&INST

The beginning of the list of Winsteps control variables. This is not necessary.


Help for Winsteps Rasch Measurement and Rasch Analysis Software: www.winsteps.com. Author: John Michael Linacre

Facets Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Minifac download
Winsteps Rasch measurement software. Buy for $149. & site licenses. Freeware student/evaluation Ministep download

Rasch Books and Publications
Invariant Measurement: Using Rasch Models in the Social, Behavioral, and Health Sciences, 2nd Edn, 2024 George Engelhard, Jr. & Jue Wang Applying the Rasch Model (Winsteps, Facets) 4th Ed., Bond, Yan, Heene Advances in Rasch Analyses in the Human Sciences (Winsteps, Facets) 1st Ed., Boone, Staver Advances in Applications of Rasch Measurement in Science Education, X. Liu & W. J. Boone Rasch Analysis in the Human Sciences (Winsteps) Boone, Staver, Yale
Introduction to Many-Facet Rasch Measurement (Facets), Thomas Eckes Statistical Analyses for Language Testers (Facets), Rita Green Invariant Measurement with Raters and Rating Scales: Rasch Models for Rater-Mediated Assessments (Facets), George Engelhard, Jr. & Stefanie Wind Aplicação do Modelo de Rasch (Português), de Bond, Trevor G., Fox, Christine M Appliquer le modèle de Rasch: Défis et pistes de solution (Winsteps) E. Dionne, S. Béland
Exploring Rating Scale Functioning for Survey Research (R, Facets), Stefanie Wind Rasch Measurement: Applications, Khine Winsteps Tutorials - free
Facets Tutorials - free
Many-Facet Rasch Measurement (Facets) - free, J.M. Linacre Fairness, Justice and Language Assessment (Winsteps, Facets), McNamara, Knoch, Fan
Other Rasch-Related Resources: Rasch Measurement YouTube Channel
Rasch Measurement Transactions & Rasch Measurement research papers - free An Introduction to the Rasch Model with Examples in R (eRm, etc.), Debelak, Strobl, Zeigenfuse Rasch Measurement Theory Analysis in R, Wind, Hua Applying the Rasch Model in Social Sciences Using R, Lamprianou El modelo métrico de Rasch: Fundamentación, implementación e interpretación de la medida en ciencias sociales (Spanish Edition), Manuel González-Montesinos M.
Rasch Models: Foundations, Recent Developments, and Applications, Fischer & Molenaar Probabilistic Models for Some Intelligence and Attainment Tests, Georg Rasch Rasch Models for Measurement, David Andrich Constructing Measures, Mark Wilson Best Test Design - free, Wright & Stone
Rating Scale Analysis - free, Wright & Masters
Virtual Standard Setting: Setting Cut Scores, Charalambos Kollias Diseño de Mejores Pruebas - free, Spanish Best Test Design A Course in Rasch Measurement Theory, Andrich, Marais Rasch Models in Health, Christensen, Kreiner, Mesba Multivariate and Mixture Distribution Rasch Models, von Davier, Carstensen
As an Amazon Associate I earn from qualifying purchases. This does not change what you pay.

facebook Forum: Rasch Measurement Forum to discuss any Rasch-related topic

To receive News Emails about Winsteps and Facets by subscribing to the Winsteps.com email list,
enter your email address here:

I want to Subscribe: & click below
I want to Unsubscribe: & click below

Please set your SPAM filter to accept emails from Winsteps.com
The Winsteps.com email list is only used to email information about Winsteps, Facets and associated Rasch Measurement activities. Your email address is not shared with third-parties. Every email sent from the list includes the option to unsubscribe.

Questions, Suggestions? Want to update Winsteps or Facets? Please email Mike Linacre, author of Winsteps mike@winsteps.com


State-of-the-art : single-user and site licenses : free student/evaluation versions : download immediately : instructional PDFs : user forum : assistance by email : bugs fixed fast : free update eligibility : backwards compatible : money back if not satisfied
 
Rasch, Winsteps, Facets online Tutorials


 

 
Coming Rasch-related Events: Winsteps and Facets
Oct 21 - 22 2024, Mon.-Tues. In person workshop: Facets and Winsteps in expert judgement test validity - UNAM (México) y Universidad Católica de Colombia. capardo@ucatolica.edu.co, benildegar@gmail.com
Oct. 4 - Nov. 8, 2024, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
Jan. 17 - Feb. 21, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
May 16 - June 20, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com
June 20 - July 18, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Further Topics (E. Smith, Facets), www.statistics.com
Oct. 3 - Nov. 7, 2025, Fri.-Fri. On-line workshop: Rasch Measurement - Core Topics (E. Smith, Winsteps), www.statistics.com

 

 

Our current URL is www.winsteps.com

Winsteps® is a registered trademark