Score-Based Tests with Fixed Effects Person Parameters in Item Response Theory: Detecting Model Misspecification Including Differential Item Functioning
Abstract
We present a fast, score-based test for detecting model misspecification in item response theory (IRT) models that remains valid when person parameters are treated as fixed effects, as may be done for very large data sets. The new approximation (i) eliminates the need to pre-specify ability groups or priors for person abilities, (ii) does not require explicit functional form assumptions, (iii) works with two estimators designed for very high item/person counts -- constrained joint maximum likelihood (CJML) and joint maximum a posteriori (JMAP) -- and (iv) requires only a single model fit, making screening for differential item functioning (DIF) faster and simpler than alternatives based on model comparisons. A spline-based residualisation step further suppresses spurious Type I errors when the ordering covariate is correlated with ability. Simulations with the two-parameter logistic model show nominal Type I error rates and high power once examinees contribute 15–20 responses; only extremely short tests (10 items) still pose challenges under strong impact. An application to 1602 reading items and 57684 students from the Mindsteps platform demonstrates scalability and practical value, flagging 13% of items for gender-related DIF, with results that correlate highly with conventional approaches that model DIF explicitly. Together, these results position the proposed test as a robust, computationally light diagnostic for large-scale assessments when classical random-effects approaches are infeasible, the ability group structure is unknown or complex, or the shape of DIF effects is unknown or complex.