Estimating Longitudinal Trends with Differential Item Functioning: A Comparison of Five IRT-Based Approaches

Oskar Engels
Oliver Lüdtke
Alexander Robitzsch

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In longitudinal assessments, tests are frequently used to estimate trends over time. When item parameters lack invariance, time point comparisons can be distorted, necessitating the use of appropriate statistical methods for accurate estimation. This study compares trend estimates using the 2PL model under item parameter drift (IPD) across five trend estimation approaches for two time points: concurrent calibration, which jointly estimates all item parameters across time points and assumes full invariance; fixed calibration, which calibrates the first time point (T1), and calibrates subsequently the second time point (T2) with the common item parameters being fixed to the estimates from T1; separate calibration of the two time points is followed by robust Haberman or Haebara linking that down-weights outlying common items via Lp or L0 losses to place parameters on the scale of T1; partial invariance via IPD detection, where non-invariant items are identified using likelihood-ratio tests or root mean square deviation (RMSD) with fixed or data-driven cutoffs and trends are re-estimated on the resulting anchor set; and regularized estimation that simultaneously estimates all parameters while shrinking most IPD effects toward zero using a smooth Bayesian information criterion (SBIC).The simulation study varied sample size, number of items, IPD effect size, IPD item proportion, and whether the IPD was balanced or unbalanced. Bias and relative RMSE were evaluated for the mean and SD at T2. An empirical example using synthetic longitudinal reading data, applying trend estimation approaches, is provided.Results indicate that SBIC regularized estimation generally performed best across conditions, maintaining low bias and RMSE. Among robust linking methods, Haberman linking with the L0 loss function showed superior performance under unbalanced IPD. Detection-based approaches revealed that commonly used RMSD cutoffs may be too lenient; stricter cutoffs (0.03 to 0.05) were necessary to achieve satisfactory parameter recovery, especially under unbalanced IPD conditions.

Version published to 10.31234/osf.io/5dtfy_v2 on OSF Preprints
Oct 4, 2025
Version published to 10.31234/osf.io/5dtfy_v1 on OSF Preprints
Oct 4, 2025

Estimating Longitudinal Trends with Differential Item Functioning: A Comparison of Five IRT-Based Approaches

This article has 3 authors:
1. Oskar Engels
2. Oliver Lüdtke
3. Alexander Robitzsch
This article has no evaluationsLatest version Oct 4, 2025
Score-Based Tests with Fixed Effects Person Parameters in Item Response Theory: Detecting Model Misspecification Including Differential Item Functioning

This article has 2 authors:
1. Rudolf Debelak
2. Charles C Driver
This article has no evaluationsLatest version Oct 6, 2025
Estimating age-based developmental trajectories using latent change score models based on measurement occasion

This article has 3 authors:
1. Eduardo Estrada
2. Fumiaki Hamagami
3. Emilio Ferrer
This article has no evaluationsLatest version Sep 30, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Estimating Longitudinal Trends with Differential Item Functioning: A Comparison of Five IRT-Based Approaches

Score-Based Tests with Fixed Effects Person Parameters in Item Response Theory: Detecting Model Misspecification Including Differential Item Functioning

Estimating age-based developmental trajectories using latent change score models based on measurement occasion