Predicting psychological constructs from biased measurements: The impact of non-invariant targets in machine learning
Abstract
Psychology is increasingly interested in predicting constructs via machine learning (ML) models, for example, predicting a person’s personality or intelligence. To measure these constructs, psychologists often draw on questionnaires. In supervised ML, these measurements are then used as target variables (i.e., the “ground truth”) for model training. Little attention is currently paid to psychometric issues and biases that might be carried over from the measurements in the training data to the final model used for predictions. One potential bias is a lack of measurement invariance (MI) of the questionnaire data across the groups whose scores serve as target values for supervised learning. If non-invariant measurements are used for model training, the predictions of the final ML model may be biased. Specifically, people from two different groups with the same true score on a construct might receive different predicted scores from the model. In this article, we assess the impact of a lack of MI in target variables on ML predictive performance and investigate approaches to counter this impact. We address this question through a comprehensive simulation study in which we derive target values from (a) single-group models (i.e., ignoring non-invariance) and (b) alignment optimization (i.e., handling non-invariance). Results show that single-group factor scores cause ML models to reproduce measurement bias in their predictions. Aligned factor scores can improve prediction performance when measurements are non-invariant, but only if certain conditions are met. We discuss implications for psychological applications of ML as well as directions for future research.