Bridging Traditional Statistics and Machine Learning Approaches in Psychology: Navigating Small Samples, Measurement Error, Non-independent Observations and Missing Data

Rosa Lavelle-Hill
Gavin Smith
Kou Murayama

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In recent years, machine learning has propagated into different aspects of psychologicalresearch, and supervised machine learning methods have increasingly been used as a toolfor predicting human behavior or psychological characteristics when there is a largenumber of possible predictors. However, researchers often face practical challenges whenusing machine learning methods on psychological data. In this article, we identify anddiscuss four key challenges that often arise when applying machine learning to datacollected for psychological research. The four challenge areas cover (i) limited sample size,(ii) measurement error, (iii) non-independent data, and (iv) missing data. Such challengesare extensively discussed in the “traditional” statistical literature but are often notexplicitly addressed, or at least not to the same extent, in the applied machine learningcommunity. We present how each of these challenges is dealt with first from a traditionalstatistics perspective and then from a machine learning perspective, and discuss thestrengths and weaknesses of these solutions by comparing the approaches. We argue thatthe boundary between traditional statistics and machine learning is fluid, and emphasizethe need for cross-disciplinary collaboration to better tackle these core challenges andimprove replicability.

Version published to 10.31219/osf.io/6xt82_v3 on OSF Preprints
May 12, 2025
Version published to 10.31219/osf.io/6xt82_v2 on OSF Preprints
May 8, 2025
Version published to 10.31219/osf.io/6xt82_v1 on OSF Preprints
Oct 30, 2023

Just in Time or Just a Guess? Addressing Challenges in Validating Prediction Models Based on Longitudinal Data

This article has 2 authors:
1. Anna M Langener
2. Nicholas C. Jacobson
This article has no evaluationsLatest version Jun 11, 2025
Predicting continuous outcomes: Some new tests of associative approaches to contingency learning

This article has 4 authors:
1. Julie Y. L. Chow
2. Hilary J. Don
3. Ben Colagiuri
4. Evan J. Livesey
This article has no evaluationsLatest version Jun 15, 2025
A tutorial for comparing nonnested latent variables models

This article has 1 author:
1. Holmes Finch
This article has no evaluationsLatest version Jun 17, 2025

Listed in

Abstract

Article activity feed

Related articles

Just in Time or Just a Guess? Addressing Challenges in Validating Prediction Models Based on Longitudinal Data

Predicting continuous outcomes: Some new tests of associative approaches to contingency learning

A tutorial for comparing nonnested latent variables models