How Much Variance Does Your Model Explain? A Clarifying Note on the Use of Split-Half Reliability for Computing Noise Ceilings

Sander van Bree
Malin Styrnal
Martin N. Hebart

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Noise ceilings estimated from a dataset's split-half reliability offer a powerful way to quantify how much variance a model can in principle explain given the noise in the dataset, allowing researchers to assess model performance relative to an upper bound. In this work, we caution against a common pitfall in this approach to estimating noise ceilings. Specifically, even though the split-half reliability is expressed as a correlation coefficient, it reflects the maximum explained variance of a perfect model, not the maximum correlation. This subtle misinterpretation leads to artificially lower noise ceilings and, as a consequence, may inflate how close models appear to be to the noise ceiling. A systematic literature analysis suggests that this overly permissive ceiling is the most prevalent interpretation of noise ceilings estimated through split-half reliability. The purpose of this work is to explain when the mistake happens, why it happens, what its consequences are, and how to avoid it. Toward this end, we offer a general explanation showing how split-half reliabilities relate to the performance of a maximally predictive model, supplemented by simulations, and mathematical derivations. Overall, this clarifying piece is meant to help researchers better understand the statistical underpinnings of noise ceilings and support more consistent reporting across studies.

Version published to 10.31234/osf.io/gjk45_v2 on OSF Preprints
Dec 4, 2025
Version published to 10.31234/osf.io/gjk45_v1 on OSF Preprints
Dec 3, 2025

But r Won’t Do That: The Limits of Standardized Covariance

This article has 1 author:
1. John Protzko
This article has no evaluationsLatest version Mar 24, 2026
A quick, easy, and imperfect fix for using the Binomial Effect Size Display for correlations with continuous variables: divide by three

This article has 1 author:
1. Denis Lajoie
This article has no evaluationsLatest version Mar 28, 2026
A Proof of Concept for the Measurement of Reliability as Conditional Determinacy

This article has 1 author:
1. Domenic Groh
This article has no evaluationsLatest version Apr 19, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

But r Won’t Do That: The Limits of Standardized Covariance

A quick, easy, and imperfect fix for using the Binomial Effect Size Display for correlations with continuous variables: divide by three

A Proof of Concept for the Measurement of Reliability as Conditional Determinacy