When do interaction/moderation effects stabilize in linear regression?

Andrew J Castillo
Joshua Miller
Colin Vize
David A Baranger
Donald R Lynam

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Two-way interaction effects in linear regression occur when the relation between two variables changes depending on the level of a third. Despite their frequent use, interactions are notoriously difficult to estimate accurately and test for statistical significance due to small effect sizes and low reliability. This study uses Monte Carlo simulations to establish stability thresholds for two-way interactions between continuous variables across combinations of reliability (.7–1.0), main effect size (.1–.5), collinearity (.1–.5), and interaction effect size (.05–.2). Stability was defined as the consistency of estimated effect sizes across repeated samples of the same size from the same population and operationalized using modified definitions of the Corridor of Stability (COS) and Point of Stability (POS) from Schönbrodt and Perugini (2013). Results show that the stability of interaction estimates is primarily determined by sample size and predictor reliability. The case representing a realistic psychology field study, where researchers have limited control over variables, stabilized at n = 3,800, requiring 72% statistical power. At n < 100, 11%–45% of the estimates were incorrectly signed (i.e., negative when the true effect was positive). Most psychology studies enroll far fewer than 500 participants, and our results indicate many published interactions may be unstable. It should be noted that analyses involving highly reliable predictors, such as group assignment in experimental designs, may stabilize at lower sample sizes as they attenuate the expected effect size less than variables with more measurement error. Researchers are encouraged to avoid routine tests of two-way interactions unless sample size and reliability are adequate and hypotheses are specified a priori.

Version published to 10.31234/osf.io/35t84_v1 on OSF Preprints
Nov 12, 2025

Shedding some light on the relationship between measurement error and statistical power in multilevel models applied to intensive longitudinal designs

This article has 3 authors:
1. Ginette Lafit
2. Sigert Ariens
3. Richard Artner
This article has no evaluationsLatest version Feb 2, 2026
Moderating the consequences of longitudinal change for distal outcomes

This article has 1 author:
1. Ethan Michael McCormick
This article has no evaluationsLatest version Dec 31, 2025
Reciprocal Effects Between Self-Esteem and Work Experiences: A Reanalysis of Two Longitudinal Studies

This article has 3 authors:
1. Max Lustenberger
2. Laurenz L. Meier
3. Ulrich Orth
This article has no evaluationsLatest version Jan 7, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Shedding some light on the relationship between measurement error and statistical power in multilevel models applied to intensive longitudinal designs

Moderating the consequences of longitudinal change for distal outcomes

Reciprocal Effects Between Self-Esteem and Work Experiences: A Reanalysis of Two Longitudinal Studies