When do interaction/moderation effects stabilize in linear regression?

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Two-way interaction effects in linear regression occur when the relation between two variables changes depending on the level of a third. Despite their frequent use, interactions are notoriously difficult to estimate accurately and test for statistical significance due to small effect sizes and low reliability. This study uses Monte Carlo simulations to establish stability thresholds for two-way interactions between continuous variables across combinations of reliability (.7–1.0), main effect size (.1–.5), collinearity (.1–.5), and interaction effect size (.05–.2). Stability was defined as the consistency of estimated effect sizes across repeated samples of the same size from the same population and operationalized using modified definitions of the Corridor of Stability (COS) and Point of Stability (POS) from Schönbrodt and Perugini (2013). Results show that the stability of interaction estimates is primarily determined by sample size and predictor reliability. The case representing a realistic psychology field study, where researchers have limited control over variables, stabilized at n = 3,800, requiring 72% statistical power. At n < 100, 11%–45% of the estimates were incorrectly signed (i.e., negative when the true effect was positive). Most psychology studies enroll far fewer than 500 participants, and our results indicate many published interactions may be unstable. It should be noted that analyses involving highly reliable predictors, such as group assignment in experimental designs, may stabilize at lower sample sizes as they attenuate the expected effect size less than variables with more measurement error. Researchers are encouraged to avoid routine tests of two-way interactions unless sample size and reliability are adequate and hypotheses are specified a priori.

Article activity feed