How not to MISS an outlier: comparing three classic univariate methods and introducing a new one, the MAD–IQR–SD Simultaneous (MISS).

Guillaume Pech
Nathan Vacarro
Emilie A. Caspar
Pietro Amerio
Axel Cleeremans
Christophe Leys
Christophe Ley

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Outliers can strongly change statistical results, yet the three most widely used methods in psychology and neuroscience to detect them, Standard Deviation (SD), Interquartile Range (IQR), and Median Absolute Deviation (MAD), do not perform equally well across different data conditions. Importantly, these methods have rarely been compared systematically across a broad set of realistic scenarios, making it difficult for researchers to select an appropriate method in advance.In this study, we conducted large-scale simulations that varied distributional shape (normal, asymmetric, leptokurtic, uniform, asymmetric bimodal), sample size, outlier placement and outlier rate (0–40%). For each scenario, we assessed how SD, IQR, and MAD classified data points as valid or outlying by computing an accuracy score.Across the classic approaches, the most effective fixed thresholds were 2 SD, 2 IQR, and 3 MAD. SD performed well only when outliers were extremely rare (0-6%), but its accuracy dropped sharply as contamination increased. IQR was more robust at moderate outlier rates (8-20%), yet it became unreliable when many outliers were present or when distributions were asymmetric or bimodal. MAD showed the highest accuracy at high contamination levels (22-40%), although in leptokurtic (heavy-tailed) distributions both MAD and IQR were more biased than SD.To address these limitations, we introduce the MAD–IQR–SD Simultaneous (MISS) method, which integrates the three classical statistics using optimized weights identified through a genetic algorithm: 1.5 MAD [87.8%], 1 IQR [1.2%], 5 SD [11%]. Across all simulations, MISS achieved the highest overall accuracy (97.4%) and outperformed each classical method across most contexts.

Version published to 10.31234/osf.io/2r9yw_v1 on OSF Preprints
Mar 19, 2026

Robust Standard Errors and Confidence Intervals for Standardized Mean Difference

This article has 1 author:
1. Metin Bulus
This article has no evaluationsLatest version Mar 19, 2026
The Statistical Costs of Two-Step Signal Detection Analyses: A Case for a Maximum Likelihood Mixed-Effects Approach

This article has 4 authors:
1. Marie Jakob
2. Raphael Hartmann
3. Karl Christoph Klauer
4. Constantin Gregor Meyer-Grant
This article has no evaluationsLatest version Mar 12, 2026
Analyzing Binary Judgments: A Comparison of ANOVA, Signal Detection Theory, and Generalized Linear Mixed Models in the Context of the Illusory Truth Effect

This article has 2 authors:
1. Semih Can Aktepe
2. Daniel W. Heck
Reviewed by PREreview

This article has 1 evaluationAppears in 1 listLatest version Jan 28, 2026Latest activity Mar 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Robust Standard Errors and Confidence Intervals for Standardized Mean Difference

The Statistical Costs of Two-Step Signal Detection Analyses: A Case for a Maximum Likelihood Mixed-Effects Approach

Analyzing Binary Judgments: A Comparison of ANOVA, Signal Detection Theory, and Generalized Linear Mixed Models in the Context of the Illusory Truth Effect