The Dark Side of Sequential Testing: A Simulation Study on Questionable Research Practices

Abstract

In response to growing replication failures attributed to underpowered studies and questionable research practices (QRPs) such as data peeking, researchers have increasingly turned to sequential testing frameworks, most notably the Sequential Probability Ratio Test (SPRT). By evaluating the evidence at each interim analysis, SPRTs achieve the same inferential rigor as fixed-sample protocols while requiring substantially fewer participants, without sacrificing robustness to violations of their statistical assumptions. However, like any statistical method, SPRTs are susceptible to QRPs. We conducted a simulation study in which a fictional researcher applied various hacking strategies to favor the alternative hypothesis: running multiple parallel sequential ANOVAs, performing opportunistic subgroup or outlier analyses, flexibly redefining expected effect sizes, reshuffling the order of observations, and filtering out datapoints that weakened interim likelihood ratios. Single hacking strategies and moderate combinations inflated the nominal 5% Type I error rate to 6-19% (rising to 99% under extreme data filtering), shifted effect-size estimates upward, increased sample-size efficiency, and reduced the rate of non-decisions. These findings underscore that, while SPRTs offer substantial gains in efficiency, they are not immune to misuse, much like fixed-design approaches. It is therefore critical to promote transparency and preregistration in sequential designs to prevent the adoption of QRPs early on.
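To illustrate the mechanism behind the most extreme result, the sketch below simulates a one-sample Gaussian SPRT (H0: mu = 0 vs. H1: mu = d, known unit variance) under the null hypothesis and compares an honest analysis with the data-filtering QRP in which observations that weaken the interim likelihood ratio are silently discarded. The design parameters (d = 0.5, alpha = beta = 0.05, at most 200 observations) and the Python implementation are illustrative assumptions, not the authors' simulation code.

```python
import numpy as np

rng = np.random.default_rng(2024)

def sprt(data, d=0.5, alpha=0.05, beta=0.05, drop_unfavourable=False):
    """One-sample SPRT of H0: mu = 0 vs H1: mu = d (sigma = 1 known).

    If drop_unfavourable is True, mimic the QRP of discarding any
    observation that lowers the interim likelihood ratio.
    Returns 'H1', 'H0', or 'no decision'.
    """
    upper = np.log((1 - beta) / alpha)   # accept H1 at or above this bound
    lower = np.log(beta / (1 - alpha))   # accept H0 at or below this bound
    log_lr = 0.0
    for x in data:
        # log-likelihood-ratio increment for N(d, 1) vs N(0, 1)
        step = d * x - d**2 / 2
        if drop_unfavourable and step < 0:
            continue                      # QRP: silently drop the observation
        log_lr += step
        if log_lr >= upper:
            return "H1"
        if log_lr <= lower:
            return "H0"
    return "no decision"

def type_I_error(n_sims=5000, n_max=200, **kwargs):
    # Data are generated under H0 (true mean 0), so every 'H1' is a false positive.
    hits = sum(sprt(rng.normal(0, 1, n_max), **kwargs) == "H1"
               for _ in range(n_sims))
    return hits / n_sims

print("honest SPRT  :", type_I_error())                        # roughly 0.05 or below
print("filtered data:", type_I_error(drop_unfavourable=True))  # close to 1.0
```

Because the filtered log-likelihood ratio can only move upward, it almost always crosses the upper boundary eventually, which is consistent with the near-total Type I error inflation reported for extreme data filtering in the abstract.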
