Using Classification Trees to Identify the Best Method in Monte Carlo Simulations: From Population Parameters to Observed Features

Abstract

Monte Carlo simulations are widely used to compare statistical methods, but their findings are often difficult to interpret and rarely translate into practical guidelines for method selection. This gap arises for two reasons. First, method performance often depends on complex interactions among simulation factors that are hard to detect with conventional summaries. Second, performance is typically evaluated against population values that are unknown in practice, whereas applied researchers only see features of their sample data. We propose a classification tree framework that addresses both problems, along with two pruning strategies tailored to the simulation context: a combined pruning procedure that accounts for equivalent representations of the same data, and effect-size-based pruning that prevents large numbers of replications from inflating tree complexity. We illustrate this framework by selecting among zero cell correction strategies for estimating tetrachoric correlations. In Example 1, we reanalyze results from Choi & Wu (2026) using the original simulation factors as predictors, yielding clear decision rules that still rely on population values. In Example 2, we generate data under continuous simulation factors and construct predictors directly from observed 2×2 contingency tables, allowing the resulting rules to be applied to real data. More broadly, the proposed framework provides a general approach for translating Monte Carlo comparisons of competing methods into practical method selection guidelines.
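As a minimal sketch of the idea behind Example 2 (assumptions, not the authors' code): instead of population parameters, the predictors fed to the classification tree are features computed directly from an observed 2×2 contingency table, and a standard zero cell correction (adding 0.5 to every cell when any cell is empty, one of several correction strategies) is applied before estimating the tetrachoric correlation. The feature names below are illustrative choices, not the paper's exact predictor set.

```python
# Hypothetical sketch: observed-data predictors from a 2x2 table,
# plus the classic 0.5 continuity correction for zero cells.

def table_features(table):
    """Compute tree predictors from a 2x2 table [[a, b], [c, d]].
    Feature choices here are illustrative assumptions."""
    (a, b), (c, d) = table
    n = a + b + c + d
    return {
        "n": n,                                 # total sample size
        "has_zero_cell": min(a, b, c, d) == 0,  # any empty cell?
        "row1_margin": (a + b) / n,             # marginal proportion, row 1
        "col1_margin": (a + c) / n,             # marginal proportion, col 1
        "min_cell_prop": min(a, b, c, d) / n,   # rarest cell's proportion
    }

def apply_zero_cell_correction(table, c=0.5):
    """Add c to every cell if any cell is zero (one common strategy);
    otherwise return the table unchanged."""
    cells = [x for row in table for x in row]
    if min(cells) == 0:
        return [[x + c for x in row] for row in table]
    return [list(row) for row in table]

observed = [[0, 12], [7, 31]]          # a zero-cell table
feats = table_features(observed)
corrected = apply_zero_cell_correction(observed)
```

A fitted classification tree would then split on features like `has_zero_cell` or `min_cell_prop` to recommend a correction strategy, so the resulting decision rules depend only on quantities a researcher can actually observe.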
