Tree-based machine learning methods for multilevel data: The impact of predictor levels and clustering on prediction and inference
Abstract
Machine learning methods such as decision trees and random forests allow researchers to investigate complex non-linear and interaction effects, making them valuable tools for exploring complex psychological processes. Recently, decision tree methods have been extended (multilevel trees) for application to multilevel data, i.e., data with level-1 units (e.g., students) nested within level-2 units (e.g., classes). While these adaptations add random effects to address the lack of independence between observations, they do not consider the level at which predictor variables operate. We investigate how predictor level (level-1 vs. level-2) and clustering (intraclass correlation coefficient (ICC), number of clusters, cluster size) affect inference and prediction in six tree-based methods: rpart, ctree, REEMtree, REEMctree, MERT, and lmertree. Using simulation studies, we evaluate variable selection, predictive performance (PMSE, R²), and the predictive contribution of individual predictor variables. Our results show that, in both prediction and inference, standard and multilevel tree methods are seriously affected by the level of the predictor variables and by the clustering in the data. In particular, the risk of falsely selecting non-informative level-2 predictors is substantial, especially when the ICC is high. Ignoring the multilevel structure of the data may thus lead to erroneous research conclusions even when multilevel tree methods are applied.
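To make the simulation factors concrete, the following is a minimal Python sketch (not the authors' code; the paper's methods are R packages) of a multilevel data-generating process with one level-1 and one level-2 predictor, where the random-intercept variance is set so that the residual ICC matches a target value. All function names, effect sizes, and design values here are illustrative assumptions.

```python
import numpy as np

def simulate_multilevel(n_clusters=50, cluster_size=20, icc=0.3, seed=0):
    """Simulate a clustered outcome with one level-1 and one level-2 predictor.

    The residual variance is fixed at 1 and split so that the random-intercept
    variance divided by the total residual variance equals the target ICC.
    Effect sizes (0.5) and design values are illustrative, not from the paper.
    """
    rng = np.random.default_rng(seed)
    tau2 = icc            # between-cluster (level-2) residual variance
    sigma2 = 1.0 - icc    # within-cluster (level-1) residual variance
    n = n_clusters * cluster_size
    cluster = np.repeat(np.arange(n_clusters), cluster_size)
    x1 = rng.normal(size=n)                        # level-1 predictor (varies within clusters)
    z1 = rng.normal(size=n_clusters)[cluster]      # level-2 predictor (constant within a cluster)
    u = rng.normal(scale=np.sqrt(tau2), size=n_clusters)[cluster]  # random intercepts
    e = rng.normal(scale=np.sqrt(sigma2), size=n)  # level-1 residuals
    y = 0.5 * x1 + 0.5 * z1 + u + e
    return cluster, x1, z1, y

def empirical_icc(cluster, y):
    """One-way ANOVA estimator of the ICC for a balanced design."""
    groups = [y[cluster == c] for c in np.unique(cluster)]
    n = len(groups[0])                 # common cluster size
    grand = y.mean()
    msb = n * sum((g.mean() - grand) ** 2 for g in groups) / (len(groups) - 1)
    msw = sum(((g - g.mean()) ** 2).sum() for g in groups) / (len(y) - len(groups))
    return (msb - msw) / (msb + (n - 1) * msw)
```

Note that the ICC of the outcome itself exceeds the residual ICC here, because the level-2 predictor `z1` contributes additional between-cluster variance; this is exactly the kind of confounding between predictor level and clustering that the simulations vary.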