A more comprehensive and reliable analysis of individual differences with generalized random forest for high-dimensional data: validation and guidelines

Jinwoo Lee
Junghoon Justin Park
Maria Pak
Seung Yun Choi
Jiook Cha

Abstract

Analyzing individual differences in treatment or exposure effects is a central challenge in psychology and the behavioral sciences. Conventional statistical models focus on average treatment effects, so they overlook individual variability and struggle to identify key moderators. Generalized Random Forest (GRF) can predict individualized treatment effects, but current implementations suffer from two critical limitations: (1) prediction performance varies substantially across random initializations, and (2) identification of key moderators is limited in high-dimensional settings. Here, we introduce two methodological advances to address these issues. First, a seed ensemble strategy stabilizes predictions by aggregating models trained under different random initializations. Second, a backward elimination procedure systematically identifies key moderators from high-dimensional inputs. Simulation analyses across diverse scenarios demonstrate that our approach achieves reliable and valid predictions across random seeds, improved performance in moderator identification, and robust generalization to independent data. To facilitate adoption and interpretation, we provide step-by-step guidance using a large-scale neuroimaging dataset (N = 8,778) with reusable code. These enhancements make GRF more reliable for modeling individual differences in treatment effects, supporting data-driven hypothesis generation and the identification of responsive subgroups.
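The seed ensemble idea in the abstract can be illustrated with a minimal sketch. This is not the authors' code and it does not use GRF: `make_data`, `fit_predict`, and `seed_ensemble` are hypothetical names, and `fit_predict` is an assumed stand-in for one seeded model fit (a bootstrap difference in means within each level of a single binary moderator). The sketch only shows how averaging predictions from fits trained under different seeds can reduce seed-to-seed variability.

```python
import random
from statistics import mean, pstdev


def make_data(n=400, seed=0):
    """Toy randomized trial (assumed): effect is 2.0 when x == 1, else 0.0."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = rng.randint(0, 1)  # binary moderator
        w = rng.randint(0, 1)  # randomized treatment indicator
        y = 2.0 * x * w + rng.gauss(0, 1)  # outcome with unit-variance noise
        rows.append((x, w, y))
    return rows


def fit_predict(rows, seed):
    """Hypothetical stand-in for one seeded model fit.

    Bootstraps the rows with the given seed, then estimates the treatment
    effect within each level of x as treated mean minus control mean.
    Returns one predicted effect per original row.
    """
    rng = random.Random(seed)
    boot = [rng.choice(rows) for _ in rows]
    effect = {}
    for level in (0, 1):
        treated = [y for x, w, y in boot if x == level and w == 1]
        control = [y for x, w, y in boot if x == level and w == 0]
        effect[level] = mean(treated) - mean(control)
    return [effect[x] for x, _, _ in rows]


def seed_ensemble(rows, seeds):
    """Average per-row predictions over fits trained under different seeds."""
    per_seed = [fit_predict(rows, s) for s in seeds]
    return [mean(col) for col in zip(*per_seed)]


rows = make_data()

# Prediction for the first individual under 20 different single seeds.
single = [fit_predict(rows, s)[0] for s in range(20)]

# Same individual, 20 ensembles of 10 seeds each (disjoint seed blocks).
ensembles = [
    seed_ensemble(rows, range(k * 10, k * 10 + 10))[0] for k in range(20)
]

spread_single = pstdev(single)
spread_ensemble = pstdev(ensembles)
print(f"spread across single seeds:   {spread_single:.3f}")
print(f"spread across seed ensembles: {spread_ensemble:.3f}")
```

In this toy setting the ensemble spread should come out noticeably smaller than the single-seed spread, since each ensemble averages ten independent resampling draws. How many seeds are needed for a real GRF model is a question the article itself addresses; the value of ten used here is arbitrary.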

Version published to 10.1101/2025.10.28.685232 on bioRxiv
Oct 31, 2025

Related articles

Item-Level Heterogeneous Treatment Effects in Instrumental Variables Regression

This article has 5 authors:
1. Sanford R Student
2. Joshua Gilbert
3. Jesse Uzochukwu Eze
4. William Young
5. Benjamin Domingue
This article has no evaluations. Latest version: Jan 6, 2026

A Unified Framework for Psychometrics in Experimental Psychology: The Standardized Generalized Hierarchical Factor Model

This article has 4 authors:
1. Ricardo Rey-Sáez
2. Alicia Franco-Martínez
3. Javier Revuelta
4. Miguel A. Vadillo
This article has no evaluations. Latest version: Dec 29, 2025

A Two-Step Robust Estimation Approach for Inferring Within-person Relations in Longitudinal Design: Tutorial and Simulations

This article has 1 author:
1. Satoshi Usami
This article has no evaluations. Latest version: Dec 14, 2025
