Leakage-Aware LLM Augmentation for Attrition Prediction: A Decision-Centric Evaluation

Weiquan Liao
Jiayangmei Xu
Ekaterina A. Panova

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Employee attrition imposes high financial and organizational costs, with preventable departures typically far more expensive than false alarms. This study frames attrition prediction as a decision-support problem and introduces a leakage-aware framework that leverages LLM-based augmentation to generate realistic minority-class samples. Using the IBM HR dataset, we benchmark classical, tree-based, transformer, and AutoML models. Results show that LLM-based augmentation consistently improves recall of potential leavers, even when AUC or Average Precision remain statistically unchanged. From a managerial perspective, higher recall enables organizations to prevent more costly departures at the expense of only modest increases in false positives, producing a favorable cost–benefit balance. SHAP analyses confirm that key drivers such as overtime, mobility, and job satisfaction remain interpretable and actionable, while fairness analysis shows small subgroup disparities, supporting equitable deployment. Overall, the proposed framework demonstrates how leakage-aware, recall-oriented augmentation can translate generative AI advances into transparent, fair, and decision-relevant tools for HR retention, with potential applicability to other rare-event domains such as churn, fraud, and risk prediction.

Version published to 10.20944/preprints202509.1238.v1
Sep 16, 2025

Optimizing Fairness in Machine Learning: A Hyperparameter Tuning Approach

This article has 1 author:
1. Abdul Nadeem Mohammed
This article has no evaluationsLatest version Oct 14, 2025
Machine Learning for Fair and Accurate University Admission Prediction: A Case Study from the UAE

This article has 1 author:
1. Mohammad Abbadi
This article has no evaluationsLatest version Sep 22, 2025
Shadow AI thrives under punitive social evaluation

This article has 5 authors:
1. Mengchen Dong
2. Hiromu Yakura
3. Omar Sherif
4. Jean-François Bonnefon
5. Iyad Rahwan
This article has no evaluationsLatest version Aug 26, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Optimizing Fairness in Machine Learning: A Hyperparameter Tuning Approach

Machine Learning for Fair and Accurate University Admission Prediction: A Case Study from the UAE

Shadow AI thrives under punitive social evaluation