Explainable and fair anti money laundering models using a reproducible SHAP framework for financial institutions

Pristly Turjo Mazumder

Abstract

Financial institutions are under growing regulatory pressure to detect and report money laundering in a way that is accurate, auditable, and fair. This study introduces a reproducible machine learning pipeline for Anti-Money Laundering (AML) detection that integrates statistically validated synthetic data generation, class-imbalance handling, and post-hoc explainability. Using a 10,000-record synthetic AML dataset generated with the Synthetic Data Vault (SDV) and Faker, we train Random Forest and Multilayer Perceptron classifiers with class weighting and F₂-optimized threshold tuning to maximize minority-class recall. Model performance is evaluated using PR-AUC, precision and recall for the suspicious class, F₁ score, MCC, balanced accuracy, and probability calibration. Global and local interpretability is provided by TreeSHAP and KernelSHAP, enabling analysts to understand feature contributions and diagnose false positives and false negatives. Fairness audits across age and regional proxies reveal Equal Opportunity gaps, which are mitigated via post-processing threshold adjustments. Results show substantially improved AML recall at operating points that satisfy regulatory requirements and provide transparent, auditable outputs aligned with Bank Secrecy Act (BSA) and FATF guidance. This work offers U.S. financial institutions a deployable framework that enhances compliance efficiency, reduces false positives, and supports supervisory review, replication, and industry benchmarking.
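
Two ideas named in the abstract, F₂-optimized threshold tuning and the Equal Opportunity gap, can be illustrated with a minimal, dependency-free Python sketch. This is not the preprint's code: the function names, the toy labels, scores, and group tags are all hypothetical and chosen only to show the arithmetic. F₂ is the F-beta score with beta = 2, which weights recall more heavily than precision; the Equal Opportunity gap is computed here as the largest difference in true positive rate between groups.

```python
from typing import List, Sequence, Tuple


def fbeta(y_true: Sequence[int], y_pred: Sequence[int], beta: float = 2.0) -> float:
    """F-beta for the positive (suspicious) class; beta=2 favors recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = (1 + beta**2) * tp + beta**2 * fn + fp
    return (1 + beta**2) * tp / denom if denom else 0.0


def predict(scores: Sequence[float], threshold: float) -> List[int]:
    """Flag a record as suspicious when its score reaches the threshold."""
    return [1 if s >= threshold else 0 for s in scores]


def best_threshold(y_true: Sequence[int], scores: Sequence[float],
                   beta: float = 2.0) -> Tuple[float, float]:
    """Scan every observed score as a candidate threshold; keep the best F-beta."""
    best_t, best_f = 0.5, -1.0
    for t in sorted(set(scores)):
        f = fbeta(y_true, predict(scores, t), beta)
        if f > best_f:
            best_t, best_f = t, f
    return best_t, best_f


def equal_opportunity_gap(y_true: Sequence[int], y_pred: Sequence[int],
                          groups: Sequence[str]) -> float:
    """Largest difference in true positive rate across groups with positives."""
    rates = []
    for g in sorted(set(groups)):
        hits = [p for t, p, grp in zip(y_true, y_pred, groups) if grp == g and t == 1]
        if hits:  # skip groups with no suspicious records
            rates.append(sum(hits) / len(hits))
    return max(rates) - min(rates) if rates else 0.0


# Hypothetical toy data: 3 suspicious records out of 10, two groups A and B.
y_true = [0, 0, 0, 0, 0, 0, 1, 0, 1, 1]
scores = [0.05, 0.10, 0.15, 0.20, 0.30, 0.35, 0.40, 0.55, 0.60, 0.90]
groups = ["A", "A", "B", "A", "B", "B", "B", "A", "A", "B"]

tuned_t, tuned_f2 = best_threshold(y_true, scores)
gap_default = equal_opportunity_gap(y_true, predict(scores, 0.5), groups)
gap_tuned = equal_opportunity_gap(y_true, predict(scores, tuned_t), groups)
print(tuned_t, tuned_f2, gap_default, gap_tuned)
```

On this toy data, lowering the threshold from the default 0.5 to the F₂-optimal value recovers the one suspicious record in group B that the default missed, so the true positive rates of the two groups coincide. On real data a single global threshold will not generally close the gap, which is why the abstract describes separate post-processing threshold adjustments.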

Version published to 10.21203/rs.3.rs-7724977/v1 on Research Square
Nov 4, 2025

Related articles

Integrating Model Explainability and Uncertainty Quantification for Trustworthy Fraud Detection

This article has 2 authors:
1. Tebogo Mapaila
2. Makhamisa Senekane
This article has no evaluations. Latest version: Jan 7, 2026

Objective over Architecture: Fraud Detection Under Extreme Imbalance in Bank Account Opening

This article has 6 authors:
1. Wenxi Sun
2. Qiannan Shen
3. Yijun Gao
4. Qinkai Mao
5. Tongsong Qi
6. Shuo Xu
This article has no evaluations. Latest version: Dec 9, 2025

Utility Optimized Anti Money Laundering Detection Using ISO 20022 Trade Graphs Conformal Graph Neural Networks and SHAP

This article has 1 author:
1. Pristly Turjo Mazumder
This article has no evaluations. Latest version: Dec 13, 2025
