A Study on Explainable Artificial Intelligence (XAI) in Malware Detection for Proactive Cyber Threat Hunting

Abstract

Effective malware detection increasingly requires machine learning models that are both accurate and interpretable, yet high-performing models often suffer from poor explainability. This study addresses that gap by integrating advanced Explainable Artificial Intelligence (XAI) frameworks with well-established ML algorithms to create transparent, trustworthy detection systems. We evaluate Decision Tree, Random Forest, XGBoost, Naïve Bayes, and Kernel SVM classifiers, using SHAP (SHapley Additive exPlanations) for feature-importance assessment. Models with limited interpretability (Kernel SVM, Decision Tree) are excluded, and the remaining models undergo ELI5 permutation-importance validation to confirm feature rankings and decision logic. XGBoost emerges as the optimal choice owing to its superior accuracy, its handling of complex non-linear relationships, and its stable, reproducible explanations. We then apply further XAI techniques, namely Partial Dependence Plots (PDP), Individual Conditional Expectation (ICE) plots, and 2D Accumulated Local Effects (2D-ALE), to reveal global and local trends, feature interactions, and non-linear patterns among the model's most influential features. The analysis demonstrates that tree-based ensemble models, when paired with rigorous XAI techniques, yield transparent and operationally trustworthy tools for cybersecurity. The result is a methodology that couples high predictive performance with actionable intelligence, enabling security practitioners to validate and deploy ML-based malware classifiers with confidence.
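To make the described workflow concrete, the sketch below illustrates the general shape of such a pipeline: train an XGBoost classifier, compute SHAP feature importances, cross-check the ranking with permutation importance, and inspect the top features with PDP/ICE plots. It is not the authors' exact pipeline: the synthetic dataset and hyperparameters are placeholders, scikit-learn's permutation_importance stands in for ELI5, and the 2D-ALE step is omitted because it requires a dedicated library (e.g., alibi or PyALE).

```python
# Illustrative sketch of the evaluate-then-explain workflow from the abstract.
# Synthetic data stands in for the real malware feature set.
import shap
import xgboost as xgb
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.inspection import permutation_importance, PartialDependenceDisplay

# Placeholder binary "malware vs. benign" dataset.
X, y = make_classification(n_samples=2000, n_features=20, n_informative=8,
                           random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25,
                                                    random_state=42)

# Candidate model: XGBoost, the classifier the study ultimately selects.
model = xgb.XGBClassifier(n_estimators=300, max_depth=6, eval_metric="logloss")
model.fit(X_train, y_train)

# 1) SHAP feature-importance assessment (TreeExplainer handles tree ensembles).
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)
shap.summary_plot(shap_values, X_test, show=False)

# 2) Permutation-importance validation of the feature ranking
#    (scikit-learn's implementation used here in place of ELI5's).
perm = permutation_importance(model, X_test, y_test, n_repeats=10,
                              random_state=42)
top_features = perm.importances_mean.argsort()[::-1][:5]
print("Top permutation-ranked features:", top_features)

# 3) Global and local effect analysis: PDP with overlaid ICE curves
#    (kind="both") for the two most influential features.
PartialDependenceDisplay.from_estimator(model, X_test,
                                        features=top_features[:2].tolist(),
                                        kind="both")
```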
