A Multimodal Phishing Website Detection System Using Explainable Artificial Intelligence Technologies

Alexey Vulfin
Alexey Sulavko
Vladimir Vasiliev
Alexander Minko
Anastasia Kirillova
Alexander Samotuga

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The purpose of the present study is to improve the efficiency of phishing web resource detection through multimodal analysis and using methods of explainable artificial intelligence. We propose a late fusion architecture in which independent specialized models process four modalities and are combined using weighted voting. The first branch uses CatBoost for URL features and metadata; the second uses CNN1D for symbolic-level URL representation; the third uses a Transformer based on a pretrained CodeBERT for the homepage HTML code; and the fourth uses EfficientNet-B7 for page screenshot analysis. SHAP, Grad-CAM, and attention matrices are used to interpret decisions; a local LLM generates a consolidated textual explanation. A prototype system based on a microservice architecture, integrated with the SOC, has been developed. This integration enables streaming processing and reproducible validation. Computational experiments using our own updated dataset and the public MTLP dataset show high performance: F1-scores of up to 0.989 on our own dataset and 0.953 on MTLP; multimodal fusion consistently outperforms single-modal baseline models. The practical significance of this approach for zero-day detection and false positive reduction, through feature alignment across modalities and explainability, is demonstrated. All limitations and operational aspects (data drift, adversarial robustness, LLM latency) of the proposed prototype are presented. We also outline areas for further research.

Version published to 10.3390/make8010011
Jan 4, 2026
Version published to 10.20944/preprints202511.1683.v1
Nov 24, 2025

A Hybrid Deep Ensemble Framework with Interpretability for Phishing URL Detection

This article has 2 authors:
1. Yaozu Xue
2. Yinjie Zhang
This article has no evaluationsLatest version Dec 10, 2025
A Multimodal Adaptive Graph-based Intelligent Classification Model for Fake News

This article has 1 author:
1. Junhao Xu
This article has no evaluationsLatest version Jan 20, 2026
A Binary Genetic Harris Hawks Optimization With Machine Learning on Detection of Phishing Url

This article has 2 authors:
1. Ponni Ponnusamy
2. Priyadharsini Ganesan
This article has no evaluationsLatest version Dec 12, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Hybrid Deep Ensemble Framework with Interpretability for Phishing URL Detection

A Multimodal Adaptive Graph-based Intelligent Classification Model for Fake News

A Binary Genetic Harris Hawks Optimization With Machine Learning on Detection of Phishing Url