Detecting malicious websites using machine learning models by incorporating both lexical and network-based features.

Daniel Kwadwo Nterful
Richard Appiah
David Ezejimofor

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The utilization of blacklists is a commonly used approach for detecting malicious websites. However, blacklists have limitations as they lack comprehensive information and cannot be easily updated to include newly discovered harmful websites. To enhance security and reduce vulnerability to these attacks, it is crucial to employ techniques that can automatically identify and manage newly emerging malicious websites. In this regard, machine learning models offer a promising solution. By utilizing eight different machine learning models, namely Random Forests (RF), Decision Trees (DT), Logistic Regression (LR), Naive Bayes (NB), K-Nearest Neighbors (KNN), Support Vector Machines (SVM), XGBoost, and LightGBM, it is possible to detect and classify malicious websites effectively. These models leverage the power of machine learning algorithms to analyze various features and patterns associated with malicious URLs, enabling accurate identification and proactive defense against such threats. Additionally, it investigates the application of ensemble methods, particularly the Stacking method, to create a brand-new model known as DKN. The study explores the experimental assessment, including the dataset source, feature extraction, and evaluation measures, and presents the architecture of the DKN model. The outcomes show how well the suggested models and the ensemble DKN stacking model predict the characteristics of URLs. The paper looks at methods like downsampling and oversampling to enhance model performance as well as the problem of imbalanced datasets. By investigating the fusion of several variables and machine-learning models to produce precise predictions, the research makes a contribution to the field of malicious website identification.

Version published to 10.21203/rs.3.rs-6256091/v1 on Research Square
Mar 20, 2025

A Comparative Analysis of Machine Learning Models for URL-Based Phishing Detection

This article has 4 authors:
1. Rafi MRM
2. Nuski F.A.M
3. Suhaif A.M
4. Shaminda K.A.S
This article has no evaluationsLatest version Apr 15, 2025
Detecting Zero-Day Web Attacks Using One-Class Ensemble Classifiers

This article has 2 authors:
1. Vahid Babaey
2. Hamid Reza Faragardi
This article has no evaluationsLatest version Mar 4, 2025
AI-Powered Defence: Leveraging Deep Learning for Effective Malware Detection

This article has 1 author:
1. Nancy Awadallah Awad
This article has no evaluationsLatest version Apr 21, 2025

Listed in

Abstract

Article activity feed

Related articles

A Comparative Analysis of Machine Learning Models for URL-Based Phishing Detection

Detecting Zero-Day Web Attacks Using One-Class Ensemble Classifiers

AI-Powered Defence: Leveraging Deep Learning for Effective Malware Detection