Enhanced Automated Penetration Testing Using Double Deep Q-Learning

Eman M. Ahmed
Rasha H. Sakr
Mohamed F. Alrahmawy

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

As cyberattacks grow in complexity, traditional manual penetration testing becomes increasingly time-consuming, costly, and dependent on expert knowledge. In this paper, we present an automated penetration testing framework based on Double Deep Q-Learning (DDQN) to enhance attack planning efficiency, stability, and decision-making. The framework builds realistic logical network topologies using real-world vulnerability and host data gathered from the Shodan search engine and the National Vulnerability Database. It produces attack graphs and effective attack paths using MulVAL and then subsequently transforms them into matrix representations appropriate for reinforcement learning. After comparison to the baseline Deep Q-Network (DQN), experimental results on static logical topologies demonstrate that DDQN achieves more stable learning and lower variance, with an average success rate of approximately 65% in reaching the target system. Using these results, we show how well DDQN directs ethical hackers toward effective attack tactics and illustrates the framework's potential for automated penetration testing systems and cybersecurity training.

Version published to 10.21203/rs.3.rs-8888402/v1 on Research Square
Mar 10, 2026

Addressing the Deployment Gap: Hybrid Symbolic-Statistical Vulnerability Detection in Safety-Critical C/C++ Systems

This article has 5 authors:
1. Jude E. Ameh
2. Abayomi Otebolaku
3. Augustine Ikpehai
4. Alex Shenfield
5. Dauda Sule
This article has no evaluationsLatest version Apr 10, 2026
Test Case Generation with Hecate: To Infinity and Beyond!

This article has 8 authors:
1. Nunzio Marco Bisceglia
2. Michael Marzella
3. Daniele Lazzari
4. Marcello Minervini
5. Federico Formica
6. Angelo Gargantini
7. Claudio Menghi
8. Andrea Bombarda
This article has no evaluationsLatest version Apr 13, 2026
Critical Safety Attention Heads: Architecture-Dependent Vulnerabilities in LLMs

This article has 5 authors:
1. Letian Sha
2. Peijie Sun
3. Hao Xue
4. Shijie Hao
5. Fu Xiao
This article has no evaluationsLatest version Mar 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Addressing the Deployment Gap: Hybrid Symbolic-Statistical Vulnerability Detection in Safety-Critical C/C++ Systems

Test Case Generation with Hecate: To Infinity and Beyond!

Critical Safety Attention Heads: Architecture-Dependent Vulnerabilities in LLMs