PentestMCP: LLM and MCP Based Multi-Agent Framework for Automated Penetration Testing

Jiqiang Zhai
Xinyi Zhou
Hong Miao
Zekun Li
Zhe Li
Hailu Yang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

@Ryzen's saved articles (Ryzen)

Abstract

As information systems grow increasingly complex and cyberattack techniques continue to evolve, traditional penetration testing heavily dependent on manual expertise and operations---faces serious challenges in both efficiency and scalability. To overcome these limitations, this paper introduces PentestMCP, an end-to-end automated penetration testing framework driven by large language models (LLMs). The framework integrates three core components: a multi-agent architecture that covers the complete workflow of Information gathering, Vulnerability discovery, and exploitation; the Model Context Protocol (MCP), which standardizes tool orchestration; and retrieval-augmented generation (RAG), which strengthens contextual reasoning and reduces execution errors. In addition, PentestMCP employs a dual-path execution strategy together with a Penetration Task Graph (PTG) to achieve autonomous task decomposition, dynamic scheduling, and closed-loop control. We evaluated PentestMCP on more than one hundred real-world vulnerabilities collected from VulHub and the National Vulnerability Database, spanning diverse CWE categories and varying complexity levels. Experimental results show that PentestMCP consistently achieves higher success rates, stability, and efficiency than existing baselines, while also reducing token consumption and execution time. Using GPT-4.1, the system achieved average success rates of 87.3% for Information gathering, 62.3% for Vulnerability discovery, and 56.6% for exploitation. The findings strongly validate that an LLM and MCP-based multi-agent framework holds substantial potential for advancing the automation, scalability, and practical applicability of penetration testing.

Version published to 10.21203/rs.3.rs-7582841/v1 on Research Square
Nov 4, 2025

Multi Hop AI Agent Suite - Architecture

This article has 3 authors:
1. Sharan Kumar Yenugula
2. Revanth Ch
3. Venkat Kotipally
This article has no evaluationsLatest version Feb 18, 2026
Towards a Science of Scaling Agent Systems

This article has 20 authors:
1. Yubin Kim
2. Ken Gu
3. Chanwoo Park
4. Chunjong Park
5. Samuel Schmidgall
6. A. Ali Heydari
7. Yao Yan
8. Zhihan Zhang
9. Yuchen Zhuang
10. Yun Liu
11. Mark Malhotra
12. Paul Liang
13. Hae Won Park
14. Yuzhe Yang
15. Xuhai Xu
16. Yilun Du
17. Shwetak Patel
18. Tim Althoff
19. Daniel McDuff
20. Xin Liu
This article has no evaluationsLatest version Jan 23, 2026
A Discovery Technique for Expressive Yet Sound Process Models

This article has 3 authors:
1. Humam Kourani
2. Gyunam Park
3. Wil M.P. van der Aalst
This article has no evaluationsLatest version Jan 12, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Multi Hop AI Agent Suite - Architecture

Towards a Science of Scaling Agent Systems

A Discovery Technique for Expressive Yet Sound Process Models