Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback

Shiyin Lin

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Software fuzzing has become a cornerstone in automated vulnerability discovery, yet existing mutation strategies often lack semantic awareness, leading to redundant test cases and slow exploration of deep program states. In this work, we present a hybrid fuzzing framework that integrates static and dynamic analysis with Large Language Model (LLM)-guided input mutation and semantic feedback. Static analysis extracts control-flow and data-flow information, which is transformed into structured prompts for the LLM to generate syntactically valid and semantically diverse inputs. During execution, we augment traditional coverage-based feedback with semantic feedback signals—derived from program state changes, exception types, and output semantics—allowing the fuzzer to prioritize inputs that trigger novel program behaviors beyond mere code coverage. We implement our approach atop AFL++, combining program instrumentation with embedding-based semantic similarity metrics to guide seed selection. Evaluation on real-world open-source targets, including libpng, tcpdump, and sqlite, demonstrates that our method achieves faster time-to-first-bug, higher semantic diversity, and a competitive number of unique bugs compared to state-of-the-art fuzzers. This work highlights the potential of combining LLM reasoning with semantic-aware feedback to accelerate and deepen vulnerability discovery.

Version published to 10.20944/preprints202509.1822.v1
Sep 23, 2025

LLM-Driven Adaptive Source–Sink Identification and False Positive Mitigation for Static Analysis

This article has 1 author:
1. Shiyin Lin
This article has no evaluationsLatest version Sep 10, 2025
Integrating Large Language Models into Automated Software Testing

This article has 4 authors:
1. Yanet Sáez Iznaga
2. Luís Rato
3. Pedro Salgueiro
4. Javier Lamar León
This article has no evaluationsLatest version Sep 18, 2025
Self-Debugging AI: A Comprehensive Analysis of Claude 4.1 Sonnet's Code Generation and Error Resolution Capabilities

This article has 1 author:
1. Harshith Vaddiparthy
This article has no evaluationsLatest version Aug 29, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

LLM-Driven Adaptive Source–Sink Identification and False Positive Mitigation for Static Analysis

Integrating Large Language Models into Automated Software Testing

Self-Debugging AI: A Comprehensive Analysis of Claude 4.1 Sonnet's Code Generation and Error Resolution Capabilities