Self-Debugging AI: A Comprehensive Analysis of Claude 4.1 Sonnet's Code Generation and Error Resolution Capabilities
Abstract
This paper presents a novel meta-experimental approach to analyzing the debugging capabilities of large language models (LLMs), specifically Claude 3 Opus. In a carefully designed experiment, the AI system first generates intentionally buggy code and then debugs it without prior knowledge of the injected defects; we document and analyze the systematic debugging methodology that emerges. The experiment involved a Python-based Task Management System containing 12 distinct bug categories, ranging from simple syntax errors to complex runtime issues. The AI successfully identified and resolved all bugs using a methodical, error-driven approach that mirrors human debugging strategies. Key findings include the AI's ability to: (1) prioritize syntax errors before runtime issues, (2) leverage Python's error messages effectively, (3) implement comprehensive fixes with proper error handling, and (4) validate solutions through automated testing. This research contributes to understanding AI's role in automated software debugging and has implications for the future of AI-assisted software development, code review processes, and programming education.
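To make the experimental setup concrete, the sketch below shows a hypothetical example of one bug category described above (a runtime error in a minimal task manager), together with the kind of fix the study reports: input validation with proper error handling, verified by a small automated check. The `TaskManager` class and its methods are illustrative assumptions, not the actual Task Management System used in the experiment.

```python
from dataclasses import dataclass, field


@dataclass
class TaskManager:
    """Minimal task store used to illustrate one hypothetical bug category."""
    tasks: dict = field(default_factory=dict)

    def add_task(self, task_id: str, title: str) -> None:
        self.tasks[task_id] = {"title": title, "done": False}

    def complete_task_buggy(self, task_id: str) -> None:
        # Intentional runtime bug: raises KeyError when the task does not exist.
        self.tasks[task_id]["done"] = True

    def complete_task_fixed(self, task_id: str) -> bool:
        # Fixed version: validates the lookup and reports failure instead of crashing.
        task = self.tasks.get(task_id)
        if task is None:
            return False
        task["done"] = True
        return True


if __name__ == "__main__":
    # Automated validation of the fix, mirroring the test-driven checks in the study.
    mgr = TaskManager()
    mgr.add_task("t1", "Write abstract")
    assert mgr.complete_task_fixed("t1") is True
    assert mgr.complete_task_fixed("missing") is False  # handled gracefully, no KeyError
    print("all checks passed")
```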