Integrating Large Language Models into Automated Software Testing

Yanet Sáez Iznaga
Luís Rato
Pedro Salgueiro
Javier Lamar León

Abstract

This work investigates the use of large language models (LLMs) to enhance automation in software testing, focusing on the generation of high-quality, context-aware test scripts from natural language descriptions. It addresses both text-to-code and text+code-to-code generation tasks. The Codestral Mamba model was fine-tuned using a proposed method for integrating LoRA matrices into its architecture, which enables efficient domain-specific adaptation and positions Mamba as a viable alternative to Transformer-based models. The model was trained and evaluated on two benchmark datasets: CONCODE/CodeXGLUE and the proprietary TestCase2Code dataset. Through structured prompt engineering, the system was optimized to generate syntactically valid and semantically meaningful test-case code. Experimental results demonstrate that the proposed methodology enables the automatic generation of code-based test cases with large language models. The work also reports secondary benefits, including improvements in test coverage, automation efficiency, and defect detection compared with traditional manual approaches. Integrating LLMs into the software testing pipeline also showed potential for reducing time and cost while enhancing developer productivity and software quality. The findings suggest that LLM-driven approaches can be effectively aligned with continuous integration and deployment workflows. This work contributes to the growing body of research on AI-assisted software engineering and offers practical insights into the capabilities and limitations of current LLM technologies for testing automation.
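The abstract does not describe how the LoRA matrices are wired into the Mamba architecture, so the following is only a minimal sketch of the generic LoRA idea applied to a single linear projection: a frozen weight W is augmented with a trainable low-rank update scaled by alpha / r. The class name `LoRALinear`, the helper `matvec`, and all parameter values are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Frozen weight W plus a low-rank update (alpha / rank) * B @ A.

    Hypothetical illustration of generic LoRA; not tied to any
    specific Mamba layer or to the paper's integration scheme.
    """

    def __init__(self, weight, rank, alpha, seed=0):
        rng = random.Random(seed)
        out_dim, in_dim = len(weight), len(weight[0])
        self.weight = weight          # frozen, shape (out_dim, in_dim)
        self.scale = alpha / rank     # standard LoRA scaling factor
        # A starts with small random values, shape (rank, in_dim).
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(in_dim)]
                  for _ in range(rank)]
        # B starts at zero, shape (out_dim, rank), so the adapted layer
        # initially behaves exactly like the frozen layer.
        self.B = [[0.0] * rank for _ in range(out_dim)]

    def forward(self, x):
        base = matvec(self.weight, x)                 # W x
        low_rank = matvec(self.B, matvec(self.A, x))  # B (A x)
        return [b + self.scale * u for b, u in zip(base, low_rank)]

    def trainable_parameter_count(self):
        """Only A and B would be updated during fine-tuning."""
        return (len(self.A) * len(self.A[0])
                + len(self.B) * len(self.B[0]))
```

For a 4x6 weight with rank 2, this adapter trains 20 values instead of 24; the saving becomes substantial only for the much larger projections found in real models, which is the usual motivation for low-rank adaptation.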

Version published to 10.20944/preprints202509.1433.v1
Sep 18, 2025

Related articles

Automating Code Generation for a New Ecosystem: Establishing Baselines with Large Language Model Based Code Generation for ArkTS and HarmonyOS

This article has 3 authors:
1. Mehmet Cem Aytekin
2. Fatma Gizem Calli
3. Mustafa Umut Demirezen
This article has no evaluations
Latest version Sep 4, 2025

Survey and Benchmarking of Large Language Models for RTL Code Generation: Techniques and Open Challenges

This article has 4 authors:
1. Arun Ravindran
2. Aditya Patra
3. Vahid Babaey
4. Suresh Purini
This article has no evaluations
Latest version Sep 19, 2025

Securing the Software Development Lifecycle with Large Language Models: A Framework for Automated Threat Modeling and Secure Code Generation

This article has 6 authors:
1. Shuvo Chakraborty
2. Mehedi Hassan
3. Habibullah Mohammad Masum
4. Md Rakibul Islam Fahim
5. Sayed Mahmood Twki
6. Md. Badiuzzaman Biplob
This article has no evaluations
Latest version Aug 11, 2025
