A System for Automated Unit Test Generation Using Large Language Models and Assessment of Generated Test Suites
Unit tests are fundamental for ensuring software correctness, but they are costly and time-consuming to design and write. Recent advances in Large Language Models (LLMs) have shown potential for automating test generation, yet existing evaluations often focus on simple scenarios and do not scale to real-world applications. To address these limitations, we present AgoneTest, an automated system for generating and assessing complex, class-level test suites for Java projects. Leveraging the Methods2Test dataset, we developed Classes2Test, a new dataset that enables the evaluation of LLM-generated tests against their human-written counterparts. Our key contributions are a scalable, automated software system, a new dataset, and a detailed methodology for evaluating test quality.