Unit Testing Past vs. Present - Examining LLMs' Impact on Defect Detection and Efficiency (ICST 2025 - Posters)

Mon 31 March - Fri 4 April 2025 Naples, Italy

Who

Rudolf Ramler, Philipp Straubinger, Reinhold Plösch, Dietmar Winkler

Track

ICST 2025 Posters

Abstract

The integration of Large Language Models (LLMs), such as ChatGPT and GitHub Copilot, into software engineering workflows has shown potential to enhance productivity, particularly in software testing. This paper investigates whether LLM support improves defect detection effectiveness during unit testing. Building on prior studies comparing manual and tool-supported testing, we replicated and extended an experiment where participants wrote unit tests for a Java-based system with seeded defects within a time-boxed session, supported by LLMs. Comparing LLM supported and manual testing, results show that LLM support significantly increases the number of unit tests generated, defect detection rates, and overall testing efficiency. These findings highlight the potential of LLMs to improve testing and defect detection outcomes, providing empirical insights into their practical application in software testing.

Rudolf Ramler

Software Competence Center Hagenberg (SCCH)

Unit Testing Past vs. Present - Examining LLMs' Impact on Defect Detection and Efficiency

Rudolf Ramler

Software Competence Center Hagenberg (SCCH)

Austria

Philipp Straubinger

University of Passau

Germany

Reinhold Plösch

Johannes Kepler University

Austria

Dietmar Winkler

Vienna University of Technology, Austria

Austria

Tracks

Workshops