Large Language Models (LLMs) show remarkable performance in generating source code, yet the generated code often suffers from issues such as compilation errors or functionally incorrect behavior. Researchers and developers waste effort implementing ad-hoc checks and refining LLM-generated code, frequently duplicating one another's work. This paper presents LLMLOOP, a framework that automates the refinement of both the source code and the test cases produced by LLMs. LLMLOOP employs iterative refinement loops that resolve compilation errors, address static analysis issues, fix test case failures, and improve test quality through mutation analysis. These loops yield high-quality test cases that serve both as a validation mechanism and as a regression test suite for the generated code. We evaluated LLMLOOP on HUMANEVAL-X, a recent benchmark of programming tasks. The results demonstrate the tool's effectiveness in refining LLM-generated outputs. A demonstration video of the tool is available at https://youtu.be/2CLG9x1fsNI
Pierre Martou, Benoît Duhoux, Kim Mens, and Axel Legay (Université catholique de Louvain, ICTEAM institute, Belgium)
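
To illustrate the kind of pipeline the abstract describes, the sketch below shows a generic generate-check-fix loop that cycles through compilation, static analysis, test execution, and mutation analysis. It is a minimal sketch only: all names (generate_candidate, llm_fix, compile_errors, lint_warnings, failing_tests, surviving_mutants) are hypothetical stand-ins for LLM and tooling calls, not LLMLOOP's actual API or loop structure.

```python
"""Minimal sketch of an iterative LLM-refinement loop (hypothetical, not the LLMLOOP API)."""
from dataclasses import dataclass


@dataclass
class Candidate:
    code: str
    tests: str


# --- hypothetical stand-ins for LLM and external tooling calls ---------------
def generate_candidate(task: str) -> Candidate:
    # In a real pipeline: prompt an LLM for an initial solution and test suite.
    return Candidate(code=f"# solution for: {task}", tests="# tests")


def llm_fix(cand: Candidate, feedback: str) -> Candidate:
    # In a real pipeline: feed the diagnostics back to the LLM for a repair.
    return Candidate(cand.code + f"\n# fixed: {feedback}", cand.tests)


def compile_errors(cand: Candidate) -> list[str]:    # e.g. compiler output
    return []


def lint_warnings(cand: Candidate) -> list[str]:     # e.g. static analyzer output
    return []


def failing_tests(cand: Candidate) -> list[str]:     # e.g. test runner output
    return []


def surviving_mutants(cand: Candidate) -> list[str]:  # e.g. mutation testing report
    return []


# --- the refinement loop ------------------------------------------------------
def refine(task: str, max_rounds: int = 5) -> Candidate:
    cand = generate_candidate(task)
    for _ in range(max_rounds):
        for check in (compile_errors, lint_warnings, failing_tests, surviving_mutants):
            issues = check(cand)
            if issues:
                # Some check failed: ask the LLM to repair, then restart the checks.
                cand = llm_fix(cand, "; ".join(issues))
                break
        else:
            break  # every check passed: accept the code and its tests
    return cand


if __name__ == "__main__":
    print(refine("sort a list of integers").code)
```

In this sketch the surviving checks double as the paper's claimed benefit: once the loop terminates, the accepted test suite has already been used to validate the code and can be kept as a regression suite.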