ICSE 2026
Sun 12 - Sat 18 April 2026 Rio de Janeiro, Brazil
Wed 15 Apr 2026 11:30 - 11:45 at Oceania IX - Testing and Analysis 1 Chair(s): Michael Pradel

Fixing software bugs is crucial yet demands significant resources from developers. Automated Program Repair (APR) is a promising solution to address this challenging task. The emergence of Large Language Models (LLMs) has opened a new era of LLM-based APR, substantially advancing the APR field further. LLM-based APR methods face significant challenges regarding memory inefficiency, hindering their scalability and effectiveness. This is largely due to the beam search utilized in the patch generation phase of LLM-based APR, which requires large beam sizes to search for more potentially good repair candidates.

In this paper, we first show that increases in beam size, even for small-sized LLMs (1B-7B params), require extensive GPU usage, leading to up to 80% of recurring crashes due to memory overloads in LLM-based APR. Seemingly simple solutions to reduce memory consumption are (1) to quantize LLM models, i.e., converting the weights of an LLM from high-precision values to lower-precision ones, and (2) to make beam search sequential, i.e., forwarding each beam through the model sequentially and then concatenating them back into a single output. However, we show that these approaches still do not work via both theoretical analysis and experiments.

To address this, we introduce FLAMES, a novel LLM-based APR technique that employs semantic-guided patch generation to enhance repair effectiveness and memory efficiency. Unlike conventional methods that rely on beam search, FLAMES utilizes greedy decoding to enhance memory efficiency while steering the search towards more potentially good repair candidates via a semanticguided best-first search algorithm. At each decoding step, FLAMES uses semantic feedback from test validation, such as the number of passing and failing test cases, to select the most promising token to explore further. Our empirical evaluation on Defects4J shows that FLAMES substantially reduces memory consumption by up to 83% compared to LLM-based APR without compromising time efficiency. Moreover, FLAMES correctly fixes 133 bugs on Defects4J, fixing 10 bugs more than the best baseline. Additionally, these improvements also generalize to the HumanEval-Java and TransformedD4J datasets, where FLAMES generates 12% and 36.5% more correct patches, respectively, than the best baseline.

Wed 15 Apr

Displayed time zone: Brasilia, Distrito Federal, Brazil change

11:00 - 12:30
Testing and Analysis 1SE In Practice (SEIP) / Research Track at Oceania IX
Chair(s): Michael Pradel CISPA Helmholtz Center for Information Security
11:00
15m
Talk
BFix: Automated Safe Memory-Leak Fixing for Binary CodeVirtual Attendance
Research Track
Wen Zhang University of Georgia, Botang Xiao University of Georgia, Qingchen Kong University of Georgia, Boyang Yi University of Georgia, Suxin Ji University of Georgia, USA, Yage Hu University of Georgia, Songlan Wang University of Georgia, Wenwen Wang University of Georgia
11:15
15m
Talk
Learning without Forgetting: Towards Continual learning of Fault Localization Models in Industrial Software SystemsVirtual Attendance
Research Track
Chun Li Nanjing University, Hui Li Samsung Electronics (China) R&D Centre, Zhong Li Nanjing University, Minxue Pan Nanjing University, Xuandong Li Nanjing University
Media Attached File Attached
11:30
15m
Talk
Memory-Efficient Large Language Models for Program Repair with Semantic-Guided Patch GenerationVirtual Attendance
Research Track
Thanh Le-Cong Singapore University of Technology and Design, Singapore, Xuan-Bach D. Le University of Melbourne, Toby Murray University of Melbourne
Media Attached
11:45
15m
Talk
Addressing Test Flakiness: Practical Approaches in a Database-Reliant Industrial System
SE In Practice (SEIP)
George Vegelien Delft University of Technology, Carolin Brandt Delft University of Technology, Bas Graaf Exact, Arie van Deursen TU Delft
Pre-print
12:00
15m
Talk
XTrace: A Non-Invasive Dynamic Tracing Framework for Android Applications in Production
SE In Practice (SEIP)
Qi Hu ByteDance, Jiangchao Liu ByteDance, Lin Zhang ByteDance, Edward Jiang ByteDance, Xin Yu ByteDance
12:15
15m
Talk
Delta Debugging for LLM-integrated Systems
SE In Practice (SEIP)
Hao-Nan Zhu University of California, Davis, Muhammad Numair Mansur Amazon Web Services, Martin Schäf Amazon Web Services, Zeya Chen Amazon Web Services, Tancrède Lepoint Amazon, Willem Visser Amazon Web Services