ICSE 2026
Sun 12 - Sat 18 April 2026 Rio de Janeiro, Brazil

Large language models (LLMs) and LLM-based agents have been applied to fix bugs automatically, demonstrating their capability to address software defects through development environment interaction, iterative validation, and code modification. However, systematic analysis of these agent systems remains limited, particularly regarding performance variations among the top-performing ones. In this paper, we examine six repair systems on the SWE-bench Verified benchmark for automated bug fixing. We first assess each system's overall performance, identifying the instances solvable by all or by none of the systems, and explore the capabilities of the different systems. We then compare fault localization accuracy at the file and code-symbol levels and evaluate bug reproduction capabilities. Through this analysis, we conclude that further optimization is needed both in the capability of the LLM itself and in the design of the agentic flow to improve the effectiveness of agents in bug fixing.