LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Code generation aims to automatically produce code from input requirements, significantly enhancing development efficiency. Recent approaches based on large language models (LLMs) have shown promising results and have revolutionized the code generation task. Despite this promising performance, LLMs often generate content with hallucinations, especially in code generation scenarios that require handling complex contextual dependencies in practical development. Although previous work has analyzed hallucinations in LLM-powered code generation, it is limited to standalone function generation. In this paper, we conduct an empirical study of the phenomena, mechanism, and mitigation of LLM hallucinations in a more practical and complex development context: the repository-level code generation scenario. First, we manually examine the code generation results of six mainstream LLMs to establish a hallucination taxonomy of LLM-generated code. Next, we elaborate on the phenomenon of hallucinations and analyze their distribution across the different models. We then analyze the causes of hallucinations and identify four potential contributing factors. Finally, we propose an RAG-based mitigation method, which demonstrates consistent effectiveness across all studied LLMs. The replication package, including code, data, and experimental results, is anonymously available at https://anonymous.4open.science/r/LLMCodingHallucination/.
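The abstract does not describe the authors' RAG-based mitigation in detail; purely as an illustration, retrieval-augmented generation for repository-level code generation can be sketched as retrieving relevant repository snippets and prepending them to the prompt so the model is grounded in real project context. The keyword-overlap retriever and prompt format below are illustrative assumptions, not the paper's method:

```python
# Generic RAG sketch for repository-level code generation.
# NOT the paper's method: the retriever (word overlap) and prompt
# layout are simplified assumptions for illustration only.

def retrieve(query: str, snippets: list[str], k: int = 2) -> list[str]:
    """Rank repository snippets by word overlap with the requirement."""
    q = set(query.lower().split())
    scored = sorted(snippets, key=lambda s: -len(q & set(s.lower().split())))
    return scored[:k]

def build_prompt(requirement: str, snippets: list[str]) -> str:
    """Prepend retrieved repository context to ground the LLM's output."""
    context = "\n".join(f"# context: {s}" for s in retrieve(requirement, snippets))
    return f"{context}\n# task: {requirement}\n"

# Toy "repository" of existing code summaries (hypothetical examples).
repo = [
    "def load_config(path): ...  # reads JSON config",
    "class UserStore: ...  # persists users to the database",
    "def send_email(to, body): ...",
]

prompt = build_prompt("load the JSON config file", repo)
```

The intuition behind such mitigation is that hallucinations often stem from the model guessing at project-specific identifiers; supplying retrieved, actually existing code reduces that guesswork.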
Wed 25 Jun. Time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.
11:00 - 12:30 | Program Repair (Tool Demonstrations / Research Papers) at Aurora A. Chair(s): Yannic Noller (Ruhr University Bochum)

11:00 (25m, Talk) | LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation. Research Papers. Ziyao Zhang (Sun Yat-sen University), Chong Wang (Nanyang Technological University), Yanlin Wang (Sun Yat-sen University), Ensheng Shi (Xi'an Jiaotong University), Yuchi Ma (Huawei Cloud Computing Technologies), Wanjun Zhong (Sun Yat-sen University), Jiachi Chen (Sun Yat-sen University), Mingzhi Mao (Sun Yat-sen University), Zibin Zheng (Sun Yat-sen University)

11:25 (25m, Talk) | AdverIntent-Agent: Adversarial Reasoning for Repair Based on Inferred Program Intent. Research Papers. He Ye (University College London (UCL)), Aidan Z.H. Yang (Carnegie Mellon University), Chang Hu (Macau University of Science and Technology), Yanlin Wang (Sun Yat-sen University), Tao Zhang (Macau University of Science and Technology), Claire Le Goues (Carnegie Mellon University)

11:50 (25m, Talk) | PatchScope: LLM-Enhanced Fine-Grained Stable Patch Classification for Linux Kernel. Research Papers. Rongkai Liu (Central South University), Heyuan Shi (Central South University), Shuning Liu (Central South University), Chao Hu (Central South University), Sisheng Li (Central South University), Yuheng Shen (Tsinghua University), Runzhe Wang (Alibaba Group), Xiaohai Shi (Alibaba Group), Yu Jiang (Tsinghua University)

12:15 (15m, Demonstration) | InfraFix: Technology-Agnostic Repair of Infrastructure as Code. Tool Demonstrations. Nuno Saavedra (INESC-ID and IST, University of Lisbon), João F. Ferreira (INESC-ID and IST, University of Lisbon), Alexandra Mendes (Faculty of Engineering, University of Porto & INESC TEC)
Aurora A is the first room in the Aurora wing.
When facing the main Cosmos Hall, the access to the Aurora wing is on the right, close to the side entrance of the hotel.