A Taxonomy of Inefficiencies in LLM-Generated Code (ICSME 2025 - Research Papers Track) - ICSME 2025 - International Conference on Software Maintenance and Evolution

Who

Altaf Allah Abbassi, Leuson Da Silva, Amin Nikanjam, Foutse Khomh

Track

ICSME 2025 Research Papers Track

Time Zone

The program is currently displayed in (GMT+12:00) Auckland, Wellington.

Use conference time zone: (GMT+12:00) Auckland, WellingtonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Thu 11 Sep 2025 11:00 - 11:15 at Case Room 2 260-057 - Session 8 - Code Quality 1 Chair(s): Ronnie de Souza Santos

Abstract

Large Language Models (LLMs) are widely adopted for automated code generation with promising results. Although prior research has assessed LLM-generated code and identified various quality issues- such as redundancy, poor maintainability, and sub-optimal performance- a systematic understanding and categorization of these inefficiencies remain unexplored. Therefore, we empirically investigate inefficiencies in LLM-generated code by state-of-the-art models, i.e., CodeLlama, DeepSeek-Coder, and CodeGemma. To do so, we manually analyze 492 generated code snippets in the HumanEval+ dataset. We then construct a taxonomy of inefficiencies in LLM-generated code that includes 5 categories (General Logic, Performance, Readability, Maintainability, and Errors) and 19 subcategories of inefficiencies. We validate the obtained taxonomy through an online survey with 58 LLM practitioners and researchers. The surveyed participants affirmed the completeness of the proposed taxonomy, and the relevance and the popularity of the identified code inefficiency patterns. Our qualitative findings indicate that inefficiencies are diverse and interconnected, affecting multiple aspects of code quality, with logic and performance-related inefficiencies being the most frequent and often co-occur while impacting overall code quality. Our taxonomy provides a structured basis for evaluating the quality of LLM-generated code and guiding future research to improve code generation efficiency.

Altaf Allah Abbassi

Polytechnique Montreal

Canada

Leuson Da Silva

Polytechnique Montreal

Canada

Amin Nikanjam

Huawei Canada

Canada

Foutse Khomh

Polytechnique Montréal

Canada

Time Zone

The program is currently displayed in (GMT+12:00) Auckland, Wellington.

Use conference time zone: (GMT+12:00) Auckland, WellingtonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Thu 11 Sep
Displayed time zone: Auckland, Wellington change

10:30 - 12:00	Session 8 - Code Quality 1Research Papers Track / Industry Track at Case Room 2 260-057 Chair(s): Ronnie de Souza Santos University of Calgary

10:30 15m		Adoption and Evolution of Code Style and Best Programming Practices in Open-Source Projects Research Papers Track Alvari Kupari University of Auckland, Nasser Giacaman The University of Auckland, Valerio Terragni University of Auckland Pre-print
10:45 15m		Are All Code Reviews the Same? Identifying and Assessing the Impact of Merge Request Deviations Research Papers Track Samah Kansab Software Engineering Departement, Ecole de Technologie Supérieure (ETS) - Québec University, Francis Bordeleau École de Technologie Supérieure (ETS), Ali Tizghadam TELUS Pre-print
11:00 15m		A Taxonomy of Inefficiencies in LLM-Generated Code Research Papers Track Altaf Allah Abbassi Polytechnique Montreal, Leuson Da Silva Polytechnique Montreal, Amin Nikanjam Huawei Canada, Foutse Khomh Polytechnique Montréal
11:15 15m		Automated Code Review At Ericsson Using Large Language Models: An Experience Report Industry Track Shweta Ramesh Ericsson, Joy Bose Ericsson, Hamender Singh Ericsson R&D, Raghavan Ak Ericsson, Sujoy Roychowdhury Ericsson, Giriprasad Sridhara Ericsson, Nishrith Saini Ericsson, Ricardo Britto Ericsson / Blekinge Institute of Technology Pre-print
11:30 15m		AskGraph: A Dependency-Aware Code Assistant Powered by Code Graphs and LLM-Generated Cypher Queries Industry Track Nan Yang TNO-ESI, Joseph Reynolds TNO-ESI, Laurens Prast TNO-ESI, Rosilde Corvino TNO-ESI
11:45 15m		AI Mentor System: Building A Technical Debt Dashboard For Low Code Industry Track Alexandre Lemos OutSystems, Joana Coutinho OutSystems