Enhancing Python Code Maintainability through Large Language Model-Based Approaches
This program is tentative and subject to change.
While Large Language Models (LLMs) increasingly assist in code generation, concerns persist regarding the main- tainability of the code they produce—an aspect often overshad- owed by functional correctness. Overlooking maintainability can contribute to technical debt and inflate long-term software costs. This research investigates whether targeted fine-tuning can enhance an LLM’s ability to generate more maintainable Python code. We developed a approach involving the curation of cus- tom datasets (from CommitPackFT and Code Alpaca Python subsets) specifically annotated for maintainability using met- rics like Source Lines of Code (SLOC), Halstead Effort, and Maintainability Index (MI). A weak-to-strong generalization strategy was employed, using a smaller model (Phi 4 14B) to generate maintainability-focused examples for fine-tuning a larger model (QwenCoder2.5 32B Instruct) with parameter- efficient techniques. Evaluations revealed the fine-tuned model significantly reduced code complexity (Halstead Effort) and length (SLOC) compared to the original code samples. While the model preserved high functional similarity (verified by CodeBERTScore), results for the Maintainability Index metric were inconclusive in this evaluation. Performance on standard functional correctness benchmarks (HumanEval+, MBPP+) was largely comparable to the base model. Nevertheless, expert user feedback confirmed the fine- tuned model’s utility as a practical AI companion for code refactoring to improve maintainability.
This program is tentative and subject to change.
Tue 2 DecDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
09:30 - 11:00 | |||
09:30 15mTalk | Enhancing Python Code Maintainability through Large Language Model-Based Approaches Research Papers | ||
09:45 15mTalk | Enhancing Software Maintainability through LLM-Assisted Code Refactoring Research Papers Tommaso Fulcini Politecnico di Torino, Riccardo Coppola Politecnico di Torino, Flavio Giobergia Politecnico di Torino, Amirali Changizi Politecnico di Torino, Meelad Dashti Politecnico di Torino, Kimia Dorrani Politecnico di Torino, Domenico Amalfitano University of Naples Federico II, Damiano Distante UnitelmaSapienza University of Rome, Filippo Ricca DIBRIS, Università di Genova | ||
10:00 15mTalk | Temporal Evolution of Architectural Complexity and Technical Debt in Microservices: An Exploratory Case Study Research Papers Bhuwan Paudel Blekinge Institute of Technology, Javier Gonzalez-Huerta Blekinge Institute of Technology, Ehsan Zabardast Nordea / Blekinge Institute of Technology | ||
10:15 15mTalk | Detecting Technical Debt in Source Code Changes using Large Language Models Research Papers Merve Astekin SINTEF, Arda Goknil SINTEF Digital, Sagar Sen , Simeon Tverdal SINTEF Digital, Phu Nguyen SINTEF | ||
10:30 7mTalk | LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code Short Papers and Posters Shahbaz Siddeeq Tampere University, Muhammad Waseem Faculty of Information Technology and Communication Sciences, Tampere University, 33014 Tampere, Finland, Zeeshan Rasheed Tampere University, Md Mahade Hasan Tampere University, Jussi Rasku Tampere University, Mika Saari Tampere University, Henri Terho Eficode Oy, Kalle Mäkelä Eficode Oy, Kai-Kristian Kemell Tampere University, Pekka Abrahamsson Tampere University | ||
10:37 7mTalk | Architecture Degradation at Scale: Challenges and Insights from Practice Short Papers and Posters Ehsan Zabardast Nordea / Blekinge Institute of Technology, Bhuwan Paudel Blekinge Institute of Technology, Javier Gonzalez-Huerta Blekinge Institute of Technology | ||
10:44 7mTalk | How Well Small Language Models Can Be Adapted for Software Maintenance and Refactoring Tasks Short Papers and Posters Gabija Asvydyte University of Groningen, Sushant Kumar Pandey University of Groningen, The Netherlands, Sivajeet Chand Technical University of Munich | ||