SANER 2026
Tue 17 - Fri 20 March 2026 Limassol, Cyprus
Wed 18 Mar 2026 15:00 - 15:15 at Megaron Gamma - Session 2C - Testing and Analysis Chair(s): Simin Sun

Performance bugs are non-functional defects that significantly impact software performance. Identifying such bugs can be challenging, as they are typically harder to detect than other types of software defects and often require specialized expertise that may not be readily available within software organizations.

Previous approaches have attempted to automate performance bug detection using static analysis or traditional machine learning models trained on static code metrics. However, despite the growing potential and widespread adoption of Large Language Models (LLMs) for automating various software engineering tasks, no studies have directly investigated their capabilities in detecting performance bugs.

In this paper, we aim to fill this gap by exploring the potential of LLMs–such as CodeLlama, Qwen, and Artigenz–for detecting performance bugs directly from source code. We focus on zero-shot and few-shot prompting, as well as supervised fine-tuning, to evaluate these models using Java code from open-source projects with labeled performance bugs. Our results highlight the limitations of current LLMs in this domain: they achieved low F1 scores, with few-shot prompting providing only marginal improvements over zero-shot configurations, and fine-tuning yielding slight gains.

Wed 18 Mar

Displayed time zone: Athens change

14:00 - 15:30
Session 2C - Testing and AnalysisResearch Track / Tool Demo Track / Early Research Achievement (ERA) Track / Industrial Track / Reproducibility Studies and Negative Results (RENE) Track at Megaron Gamma
Chair(s): Simin Sun Chalmers University of Technology and University of Gothenburg
14:00
15m
Talk
PiCo: Privacy-preserving Code Sanitization for Cloud-based LLMs
Research Track
Xinyuan Zhang Sun Yat-sen University, Yuhong Nan Sun Yat-sen University, Jiequan Zheng Sun Yat-sen University, Jiangrong Wu Sun Yat-sen University, Yixi Lin Sun Yat-sen University, Zibin Zheng Sun Yat-sen University
14:15
15m
Talk
Coverage-Guided Road Selection and Prioritization for Efficient Testing in Autonomous Driving Systems
Research Track
Qurban Ali University of Milano-Bicocca, Andrea Stocco Technical University of Munich, fortiss, Leonardo Mariani University of Milano-Bicocca, Oliviero Riganelli University of Milano - Bicocca
Pre-print
14:30
15m
Talk
CloudFix: Automated Policy Repair for Cloud Access Control Policies Using Large Language Models
Research Track
Bethel Hall Stevens Institute of Technology, USA, Owen Ungaro Stevens Institute of Technolgoy, William Eiers
Pre-print
14:45
15m
Talk
Search-based Testing for an Autonomous Delivery Robots Scheduler
Industrial Track
Thomas Laurent Lero@Trinity College Dublin, Paolo Arcaini National Institute of Informatics , Fuyuki Ishikawa National Institute of Informatics
15:00
15m
Talk
An Empirical Investigation on the use of Large Language Models for Performance Bug Detection
Reproducibility Studies and Negative Results (RENE) Track
Muhammad Imran Università degli Studi dell'Aquila, Vittorio Cortellessa University of L'Aquila, Davide Di Ruscio University of L'Aquila, Riccardo Rubei Malardalen University, Luca Traini University of L'Aquila
15:15
7m
Talk
WasmWeaver: A Framework for Runtime-Aware WebAssembly Program Generation with Reinforcement Learning
Tool Demo Track
Kilian Müller Friedrich-Alexander University Erlangen-Nürnberg (FAU), Siddharth Mane , Peter Wägemann Friedrich-Alexander University Erlangen-Nürnberg (FAU), Norman Franchi
15:22
7m
Talk
An Agentic AI Framework for Conflict-Aware Smart Home Automation via Natural Language
Early Research Achievement (ERA) Track
Sayyada Aisha Mehvish Toronto Metropolitan University, Manar Alalfi Toronto Metropolitan University