LLM4JMH: Studying the Use of LLMs for Generating Java Performance Microbenchmarks
This program is tentative and subject to change.
Performance regressions pose a significant threat to software reliability. In recent years, smaller-scale performance testing, such as performance microbenchmarks, has been adopted in practice to detect such regressions in an early stage of development. Developing these microbenchmarks is costly, error-prone, and demands specialized expertise. Motivated by the recent progress of large language models (LLMs) in code-related tasks, we study whether LLMs can step in as performance experts to automate the generation of microbenchmarks for performance regression detection. While existing approaches typically depend on functional unit tests to generate performance microbenchmarks, we instead design an LLM-based approach, \emph{LLM4JMH}, to assess the capability of LLMs in generating reliable and effective performance microbenchmarks directly from source code. We further explore how program analysis techniques, such as static analysis, can enhance the reliability and effectiveness of LLM-generated performance microbenchmarks. Experimental results show that LLM-generated performance microbenchmarks achieve comparable bug detection effectiveness to expert-written microbenchmarks, while reducing overall expected detection latency by up to 52.60% across \emph{RxJava}, \emph{Eclipse Collections}, and \emph{Zipkin}. In the study, the generated tests can successfully identify six out of 11 real-world performance bugs in \emph{Apache Flink}. These results demonstrate that LLM-based performance microbenchmark generation can automate early performance regression detection in continuous integration pipelines of the software development process and reduce reliance on expert-crafted tests, advancing performance-aware software engineering.
This program is tentative and subject to change.
Wed 15 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
14:00 - 15:30 | Dependability and Security 2Research Track / Journal-first Papers / New Ideas and Emerging Results (NIER) at Oceania X | ||
14:00 15mTalk | TraceCaps: Inline Provenance and Risk Enforcement for Agentic Software Engineering New Ideas and Emerging Results (NIER) Andre Catarino Faculty of Engineering, University of Porto, Claudia Mamede Carnegie Mellon University, Rui Melo Carnegie Mellon University & FEUP, Rui Maranhao Abreu University of Lisbon | ||
14:15 15mTalk | Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks Journal-first Papers | ||
14:30 15mTalk | PenForge: On-the-Fly Expert Agent Construction for Automated Penetration Testing New Ideas and Emerging Results (NIER) Huihui Huang Singapore Management University, Singapore, Jieke Shi Singapore Management University, Junkai Chen Singapore Management University, Singapore, Ting Zhang Monash University, Yikun Li Singapore Management University, Chengran Yang Singapore Management University, Singapore, Eng Lieh Ouh Singapore Management University, Singapore, Lwin Khin Shar Singapore Management University, David Lo Singapore Management University | ||
14:45 15mTalk | Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs Journal-first Papers Samuele Pasini Università della Svizzera italiana, Jinhan Kim Università della Svizzera italiana, Tommaso Aiello SAP Security Research, Rocio Cabrera Lozoya SAP Security Research, Antonino Sabetta SAP, Paolo Tonella USI Lugano | ||
15:00 15mTalk | LLM4JMH: Studying the Use of LLMs for Generating Java Performance Microbenchmarks Research Track Zongxiong Chen Fraunhofer FOKUS, Derui Zhu Technical University of Munich, Kundi Yao Ontario Tech University, Weiyi Shang University of Waterloo, Jinfu Chen Wuhan University, Jiahui Geng Mohamed bin Zayed University of Artificial Intelligence, Alexander Pretschner TU Munich, Jens Grossklags Technical University of Munich, Manfred Hauswirth Fraunhofer FOKUS, Sonja Schimmler Fraunhofer FOKUS & TU Berlin | ||
15:15 15mTalk | RulePilot: An LLM-Powered Agent for Security Rule Generation Research Track Hongtai Wang National University of Singapore, Ming Xu Shanghai Jiao Tong University / National University of Singapore, Yanpei Guo National University of Singapore, Weili Han Fudan University, Hoon Wei Lim Cyber Special Ops-R&D, NCS Group, Jin Song Dong National University of Singapore | ||