Keynote1: Trust No Bot? Forging Confidence in AI for Software Engineering
Abstract:
The truth is out there… and so is the AI revolution. Foundation models and AI-driven tools are transforming software engineering, offering unprecedented efficiencies while introducing new uncertainties. As developers, we find ourselves in uncharted territory: these tools promise to accelerate productivity and reshape our workflows, but can we really trust them? Like any good investigator, we must question the systems we rely on. Are AI-based tools reliable, transparent, and aligned with developer needs? Or are they inscrutable black boxes with hidden risks? Trust isn’t just a nice-to-have—it’s the key factor determining whether AI integration succeeds or spirals into skepticism. In this keynote, I will uncover the evolving role of AI in software engineering and explore how we can build, measure, and foster trust in these tools. I will also reveal why the FORGE community is uniquely positioned to lead this charge, ensuring that AI becomes a trusted partner—not an unsolved mystery. After all, when it comes to AI in software development… should we trust no bot? (This abstract came to life with a little help from ChatGPT and a lot of love for The X-Files.)

Prof. Thomas Zimmermann
Thomas Zimmermann is a Chancellor's Professor and Donald Bren Chair at the University of California, Irvine. He works on cutting-edge research and innovation in data science, machine learning, software engineering, and digital games. He has over 15 years of experience in the field, with more than 100 publications that have been cited over 30,000 times. His research mission is to empower software developers and organizations to build better software and services with AI. He is best known for his pioneering work on systematic mining of software repositories and his empirical studies of software development in industry. He has contributed to several Microsoft products and tools, such as Visual Studio, GitHub, and Xbox. He is an ACM Fellow, an IEEE Fellow, and recipient of the IE. Further details can be found on:
https://thomas-zimmermann.com/
Keynote 2

Prof. Prem Devanbu, the “father” of Naturalness of Code, is a Distinguished Research Professor on the Faculty of the Computer Science Department at the University of California at Davis. He works in the areas of empirical software engineering, and Software Engineering applications of ML. Devanbu was elected ACM Fellow in 2018 and has received multiple awards, including the 2021 ACM SIGSOFT Outstanding Research Award, the 2024 IEEE Harlan Mills Award, and the 2022 Humboldt Research Award.
Further details can be found on: https://www.cs.ucdavis.edu/~devanbu/
Keynote 3

Prof. Graham Neubig, the “father” of OpenDevin, is an Associate Professor at the Carnegie Mellon University, Language Technology Institute in the School of Computer Science. He leads NeuLab and is also chief scientist at All Hands AI. His research focuses on machine learning and natural language processing. In particular, he is interested in basic research and applications of large language models, with a particular focus on question answering, code generation, multilingual processing, and evaluation/interpretability. Further details can be found on: https://www.phontron.com/
Industry Keynote 1

Darya Rovdo, based in The Hague, NL, is a Machine Learning Engineer at JetBrains. With a background in software engineering, she understands the development process from both perspectives - building software and enhancing it with AI. Her main focus is on making product features as effective and useful as possible, favouring simple, practical solutions over unnecessary complexity. Further details can be found on: https://nl.linkedin.com/in/darya-rovdo-85aa9111a
Industry Keynote 2

Dong Qiu is currently a Director of Waterloo Research Centre. His research interests span software analysis and testing, regression testing and monitoring in web services & SOA, database applications and programming languages. Further details can be found on: https://dong-qiu.github.io/
Dates
Tracks
This program is tentative and subject to change.
Sun 27 AprDisplayed time zone: Eastern Time (US & Canada) change
Sun 27 Apr
Displayed time zone: Eastern Time (US & Canada) change
09:00 - 10:30 | |||
09:00 10mDay opening | Introduction from The Chairs Keynotes | ||
09:10 60mKeynote | Keynote Keynotes Prem Devanbu University of California at Davis |
11:00 - 12:30 | |||
11:00 60mKeynote | Keynote Keynotes Graham Neubig Carnegie Mellon University | ||
12:00 30mPanel | Panel Discussion Panel |
12:30 - 14:00 | |||
14:00 - 15:30 | |||
14:00 12mLong-paper | PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback Research Papers Yun Peng The Chinese University of Hong Kong, Akhilesh Deepak Gotmare Salesforce Research, Michael Lyu The Chinese University of Hong Kong, Caiming Xiong Salesforce Research, Silvio Savarese Salesforce Research, Doyen Sahoo Salesforce Research | ||
14:12 12mLong-paper | RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion Research Papers Huy Nhat Phan FPT Software AI Center, Hoang Nhat Phan Nanyang Technological University, Tien N. Nguyen University of Texas at Dallas, Nghi D. Q. Bui Salesforce Research | ||
14:24 12mLong-paper | SoTaNa: An Open-Source Software Engineering Instruction-Tuned Model Research Papers Ensheng Shi Xi’an Jiaotong University, Yanlin Wang Sun Yat-sen University, Fengji Zhang Microsoft Research Asia, Bei Chen Microsoft Research Asia, Hongyu Zhang Chongqing University, yanli wang Sun Yat-sen University, Daya Guo Sun Yat-sen University, Lun Du Microsoft Research, Shi Han Microsoft Research, Dongmei Zhang Microsoft Research, Hongbin Sun Xi’an Jiaotong University | ||
14:36 12mLong-paper | Automated Codebase Reconciliation using Large Language Models Research Papers Aneri Gandhi University of Toronto, Sanjukta De Advanced Micro Devices, Marsha Chechik University of Toronto, Vinay Pandit Advanced Micro Devices, Max Kiehn Advanced Micro Devices, Matthieu Chan Chee Advanced Micro Devices, Yonas Bedasso Advanced Micro Devices | ||
14:48 12mLong-paper | AI-Powered, But Power-Hungry? Energy Efficiency of LLM-Generated Code Research Papers Lola Solovyeva University of Twente, Sophie Weidmann University of Twente, Fernando Castor University of Twente | ||
15:00 6mShort-paper | HyRACC: A Hybrid Retrieval-Augmented Framework for More Efficient Code Completion Research Papers Chuanyi Li Nanjing University, Jiwei Shang Nanjing University, Yi Feng Nanjing University, Bin Luo Nanjing University | ||
15:06 6mShort-paper | OptCodeTrans: Boost LLMs on Low-Resource Programming Language Translation Research Papers Jianbo Lin Nanjing University, Yi Shen Nanjing University, Chuanyi Li Nanjing University, Changan Niu Software Institute, Nanjing University, Bin Luo Nanjing University | ||
15:12 6mShort-paper | SwiftEval: Developing a Language-Specific Benchmark for LLM-generated Code Evaluation Data and Benchmarking | ||
15:18 6mShort-paper | SE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering Research Papers Zhimin Zhao Queen's University |
16:00 - 17:30 | |||
16:00 12mLong-paper | Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements Research Papers | ||
16:12 12mLong-paper | Benchmarking Prompt Engineering Techniques for Secure Code Generation with GPT Models Research Papers Marc Bruni University of Applied Sciences and Arts Northwestern Switzerland, Fabio Gabrielli University of Applied Sciences and Arts Northwestern Switzerland, Mohammad Ghafari TU Clausthal, Martin Kropp University of Applied Sciences and Arts Northwestern Switzerland Pre-print | ||
16:24 12mLong-paper | ELDetector: An Automated Approach Detecting Endless-loop in Mini Programs Research Papers Nan Hu Xi’an Jiaotong University, Ming Fan Xi'an Jiaotong University, Jingyi Lei Xi'an Jiaotong University, Jiaying He Xi'an Jiaotong University, Zhe Hou China Mobile System Integration Co. | ||
16:36 12mLong-paper | Testing Android Third Party Libraries with LLMs to Detect Incompatible APIs Research Papers Tarek Mahmud Texas State University, bin duan University of Queensland, Meiru Che Central Queensland University, Anne Ngu Texas State University, Guowei Yang University of Queensland | ||
16:48 12mLong-paper | Vulnerability-Triggering Test Case Generation from Third-Party Libraries Research Papers Yi Gao Zhejiang University, Xing Hu Zhejiang University, Zirui Chen , Tongtong Xu Nanjing University, Xiaohu Yang Zhejiang University | ||
17:00 6mShort-paper | Microservices Performance Testing with Causality-enhanced Large Language Models Research Papers Cristian Mascia University of Naples Federico II, Roberto Pietrantuono Università di Napoli Federico II, Antonio Guerriero Università di Napoli Federico II, Luca Giamattei Università di Napoli Federico II, Stefano Russo Università di Napoli Federico II | ||
17:06 6mShort-paper | MaRV: A Manually Validated Refactoring Dataset Data and Benchmarking Henrique Gomes Nunes Universidade Federal de Minas Gerais, Tushar Sharma Dalhousie University, Eduardo Figueiredo Federal University of Minas Gerais | ||
17:12 6mShort-paper | PyResBugs: A Dataset of Residual Python Bugs for Natural Language-Driven Fault Injection Data and Benchmarking Domenico Cotroneo University of Naples Federico II, Giuseppe De Rosa University of Naples Federico II, Pietro Liguori University of Naples Federico II | ||
17:18 6mShort-paper | The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Data and Benchmarking Jonathan Katzy Delft University of Technology, Răzvan Mihai Popescu Delft University of Technology, Arie van Deursen TU Delft, Maliheh Izadi Delft University of Technology |
Mon 28 AprDisplayed time zone: Eastern Time (US & Canada) change
Mon 28 Apr
Displayed time zone: Eastern Time (US & Canada) change
09:00 - 10:30 | |||
09:00 60mKeynote | Keynote: Trust No Bot? Forging Confidence in AI for Software Engineering Keynotes Thomas Zimmermann University of California, Irvine | ||
10:00 12mLong-paper | AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology Research Papers Minh Nguyen Huynh FPT Software AI Center, Thang Phan Chau FPT Software AI Center, Phong X. Nguyen FPT Software AI Center, Nghi D. Q. Bui Salesforce Research | ||
10:12 12mLong-paper | Enhancing Pull Request Reviews: Leveraging Large Language Models to Detect Inconsistencies Between Issues and Pull Requests Research Papers Ali Tunahan Işık Bilkent University, Hatice Kübra Çağlar Bilkent University, Eray Tüzün Bilkent University |
14:00 - 15:30 | |||
14:00 45mKeynote | Industry Keynote Keynotes Dong Qiu Huawei Technologies | ||
14:45 45mKeynote | Industry Keynote Keynotes Darya Rovdo JetBrains |
16:00 - 17:30 | |||
16:00 45mTutorial | Beyond Code Generation: Evaluating and Improving LLMs for Code Intelligence Tutorials Fatemeh Hendijani Fard University of British Columbia | ||
16:45 12mLong-paper | A Comprehensive Study of Bug Characteristics on Foundation Language Models Research Papers Junxiao Han , Guanqi Wang Zhejiang University, Jiakun Liu Singapore Management University, Lingfeng Bao Zhejiang University, Xing Hu Zhejiang University, Jinling Wei Hangzhou City University, Shuiguang Deng Zhejiang University; Alibaba-Zhejiang University Joint Institute of Frontier Technologies | ||
16:57 12mLong-paper | Cyber-Attack Detection and Localization for SCADA system of CPSs Research Papers Dan Li Sun Yat-sen University, Junnan Tang Sun Yat-Sen University, Shunyu Wu Sun Yat-Sen University, Zibin Zheng Sun Yat-sen University, See-Kiong Ng National University of Singapore | ||
17:09 12mLong-paper | Testing Refactoring Engine via Historical Bug Report driven LLM Research Papers Pre-print | ||
17:21 9mDay closing | Closing Session Keynotes |
Accepted Papers
Title | |
---|---|
Closing Session Keynotes |