Sun 12 Apr
Displayed time zone: Brasilia, Distrito Federal, Brazil
08:00 - 17:30 | Sunday Registration | ICSE Social, Networking and Special Rooms at Main Entrance | Registration for ICSE 2026. | ||
08:00 9h30mRegistration | ICSE 2026 Registration ICSE Social, Networking and Special Rooms | ||
09:00 - 10:30 | Opening + Keynote (Prof. Zhi Jin) | FORGE Program / Keynotes at Oceania I | Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Xing Hu Zhejiang University, Christoph Treude Singapore Management University | ||
09:00 15mDay opening | Opening Message from the Chairs Keynotes | ||
09:15 75mKeynote | Keynote: How to Enable Large Language Models to Accelerate the Process of Requirements Understanding and Analysis Keynotes Zhi Jin Peking University, Wuhan University | ||
10:30 - 11:00 | Sunday Morning Break | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | This break will provide an opportunity for networking and relaxation between sessions. | ||
10:30 30mCoffee break | Break ICSE Catering | ||
11:00 - 12:30 | Session I - Testing, Quality, and Refactoring | Research Papers / Data and Benchmarking at Oceania I | Chair(s): Rosalia Tufano Università della Svizzera Italiana | ||
11:00 12mTalk | Backporting in Robot Operating System: Identifying Commit Purpose and Propagation Need with Large Language Models Research Papers Pankaj Manoharlal Thakur North Carolina State University, Kyle Thomson North Carolina State University, Wesley K.G. Assunção North Carolina State University, Bowen Xu North Carolina State University | ||
11:12 6mTalk | PatchGPT: Multi-Agent Patch Backporting without Model Fine-Tuning Research Papers Ye Liu Singapore Management University, Ruidong Han Singapore Management University, Chengyan Ma Singapore Management University, Yuqing Niu, David Lo Singapore Management University, Ye Liu Nanyang Technological University | ||
11:18 12mTalk | From Human to Machine Refactoring: Assessing GPT-4’s Impact on Python Class Quality and Readability Research Papers Alessandro Midolo University of Catania, Emiliano Tramontana University of Catania, Massimiliano Di Penta University of Sannio, Italy Pre-print | ||
11:30 12mTalk | XMENTOR: A Rank-Aware Aggregation Approach for Human-Centered Explainable AI in Just-in-Time Software Defect Prediction Research Papers Saumendu Roy University of Saskatchewan, Banani Roy University of Saskatchewan, Chanchal K. Roy University of Saskatchewan, Richard Bassey University of Saskatchewan | ||
11:42 12mTalk | Exploring the Potential of Large Language Models in Simulink-Stateflow Mutant Generation Research Papers Pablo Valle Mondragon University, Shaukat Ali Simula Research Laboratory and Oslo Metropolitan University, Aitor Arrieta Mondragon University Pre-print | ||
11:54 6mTalk | Tricky²: Towards a Benchmark for Evaluating Human and LLM Error Interactions Data and Benchmarking Cole Granger William & Mary, Dipin Khati William & Mary, Daniel Rodriguez-Cardenas William & Mary, Denys Poshyvanyk William & Mary | ||
12:00 12mTalk | Reasoning about Multi-hop Fault Propagation with LLM Agents for Root Cause Analysis in Cloud-based Systems Research Papers Evelien Riddell University of Waterloo, James Riddell University of Waterloo, Gengyi Sun University of Waterloo, Michal Antkiewicz University of Waterloo, Canada, Krzysztof Czarnecki University of Waterloo, Canada Pre-print | ||
12:12 12mTalk | Execution-free Agentic Program Repair for Enterprise-Scale Development Research Papers Saurabh Bodhe Advanced Micro Devices (AMD), Sanjukta De Advanced Micro Devices, Subhayan Roy Advanced Micro Devices (AMD), Jaydip Pokiya Advanced Micro Devices (AMD), Indira Vats University of Toronto; Advanced Micro Devices (AMD), Sehajpreet Kaur Advanced Micro Devices (AMD), Monica Li Advanced Micro Devices (AMD), Lejin Varghese Advanced Micro Devices (AMD), Max Kiehn Advanced Micro Devices (AMD), Yonas Bedasso Advanced Micro Devices | ||
12:30 - 14:00 | Sunday Lunch | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | Lunch time with a variety of meal options available for attendees, including vegetarian choices. This session will provide an opportunity for attendees to enjoy a meal while networking with colleagues and discussing the day’s events. | ||
12:30 90mLunch | Lunch ICSE Catering | ||
14:00 - 15:30 | Session II - Human-AI Collaboration, Multi-agent Systems, & Benchmarking | Data and Benchmarking / Research Papers at Oceania I | Chair(s): Massimiliano Di Penta University of Sannio, Italy | ||
14:00 12mTalk | Reporting LLM Prompting in Automated Software Engineering: A Guideline Based on Current Practices and Expectations Research Papers Alexander Korn University of Duisburg-Essen, Lea Zaruchas University of Cologne, Chetan Arora Monash University, Andreas Metzger paluno – The Ruhr Institute for Software Technology, University of Duisburg-Essen, Sven Smolka University of Duisburg-Essen, Fanyu Wang Monash University, Andreas Vogelsang paluno – The Ruhr Institute for Software Technology, University of Duisburg-Essen | ||
14:12 12mTalk | Impacts of Generative AI on Agile Teams' Productivity: A Multi-Case Longitudinal Study Research Papers Rafael Tomaz Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Paloma Guenes Pontifical Catholic University of Rio de Janeiro (PUC-Rio) / University of Bari (UniBa), Allysson Allex Araújo Federal University of Cariri, Maria Teresa Baldassarre Department of Computer Science, University of Bari, Marcos Kalinowski Pontifical Catholic University of Rio de Janeiro (PUC-Rio) Pre-print | ||
14:24 6mTalk | Visual Loop: Bridging the Cognitive Gap in Software Development Through Visual-AI Collaboration Research Papers Luis F. Gomes Carnegie Mellon University, Xin Zhou Singapore Management University, Singapore, David Lo Singapore Management University, Rui Abreu Faculty of Engineering of the University of Porto, Portugal | ||
14:30 12mTalk | AutoReSpec: A Framework for Generating Specification using Large Language Models Research Papers Pre-print Media Attached | ||
14:42 6mTalk | CodeViz: Collaborative Multi-Agent System for Analytical and Visualization Tasks in Data Science Research Papers Sai Sanjna Chintakunta Pennsylvania State University, Nathalia Nascimento Pennsylvania State University, Everton Guimaraes Pennsylvania State University | ||
14:48 6mTalk | SEMODS: A Validated Dataset of Open-Source Software Engineering Models Data and Benchmarking Alexandra González Universitat Politècnica de Barcelona - BarcelonaTech (UPC), Xavier Franch Universitat Politècnica de Catalunya, Silverio Martínez-Fernández UPC-BarcelonaTech DOI Pre-print | ||
14:54 6mTalk | Towards Comprehensive Benchmarking Infrastructure for LLMs In Software Engineering Data and Benchmarking Daniel Rodriguez-Cardenas William & Mary, Xiaochang Li William & Mary, Marcos Macedo Queen's University, Antonio Mastropaolo William and Mary, USA, Dipin Khati William & Mary, Yuan Tian Queen's University, Kingston, Ontario, Huajie Shao College of William & Mary, Denys Poshyvanyk William & Mary | ||
15:00 6mTalk | OmniBench-RAG: A Multi-Domain Evaluation Platform for Retrieval-Augmented Generation Tools Data and Benchmarking Jiaxuan Liang Huazhong University of Science and Technology, China, Shide Zhou Huazhong University of Science and Technology, Kailong Wang Huazhong University of Science and Technology | ||
15:30 - 16:00 | Sunday Afternoon Break | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | Afternoon Break with a variety of beverages and snacks available for attendees. This break will provide an opportunity for networking and relaxation between sessions. | ||
15:30 30mCoffee break | Break ICSE Catering | ||
18:30 - 22:00 | |||
18:30 3h30mDinner | Informal FORGE Dinner FORGE Program | ||
Mon 13 Apr
Displayed time zone: Brasilia, Distrito Federal, Brazil
08:00 - 17:30 | Monday Registration | ICSE Social, Networking and Special Rooms at Main Entrance | Registration for ICSE 2026. | ||
08:00 9h30mRegistration | ICSE 2026 Registration ICSE Social, Networking and Special Rooms | ||
09:00 - 10:30 | Keynote (Prof. Michael Pradel) + Awards | FORGE Program / Keynotes at Oceania I | Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana | ||
09:00 75mKeynote | Keynote: Natural Language as Specification: LLM-Based Validation of Software Evolution Keynotes Michael Pradel CISPA Helmholtz Center for Information Security | ||
10:15 15mAwards | Awards session Keynotes | ||
10:30 - 11:00 | Monday Morning Break | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | This break will provide an opportunity for networking and relaxation between sessions. | ||
10:30 30mCoffee break | Break ICSE Catering | ||
12:30 - 14:00 | Monday Lunch | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | Lunch time with a variety of meal options available for attendees, including vegetarian choices. This session will provide an opportunity for attendees to enjoy a meal while networking with colleagues and discussing the day’s events. | ||
12:30 90mLunch | Lunch ICSE Catering | ||
14:00 - 15:30 | Industry Keynotes II | FORGE Program / Keynotes at Oceania I | Chair(s): Christoph Treude Singapore Management University | ||
14:00 45mKeynote | Lingxi - Towards Domain Knowledge Guided Repo-level Code Generation Practice in Huawei Keynotes Kui Liu Huawei | ||
14:45 45mKeynote | Same Brain, Superpowers: Programming in the Age of AI Keynotes Chris Parnin Microsoft | ||
15:30 - 16:00 | Monday Afternoon Break | ICSE Catering at Catering and Exhibition Hall (Europa I to IV) | Afternoon Break with a variety of beverages and snacks available for attendees. This break will provide an opportunity for networking and relaxation between sessions. | ||
15:30 30mCoffee break | Break ICSE Catering | ||
16:00 - 17:00 | Session IV - Security & Privacy | Research Papers / Data and Benchmarking at Oceania I | Chair(s): Zhou Yang University of Alberta, Alberta Machine Intelligence Institute | ||
16:00 12mTalk | PIChecker: Automatic Privacy Detector for Third Party Libraries in Android Apps Research Papers | ||
16:12 12mTalk | DynamicsLLM: a Dynamic Analysis-based Tool for Generating Intelligent Execution Traces Using LLMs to Detect Android Behavioural Code Smells Research Papers Houcine Abdelkader Cherief Ecole de Technologie Supérieure, Florent AVELLANEDA Université du Québec à Montréal, Naouel Moha École de Technologie Supérieure (ETS) | ||
16:24 6mTalk | MIRROR: A Dataset of Structural Metrics for Repackaged Android Apps Data and Benchmarking | ||
16:30 12mTalk | MalCVE: Malware Detection and CVE Association Using Large Language Models Research Papers Eduard Andrei Cristea Norwegian University of Science and Technology, Petter Molnes Norwegian University of Science and Technology, Jingyue Li Norwegian University of Science and Technology (NTNU) | ||
16:42 6mTalk | Finding Missing Input Validation in TEEs via LLM-Assisted Symbolic Execution Research Papers Chengyan Ma Singapore Management University, Jieke Shi Singapore Management University, Ruidong Han Singapore Management University, Ye Liu Singapore Management University, Yuqing Niu, David Lo Singapore Management University Pre-print | ||
16:48 6mTalk | Jailbreaking Large Language Models via Multi-Task Embedding-based Prompt Research Papers Songrui Li Tianjin University, Hanmo You Tianjin University, Jiajun Jiang Tianjin University | ||
17:00 - 17:15 | |||
17:00 15mDay closing | Closing & FORGE 2027 FORGE Program | ||
20:00 - 23:00 | Social Event for Co-located Conferences | ICSE Social, Networking and Special Rooms at Rio Scenarium | Co-located event participants are invited to join us at Rio Scenarium for an informal evening with live Brazilian music, food, drinks, and great company in the heart of Lapa, a traditional samba region in Rio. Buses depart from the conference venue starting at 18:00. | ||
20:00 3hDinner | Social Event for Co-located Conferences ICSE Social, Networking and Special Rooms | ||
Accepted Papers
| Title | |
|---|---|
| COMPASS: A Psychometrics-Guided Multi-Dimensional Benchmark for Code Generation Evaluation Data and Benchmarking Pre-print | |
| MiG.4: A Curated Dataset of Library Migrations in Java and Python Data and Benchmarking | |
| MIRROR: A Dataset of Structural Metrics for Repackaged Android Apps Data and Benchmarking | |
| OmniBench-RAG: A Multi-Domain Evaluation Platform for Retrieval-Augmented Generation Tools Data and Benchmarking | |
| PromiseAwait: A Dataset of JavaScript Migrations from Promises to Async/Await Data and Benchmarking | |
| SEMODS: A Validated Dataset of Open-Source Software Engineering Models Data and Benchmarking DOI Pre-print | |
| Towards Comprehensive Benchmarking Infrastructure for LLMs In Software Engineering Data and Benchmarking | |
| Tricky²: Towards a Benchmark for Evaluating Human and LLM Error Interactions Data and Benchmarking | |
| VHDL-Instruct: Training Open Dataset for LLMs Benchmarking and HDL Code Generation Data and Benchmarking Media Attached | |
Call for Papers
High-quality datasets and robust evaluation frameworks are essential for the advancement of foundation models (FMs). The Benchmarking Track at FORGE 2026 provides a dedicated forum for publishing rigorous research on machine learning datasets and benchmarking methodologies that go beyond conventional evaluation metrics. In particular, this track encourages work that advances the state of the art in data quality, reproducibility, and benchmarking standards, with a focus on enabling the development and assessment of FMs for software engineering (SE).
Scope
This track welcomes two types of submissions in the context of software engineering: (1) Data papers and (2) Benchmarking papers.
- Data Papers
  Contributions may include:
  - New datasets, or thoughtfully designed (collections of) datasets built from previously available data.
  - Data generators and reinforcement learning environments.
  - Data-centric AI methods and tools (e.g., methods to measure/improve data quality, or studies that bring new insights).
  - Advanced practices in data collection and curation, even if the data itself cannot be shared.
  - Frameworks for responsible dataset development, dataset audits, or identifying significant issues in existing datasets.
- Benchmarking Papers
  Contributions may include:
  - New benchmarks, metrics, or benchmarking tools.
  - Systematic analyses of existing systems on novel datasets that yield new insights.
Evaluation Criteria
- Data Papers
  - Value, usefulness, and reusability of the dataset or tools.
  - Quality and clarity of presentation.
  - Clear positioning within related work and relevance to SE.
  - Accessibility of datasets/tools (e.g., data and required code should be findable, usable, and open source where possible).
- Benchmarking Papers
  - Relevance of the proposed benchmark to the FORGE audience.
  - Originality of the ideas.
  - Clarity and quality of presentation.
  - Usefulness and outreach of the results, tools, or datasets.
Submission Instructions
- Length: Maximum of 4 pages, plus 1 page of references.
- Artifacts: Authors are strongly encouraged to share (anonymized/curated) data and artifacts to foster reproducibility, though this is not mandatory. If artifacts are not shared, justification is expected.
- Format: All submissions must be in PDF format and conform, at time of submission, to the official “ACM Primary Article Template”, which can be obtained from the ACM Proceedings Template page. LaTeX users should use the sigconf option, as well as the review (to produce line numbers for easy reference by the reviewers) and anonymous (omitting author names) options. To that end, the following LaTeX code can be placed at the start of the LaTeX document:
  `\documentclass[sigconf,review,anonymous]{acmart}`
- Reviewing: Double-anonymous. Remove author names and affiliations; cite your prior work in the third person. For guidance, see the ICSE Research Track Q&A on double-anonymous reviewing.
- Language: All papers must be in English. Use the HotCRP format checker before submission. Minor warnings (e.g., small fonts in figures/footnotes) will not cause rejection if the main text complies.
- Accessibility: Authors should ensure papers are accessible to people with disabilities. See SIGACCESS guidelines.
Submission site: https://forge26-benchmarking.hotcrp.com/
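As an illustration of the Format requirement, a minimal submission skeleton might look like the sketch below. Only the `\documentclass` line comes from the instructions above; the title, author block, section names, and the `references` bibliography file are placeholders assumed for the example, and the official ACM template remains the authoritative starting point.

```latex
% Class options as given in the submission instructions:
%   sigconf   - ACM conference proceedings layout
%   review    - adds line numbers for reviewers
%   anonymous - suppresses author names in the output
\documentclass[sigconf,review,anonymous]{acmart}

\begin{document}

\title{Placeholder Title of the Submission}

% Placeholder author block; with the anonymous option it is not printed.
\author{Placeholder Author}
\affiliation{%
  \institution{Placeholder Institution}
  \country{Placeholder Country}}

% In acmart the abstract goes before \maketitle.
\begin{abstract}
Placeholder abstract.
\end{abstract}

\maketitle

\section{Introduction}
Placeholder text.

% Assumes a hypothetical references.bib next to this file.
\bibliographystyle{ACM-Reference-Format}
\bibliography{references}

\end{document}
```

Dropping the `review` and `anonymous` options is typically only done for the camera-ready version, following whatever instructions accompany acceptance.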
Important dates (AOE)
- Full paper submission deadline: Friday, Nov 21, 2025
- Author notification: Friday, Dec 26, 2025
- Camera-ready deadline: Friday, Jan 23, 2026