AGENT 2026 - International Workshop on Agentic Engineering (AGENT 2026)

Agentic engineering is an emerging discipline focused on the design, development, and operation of systems that exhibit goal-directed autonomy. Foundation models (FMs), such as large language models (LLMs), have been accelerating progress in this area across academia and industry.

Agentic systems often involve multiple interacting agents, humans, and tools, requiring rigorous system-level engineering to ensure critical qualities like robustness, safety, and observability. A key design challenge in agentic engineering is the growing capability of FMs/LLMs. Developers must decide whether to rely on the FM/LLM or external tools/systems for the same functionality. These decisions can be made at various stages depending on the problem and context: during design time, development time, or even at runtime from a software engineering perspective, and at pre-training time, post-training time, test/inference time, and post-inference time from an AI perspective. Highly autonomous agentic systems also require continuous monitoring, evaluation, observability, intervention, and oversight after deployment—an emerging discipline referred to as AgentOps.

While workshops on agentic or multi-agent systems have appeared at AI-focused venues, they typically emphasize theoretical modeling, multi-agent learning, or coordination protocols. In contrast, this workshop is situated within the software engineering community and addresses concrete engineering methods, design trade-offs, and operational practices needed to develop and maintain agentic systems built on foundation models.

We also recognize that agentic engineering builds on foundational work from the agent-oriented software engineering (AOSE) community. However, the emergence of foundation models introduces new challenges around autonomy, tool integration, prompt-driven behavior, and post-deployment adaptation. This workshop seeks to update and re-contextualize those principles to address the design and assurance of modern agentic systems, particularly those grounded in large-scale pretrained models.

This workshop will provide a forum for exploring engineering methods, techniques, and tools for agentic systems in general and agentic systems for software engineering in particular. It will bring together researchers and practitioners to share insights, innovations, and real-world experiences in the design, development, and operation of agentic systems.

Time Zone

The program is currently displayed in (GMT-03:00) Brasilia, Distrito Federal, Brazil.

Use conference time zone: (GMT-03:00) Brasilia, Distrito Federal, BrazilSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

You're viewing the program in a time zone which is different from your device's time zone change time zone

Tue 14 Apr
Displayed time zone: Brasilia, Distrito Federal, Brazil change

08:00 - 17:30	Tuesday Quiet RoomSocial, Networking and Special Rooms at Capri V Quiet Room for you to relax or work in a peaceful environment during ICSE 2026.

08:00 9h30m Other		Quiet Room Social, Networking and Special Rooms

08:00 - 19:00	Tuesday RegistrationSocial, Networking and Special Rooms at Main Entrance Registration for ICSE 2026.

08:00 11h Registration		ICSE 2026 Registration Social, Networking and Special Rooms

09:00 - 12:30	Tuesday Morning Child CareSocial, Networking and Special Rooms at Ibiza III Child Care services available during ICSE 2026 to support attendees with children.

09:00 3h30m Other		Child Care Social, Networking and Special Rooms

09:00 - 10:30	Keynotes and Lightning TalksAGENT at Oceania VIII Chair(s): Qinghua Lu Data61, CSIRO

09:00 8m Day opening		Opening AGENT
09:08 30m Keynote		Keynote: AI in SE: The Era of Coding Agents AGENT K: Satish Chandra Meta Platforms, Inc.
09:38 30m Keynote		Keynote: Agentic Software Engineering Will Eat the World: AI-Based Systems as the New Operating System of Society AGENT K: Robert Feldt Chalmers \| University of Gothenburg Media Attached File Attached
10:08 2m Talk		Toward Agentic Software Engineering Beyond Code: Framing Vision, Values, and Vocabulary AGENT Rashina Hoda Monash University
10:10 2m Talk		Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment AGENT Aylton Almeida UFMG, Marco Tulio Valente Brazil, Laerte Xavier Pontifical Catholic University of Minas Gerais (PUC-Minas)
10:12 2m Talk		A Catalogue of Evaluation Metrics for LLM-Based Multi-Agent Frameworks in Software Engineering AGENT Ingrid Lima State University of Ceará, Brazil, Vitor Linhares State University of Ceará, Brazil, Anderson Martins Gomes State University of Ceará, Brazil, Paulo Maia State University of Ceará
10:14 2m Talk		From Business Meetings to Requirement Artifacts: An Agentic AI Approach with MARARE AGENT Malik Sami Tampere University, Gessé Evangelista Federal University of São Carlos, Kai-Kristian Kemell Tampere University, Luciana Zaina Federal University of São Carlos, Muhammad Waseem Faculty of Information Technology and Communication Sciences, Tampere University, 33014 Tampere, Finland, Zheying Zhang Tampere University, Pekka Abrahamsson Tampere University
10:16 2m Talk		AgenticAKM : Enroute to Agentic Architecture Knowledge Management AGENT Rudra Dhar IIIT Hyderabad, Karthik Vaidhyanathan IIIT Hyderabad, Vasudeva Varma IIIT-H
10:18 2m Talk		Agents — The Missing Piece in the Cognitive Cloud Continuum AGENT Sergio Moreschini Tampere University, Diyaz Yakubov Tampere University, Antti Kolehmainen Tampere University, David Hastbacka Tampere University
10:20 2m Talk		Toward Author-Guided Review: An Agentic Architecture for Reflective Code Review AGENT Nicole Davila Universidade Federal do Rio Grande do Sul, Igor Wiese Federal University of Technology
10:22 2m Talk		CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening AGENT GIULIANO LORENZONI University of Waterloo, Ivens da Silva Portugal University of Waterloo, Paulo Alencar University of Waterloo, Donald Cowan University of Waterloo
10:24 2m Talk		GeoAIAgent-Agentic workflow for automated geospatial data management AGENT Leonardo Pondian Tizzei IBM Research, Gabrielle Nyirjesy IBM Research, Levente Klein IBM Research, Ildar Khabibrakhmanov IBM Research, Maciel Zortea IBM Research, Hiyam Debary IBM Research, Mustansar Fiaz IBM Research, James Barry IBM Research, João Lucas de Sousa Almeida IBM Research, Theodore van Kessel IBM Research, Segev Shlomov IBM Research, Juan Bernabe Moreno IBM Research Media Attached
10:26 2m Talk		Meta-RAG on Large Codebases Using Code SummarizationNo Record AGENT Vali Tawosi J.P. Morgan AI Research, Salwa Alamir J.P. Morgan AI Research, Xiaomo Liu J.P. Morgan AI Research, Manuela Veloso J.P. Morgan AI Research Media Attached
10:28 2m Talk		Decoding the Configuration of AI Coding Agents: Insights from Claude Code Projects AGENT Hélio Victor Flexa dos Santos UFMG, Vitor Costa UFMG, João Eduardo Montandon Universidade Federal de Minas Gerais (UFMG), Marco Tulio Valente Federal University of Minas Gerais, Brazil

10:30 - 11:00	Tuesday Morning BreakCatering at Catering and Exhibition Hall (Europa I to IV) This break will provide an opportunity for networking and relaxation between sessions.

10:30 30m Coffee break		Break Catering

11:00 - 12:30	Agent Architectures and CoordinationAGENT at Oceania VIII Chair(s): Jayani Samaraweera University of Victoria

11:00 30m Keynote		Keynote: Towards Self-Evolving Software Intelligence AGENT K: Lingming Zhang University of Illinois at Urbana-Champaign
11:30 6m Talk		ContReAct: A Feedback-Based Architecture for Continuous Agentic Operation AGENT Stefan Szeider Vienna University of Technology (TU Wien)
11:36 6m Talk		AgentFixer: From Failure Detection to Fix Recommendations in Agentic Systems AGENT Hadar Mulian IBM Research, Sergey Zeltyn IBM Research, Ido Levy IBM Research, Liane Galanti IBM Research, Avi Yaeli IBM Research, Segev Shlomov IBM Research
11:42 6m Talk		TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation AGENT Ming-Tung Shen , Yuh-Jzer Joung National Taiwan University
11:48 6m Talk		LLM-X: A Scalable Negotiation-Oriented Exchange for Communication Among Personal LLM Agents AGENT GIULIANO LORENZONI University of Waterloo, Paulo Alencar University of Waterloo, Donald Cowan University of Waterloo
11:54 6m Talk		From Stateless Code Generation to Living Ontologies: Business Autonomous Entities in Action AGENT Anderson Martins Gomes State University of Ceará, Brazil, Paulo Maia State University of Ceará, Vitor Linhares State University of Ceará, Brazil, Ingrid Lima State University of Ceará, Brazil
12:00 6m Talk		AVATAR-AGENT: A Multi-Agent System for Expressive 3D Avatar GenerationNo Record AGENT Jason Ding Roblox, Rohan Gangaraju Roblox, Krishna C. Garikipati Roblox, Foad Dabiri Roblox, Chang Xu Roblox
12:06 24m Live Q&A		Session 2 Joint Q&A AGENT

12:30 - 14:00	Tuesday LunchCatering at Catering and Exhibition Hall (Europa I to IV) Lunch time with a variety of meal options available for attendees, including vegetarian choices. This session will provide an opportunity for attendees to enjoy a meal while networking with colleagues and discussing the day’s events.

12:30 90m Lunch		Lunch Catering

14:00 - 17:00	Tuesday Afternoon Child CareSocial, Networking and Special Rooms at Ibiza III Child Care services available during ICSE 2026 to support attendees with children.

14:00 3h Other		Child Care Social, Networking and Special Rooms

14:00 - 15:30	Evaluation, Reliability, and Engineering PracticeAGENT at Oceania VIII Chair(s): Oshani Weerakoon Department of Computing, University of Turku

14:00 30m Keynote		Keynote: On the Evaluation of AI Coding Agents AGENT K: Chao Peng ByteDance
14:30 6m Talk		Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems AGENT Sreemaee Akshathala IIIT Hyderabad, Bassam Adnan IIIT Hyderabad, Mahisha Ramesh IIIT Hyderabad, Karthik Vaidhyanathan IIIT Hyderabad, Basil Muhammed MontyCloud, Kannan Parthasarathy MontyCloud
14:36 6m Talk		PerfBench: Can Agents Resolve Real-World Performance Bugs? AGENT Spandan Garg Microsoft Corporation, Roshanak Zilouchian Moghaddam Microsoft, Neel Sundaresan Microsoft Pre-print Media Attached
14:42 6m Talk		SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs AGENT Arihant Tripathy IIIT Hyderabad, India, Ch Pavan Harshit IIIT Hyderabad, India, Karthik Vaidhyanathan IIIT Hyderabad
14:48 6m Talk		Context Matters: Evaluating MCP-Based Context-Aware AI — A Case Study of Email Communications in Nonprofit Organizations AGENT Nitin Gupta University of Victoria, Jayani Samaraweera University of Victoria, Raaj Chatterjee Meaningful Technology Inc., Riya Shrestha University of Victoria, Trinity West University of Victoria, Dana Damian University of Victoria
14:54 6m Talk		Toward Agentic Software Project Management: A Vision and Roadmap AGENT Lakshana Assalaarachchi Monash University, Australia, Zainab Masood Prince Sultan University, Rashina Hoda Monash University, John Grundy Monash University
15:00 6m Talk		Not All Problems Are Nails, Not All Tools Should Be Hammers: A Position Paper on Agent Usage in Software Engineering Tasks AGENT Juuso Rytilahti Department of Computing, University of Turku, Panu Puhtila University of Turku, Oshani Weerakoon Department of Computing, University of Turku, Erkki Kaila Department of Computing, University of Turku, Tuomas Mäkilä University of Turku
15:06 24m Live Q&A		Session 3 Joint Q&A AGENT

15:30 - 16:00	Tuesday Afternoon BreakCatering at Catering and Exhibition Hall (Europa I to IV) Afternoon Break with a variety of beverages and snacks available for attendees. This break will provide an opportunity for networking and relaxation between sessions.

15:30 30m Coffee break		Break Catering

16:00 - 17:30	Applications, Experience Reports, and PanelAGENT at Oceania VIII Chair(s): Qinghua Lu Data61, CSIRO

16:00 30m Keynote		Keynote: Lessons from the Frontlines: Deploying AI Agents in the Real World AGENT K: Gustavo Soares Microsoft
16:30 6m Talk		Building LLM-Based Voice Agents for Requirements Elicitation: An Experience Report on Early Prototypes AGENT Oshani Weerakoon Department of Computing, University of Turku, Tuomas Mäkilä University of Turku, Erkki Kaila Department of Computing, University of Turku, Shola Oyedeji LUT University
16:36 6m Talk		An Agentic System for LLM-Driven Public Transportation Analytics: A Practical Application and Case Study in Salvador-Brazil AGENT Lucas Teixeira Borges Federal University of Bahia, Fei T. Liu Artificial General Intelligence Pty Ltd., Tatiane Rios Federal University of Bahia, Marcos V. Ferreira Federal University of Bahia, Clovis Carmo Federal University of Bahia, Danilo B. Coimbra Federal University of Bahia, Jorge Nery Federal University of Bahia, Matheus Souza Integra - Association of Transport Companies of Salvador, Noe O. Garcia Arcadis, Albert Bifet University of Waikato, Institut Polytechnique de Paris, RICARDO RIOS Federal University of Bahia
16:42 6m Talk		AI in Insurance: Adaptive Questionnaires for Improved Risk Profiling AGENT Diogo Silva Deloitte and Faculty of Engineering, University of Porto, João Teixeira Deloitte, Bruno Lima LIACC, Faculty of Engineering, University of Porto Pre-print
16:48 6m Talk		Progressive Gated Co-Teaching for Weakly Supervised Deepfake Detection AGENT Rui Lang FEIT-UTS, Guangsheng Yu University of Technology Sydney, Qin Wang CSIRO Data61, Xu Wang Global Big Data Technologies Centre
16:54 11m Live Q&A		Session 4 Joint Q&A AGENT
17:05 20m Live Q&A		Panel Discussion AGENT Gustavo Soares Microsoft, Chao Peng ByteDance, Satish Chandra Meta Platforms, Inc.
17:25 5m Day closing		Closing AGENT

18:00 - 22:00	ICSE Steering Committee MeetingMeetings and BOF Events at Capri IV Dinner will be included for members.

18:00 4h Meeting		ICSE Steering Committee Meeting Meetings and BOF Events Arie van Deursen TU Delft, Margaret-Anne Storey University of Victoria

19:00 - 21:00	ICSE ReceptionSocial, Networking and Special Rooms at Catering and Exhibition Hall (Europa I to IV) A reception for all attendees to network and socialize. Join us for an evening of fun and connection at ICSE 2026!

19:00 2h Meeting		ICSE Reception Social, Networking and Special Rooms

19:00 - 21:00	ICSE Newcomer ReceptionSocial, Networking and Special Rooms at Europa II A special reception to welcome newcomers to ICSE 2026. Join us for an evening of networking and fun!

19:00 2h Meeting		ICSE Newcomer Reception Social, Networking and Special Rooms

Accepted Papers

	Title
	A Catalogue of Evaluation Metrics for LLM-Based Multi-Agent Frameworks in Software Engineering AGENT Ingrid Lima, Vitor Linhares, Anderson Martins Gomes, Paulo Maia
	AgentFixer: From Failure Detection to Fix Recommendations in Agentic Systems AGENT Hadar Mulian, Sergey Zeltyn, Ido Levy, Liane Galanti, Avi Yaeli, Segev Shlomov
	AgenticAKM : Enroute to Agentic Architecture Knowledge Management AGENT Rudra Dhar, Karthik Vaidhyanathan, Vasudeva Varma
	Agents — The Missing Piece in the Cognitive Cloud Continuum AGENT Sergio Moreschini, Diyaz Yakubov, Antti Kolehmainen, David Hastbacka
	AI in Insurance: Adaptive Questionnaires for Improved Risk Profiling AGENT Diogo Silva, João Teixeira, Bruno Lima Pre-print
	An Agentic System for LLM-Driven Public Transportation Analytics: A Practical Application and Case Study in Salvador-Brazil AGENT Lucas Teixeira Borges, Fei T. Liu, Tatiane Rios, Marcos V. Ferreira, Clovis Carmo, Danilo B. Coimbra, Jorge Nery, Matheus Souza, Noe O. Garcia, Albert Bifet, RICARDO RIOS
	AVATAR-AGENT: A Multi-Agent System for Expressive 3D Avatar GenerationNo Record AGENT Jason Ding, Rohan Gangaraju, Krishna C. Garikipati, Foad Dabiri, Chang Xu
	Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems AGENT Sreemaee Akshathala, Bassam Adnan, Mahisha Ramesh, Karthik Vaidhyanathan, Basil Muhammed, Kannan Parthasarathy
	Building LLM-Based Voice Agents for Requirements Elicitation: An Experience Report on Early Prototypes AGENT Oshani Weerakoon, Tuomas Mäkilä, Erkki Kaila, Shola Oyedeji
	Context Matters: Evaluating MCP-Based Context-Aware AI — A Case Study of Email Communications in Nonprofit Organizations AGENT Nitin Gupta, Jayani Samaraweera, Raaj Chatterjee, Riya Shrestha, Trinity West, Dana Damian
	ContReAct: A Feedback-Based Architecture for Continuous Agentic Operation AGENT Stefan Szeider
	CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening AGENT GIULIANO LORENZONI, Ivens da Silva Portugal, Paulo Alencar, Donald Cowan
	Decoding the Configuration of AI Coding Agents: Insights from Claude Code Projects AGENT Hélio Victor Flexa dos Santos, Vitor Costa, João Eduardo Montandon, Marco Tulio Valente
	From Business Meetings to Requirement Artifacts: An Agentic AI Approach with MARARE AGENT Malik Sami, Gessé Evangelista, Kai-Kristian Kemell, Luciana Zaina, Muhammad Waseem, Zheying Zhang, Pekka Abrahamsson
	From Stateless Code Generation to Living Ontologies: Business Autonomous Entities in Action AGENT Anderson Martins Gomes, Paulo Maia, Vitor Linhares, Ingrid Lima
	GeoAIAgent-Agentic workflow for automated geospatial data management AGENT Leonardo Pondian Tizzei, Gabrielle Nyirjesy, Levente Klein, Ildar Khabibrakhmanov, Maciel Zortea, Hiyam Debary, Mustansar Fiaz, James Barry, João Lucas de Sousa Almeida, Theodore van Kessel, Segev Shlomov, Juan Bernabe Moreno Media Attached
	LLM-X: A Scalable Negotiation-Oriented Exchange for Communication Among Personal LLM Agents AGENT GIULIANO LORENZONI, Paulo Alencar, Donald Cowan
	Meta-RAG on Large Codebases Using Code SummarizationNo Record AGENT Vali Tawosi, Salwa Alamir, Xiaomo Liu, Manuela Veloso Media Attached
	Not All Problems Are Nails, Not All Tools Should Be Hammers: A Position Paper on Agent Usage in Software Engineering Tasks AGENT Juuso Rytilahti, Panu Puhtila, Oshani Weerakoon, Erkki Kaila, Tuomas Mäkilä
	PerfBench: Can Agents Resolve Real-World Performance Bugs? AGENT Spandan Garg, Roshanak Zilouchian Moghaddam, Neel Sundaresan Pre-print Media Attached
	Progressive Gated Co-Teaching for Weakly Supervised Deepfake Detection AGENT Rui Lang, Guangsheng Yu, Qin Wang, Xu Wang
	SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs AGENT Arihant Tripathy, Ch Pavan Harshit, Karthik Vaidhyanathan
	TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation AGENT Ming-Tung Shen, Yuh-Jzer Joung
	Toward Agentic Software Engineering Beyond Code: Framing Vision, Values, and Vocabulary AGENT Rashina Hoda
	Toward Agentic Software Project Management: A Vision and Roadmap AGENT Lakshana Assalaarachchi, Zainab Masood, Rashina Hoda, John Grundy
	Toward Author-Guided Review: An Agentic Architecture for Reflective Code Review AGENT Nicole Davila, Igor Wiese
	Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment AGENT Aylton Almeida, Marco Tulio Valente, Laerte Xavier

Call for Papers

Topics of interest include, but are not limited to:

Requirements engineering for agentic systems
Architectural design for agentic systems
Verification, validation, and testing of agentic systems
AgentOps – DevOps for agentic systems
Development processes and lifecycle management for agentic systems
Evaluation methodologies, tools, and benchmarks for agentic systems
Responsible AI and AI safety of agentic systems
Agentic systems for software engineering, including requirements, design, coding, testing, deployment, and operations
Human-agent interaction, collaboration, and oversight
Risk and impact assessment (e.g. economic/social impact)
Real-world case studies and practical experiences in different domains

The workshop will be highly interactive, including invited keynotes or talks, and paper presentations for different topics in the area of agentic engineering.

Submission Guidelines

We invite the following types of submissions:

Full papers (research or experience): up to 8 pages
Short papers (research or experience): up to 4 pages
Extended abstracts: up to 5 pages. These are free of APC (article processing charge).

All submissions must be in English, in PDF format, and must not exceed the page limits (including references and appendices) listed above. The workshop follows a single-anonymous review process. Submissions should be made via HotCRP. Note this year the official “ACM Primary Article Template” should be used for submissions, which can be obtained from the ACM Proceedings Template page. LaTeX users should use the sigconf option and the review (to produce line numbers for easy reference by the reviewers) options. To that end, the following LaTeX code can be placed at the start of the LaTeX document: \documentclass[sigconf,review]{acmart}

Other detailed submission policies and formatting guidelines are aligned with the ICSE 2026 Research Track submission process.

The official publication date of the workshop proceedings is the date the proceedings are made available by ACM. This date may be up to two weeks prior to the first day of ICSE 2026. The official publication date affects the deadline for any patent filings related to published work.

Authors of selected accepted papers will be invited to submit extended versions of their work for consideration in the IEEE Software Special Issue on Engineering Agentic Systems.

Satish Chandra

Research Scientist, Meta

AI in SE: The Era of Coding Agents

Abstract

Coding agents have taken the world by storm. I will start this talk with a brief history of how we arrived here, beginning with the early days of using AI in software engineering. I will share experiences on working on AI for software engineering, and more specifically, on coding agents at two large companies. I will conclude by discussing how we can control agent behavior and what are some of the interesting problems in this space for software engineering researchers.

Brief Bio

Satish Chandra is a research scientist at Meta, where he applies machine learning techniques to improve developer productivity. His work has spanned many areas of programming languages and software engineering, including program analysis, type systems, software synthesis, bug finding and repair, software testing and, of course, application of AI to software development. His research has been widely published in leading conferences in his field. Satish Chandra obtained a PhD from the University of Wisconsin-Madison, and a B.Tech from the Indian Institute of Technology-Kanpur, both in computer science. He is an ACM Fellow and an elected member of WG 2.4.

Lingming Zhang

Associate Professor, University of Illinois Urbana-Champaign

Towards Self-Evolving Software Intelligence

Abstract

In recent years, Large Language Models (LLMs) have shown impressive performance across a wide range of downstream applications, including software engineering. This talk explores the evolution and recent trends of software engineering agents, featuring our recent work on Live-SWE-agent and Self-Play SWE-RL. We will discuss how these recent efforts enable live coding LLMs and software agents capable of autonomous and continuous self-improvement, paving the way toward next-level software intelligence.

Bio

Lingming Zhang is an associate professor at University of Illinois Urbana-Champaign. His research lies at the intersection of Software Engineering and Machine Learning. His group has pioneered a series of work on LLM-based software testing, repair, and synthesis (such as TitanFuzz, AlphaRepair, and Agentless), and also released multiple open code LLMs (including the recent SWE-RL, PurpCode, and Code World Model), with millions of downloads worldwide. Their techniques for training and improving code LLMs or agents have been adopted by leading AI companies, including Meta, Google, OpenAI, MiniMax, and DeepSeek.

Gustavo Soares

Principal Research Manager, Microsoft

Lessons from the Frontlines: Deploying AI Agents in the Real World

Abstract

What happens when AI agents leave the lab and land in the hands of real users? In this talk, I’ll share lessons from building and deploying agentic systems like GitHub Copilot and Excel’s M365 Copilot—where writing code is cheap, but testing and debugging are hard; prompts and tools age fast; and users don’t always know what to ask for. We’ll explore what’s working, what’s not, and why testing, trust, and thoughtful design matter more than ever. Along the way, I’ll offer a few critiques of our own assumptions—and a look at where we go from here.

Bio

Gustavo Soares is a Principal Research Manager at Microsoft, where he leads the Excel Agent Science team and drives research and product innovation at the intersection of agentic AI, large language models, and human–AI interaction. His work centers on the design, evaluation, and deployment of intelligent agents that can plan, reason, and act within complex productivity tools, with a strong emphasis on reliability, usability, and real‑world adoption at global scale. Previously, his research on agentic AI contributed to the development of GitHub Copilot agents for debugging and migration scenarios, and his work on modeless program synthesis helped lay the foundation for Visual Studio IntelliCode Suggestions, a capability used by millions of developers to automate repetitive and large‑scale code transformations. He received his Ph.D. in Computer Science from the Federal University of Campina Grande, Brazil; his doctoral thesis was recognized with the CAPES Thesis Award for the best Ph.D. thesis in Computer Science in Brazil, and he is also a recipient of the ACM John Vlissides Award.

Robert Feldt

Professor of Software Engineering, Chalmers University of Technology

Agentic Software Engineering Will Eat the World: AI-Based Systems as the New Operating System of Society

Abstract

AI-based and LLM-based systems are often cast as a threat to software engineering: coding agents grow more capable, junior roles shrink or disappear, and end-user programming appears ready to bypass experts altogether. This talk argues for a different conclusion. The deeper shift is not mainly about tools or tasks, but about organizational operating logic: how decisions are made, how knowledge moves, how work is coordinated, and how value is created. As these systems become more agentic, they will not merely support that shift; they will increasingly help analyze, adapt, and redesign it.

This change does not diminish the importance of software engineering. It broadens it. More of society will come to depend on semi-executable systems built from natural language, code, tools, policies, and workflows, while agentic environments increasingly generate, coordinate, and evolve many of those components. I take seriously concerns about reliability, maintainability, organizational inertia, and power. But these are better seen as constraints on the route than as arguments against the direction itself. Even imperfect AI can be transformative when deployed continuously at scale, and the central activities of software engineering are more likely to be redefined than replaced. For researchers, this creates a real risk that some AI4SE work targets practices already being folded into agentic workflows. For practitioners, it means that engineering discipline becomes more important, not less, as solution creation extends beyond traditional software developers.

Slides and paper

Slides: Agentic Software Engineering Will Eat the World: AI-based Systems as the new OS of Society
Paper: The Semi-Executable Stack: Agentic Software Engineering and the Expanding Scope of SE

Bio

Robert Feldt is professor of software engineering at Chalmers University of Technology, Sweden, and at Blekinge Institute of Technology, Sweden. He has broad research interests spanning from human factors to hardcore automation and statistics, and including software testing and quality, requirements engineering, human-centred (behavioural) software engineering. He was an early contributor to search-based software engineering and have recently argued for increased application of psychology and social science to better understand and improve software engineering. Most of his research is empirical and conducted in close collaboration with industry partners in Sweden, Europe and Asia but he also leads more basic research. He received a PhD in computer engineering from the Chalmers University of Technology in 2002, studied psychology at Gothenburg University in the ’90s and has also worked as an IT and software consultant for more than 25 years. He is passionate about empirical research and methods and to change organisations through technological innovation.

Chao Peng

Principal Research Scientist, ByteDance

On the Evaluation of AI Coding Agents

Abstract

AI coding agents are rapidly evolving and capable of performing complex software engineering tasks. As these systems move toward real-world deployment, rigorous and meaningful evaluation becomes increasingly critical for understanding their capabilities, limitations, and risks. In this talk, I present a series of evaluation methods that aim to bridge this gap, covering code generation, question answering and agentic systems. I will discuss lessons learned from building realistic evaluation frameworks and outline future directions toward trustworthy and effective AI software engineering agents.

Bio

Dr. Chao Peng is a Principal Research Scientist at ByteDance, where he leads the Software Engineering Lab focusing on AI agents for software engineering. His research interests include software testing, program repair and optimisation, as well as their synergy with machine learning and compiler techniques. His work has been published in premier venues such as ICSE, FSE, ASE, ACL, and NeurIPS. He received the Distinguished Reviewer Award at FSE 2025.

International Workshop on Agentic Engineering (AGENT 2026)AGENT 2026

Program Display Configuration

Tue 14 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change

Accepted Papers

Call for Papers

Submission Guidelines

Keynote Speakers

Qinghua LuCo-chair

Data61, CSIRO

Australia

Ahmed E. HassanCo-chair

Queen’s University

Canada

David LoCo-chair

Singapore Management University

Singapore

Liming ZhuCo-chair

CSIRO’s Data61

Australia

Boming XiaWeb & Publicity Chair

Responsible AI Research (RAIR) Centre, Adelaide University

Australia

Yue LiuProceedings Chair

Australian National University

Australia

Bram Adams

Queen's University

Canada

Shaukat Ali

Simula Research Laboratory and Oslo Metropolitan University

Norway

Angela An

Deakin University

Australia

Scott Barnett

Applied Artificial Intelligence Initiative, Deakin University

Australia

Tingting Bi

The University of Melbourne

Australia

Jan Bosch

Chalmers University of Technology

Sweden

Lionel Briand

University of Ottawa, Canada; Lero centre, University of Limerick, Ireland

Canada

Di Cao

Swinburne University of Technology

Jieshan Chen

CSIRO's Data61

Australia

Jifeng Chen

The University of New South Wales Canberra

Jane Cleland-Huang

University of Notre Dame

United States

Liming Dong

CSIRO's Data61

Australia

Rodrigo Falcão

Fraunhofer IESE

Keheliya Gallaba

Centre for Software Excellence, Huawei Canada

Canada

Jennifer Horkoff

Chalmers and the University of Gothenburg

Sweden

Zhen Ming (Jack) Jiang

York University

Canada

Yasutaka Kamei

Kyushu University

Japan

Foutse Khomh

Polytechnique Montréal

Canada

Sung Une (Sunny) Lee

CSIRO's Data61

Australia

Grace Lewis

Tue 14 Apr
Displayed time zone: Brasilia, Distrito Federal, Brazil change