CAIN 2025
Sun 27 - Mon 28 April 2025 Ottawa, Ontario, Canada
co-located with ICSE 2025
Sun 27 Apr 2025 11:55 - 12:05 at 208 - Engineering AI systems with LLMs Chair(s): Justus Bogner

The rapid advancement of Large Language Models (LLMs) has opened new possibilities for intelligent multi-agent systems capable of autonomously performing complex tasks. To build such multi-agent systems, developers can leverage LLMs for task-solving, tool interaction, and code generation but should manage their costs and unpredictability. This experience paper introduces COPMA, a model-based approach to enabling continuous human-LLM co-programming of multi-agent LLM applications. COPMA uses ``feature-block" models to track application features and their implementations as agents and code blocks. Supported by co-programming patterns, it guides developers in constructing, refining, and refactoring feature implementations through trial-and-errors with LLM agents, leveraging their feedback, suggestions, and code examples. The patterns guide the shift of feature implementations between agents and code to balance flexibility, predictability, and cost. Our experience in developing LLM agents for collecting and reviewing medical research papers demonstrates that human-LLM co-programming can reduce development effort and achieve stable behavior to enable rapid prototyping of multi-agent LLM applications

Sun 27 Apr

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
Engineering AI systems with LLMsResearch and Experience Papers at 208
Chair(s): Justus Bogner Vrije Universiteit Amsterdam
11:00
15m
Talk
Rule-Based Assessment of Reinforcement Learning Practices Using Large Language ModelsDistinguished paper Award Candidate
Research and Experience Papers
Evangelos Ntentos University of Vienna, Stephen John Warnett University of Vienna, Uwe Zdun University of Vienna
11:15
10m
Talk
Designing and implementing LLM guardrails components in production environments
Research and Experience Papers
11:25
15m
Talk
Themes of Building LLM-based Applications for Production: A Practitioner's View
Research and Experience Papers
Alina Mailach Leipzig University, Sebastian Simon Leipzig University, Johannes Dorn Leipzig University, Norbert Siegmund Leipzig University
Pre-print
11:40
15m
Talk
InsightAI: Root Cause Analysis in Large Hierarchical Log Files with Private Data Using Large Language Models
Research and Experience Papers
Maryam Ekhlasi Polytechnique Montreal, Anurag Prakash Ciena, Michel Dagenais Polytechnique Montréal, Maxime Lamothe Polytechnique Montreal
11:55
10m
Talk
Developing Multi-Agent LLM Applications through Continuous Human-LLM Co-Programming
Research and Experience Papers
Hui Song SINTEF Digital, Arda Goknil SINTEF Digital, Xiaojun Jiang Oslo University Hospital, Espen Melum Oslo University Hospital, Hyunwhan Joe Seoul National University, Caterina Gazzotti University of Modena, Valerio Frascolla Intel, Adela Nedisan Videsjorden SINTEF, Phu Nguyen SINTEF
12:05
25m
Other
Discussion
Research and Experience Papers

:
:
:
: