Developing Multi-Agent LLM Applications through Continuous Human-LLM Co-Programming (CAIN 2025 - Research and Experience Papers)

Who

Hui Song, Arda Goknil, Xiaojun Jiang, Espen Melum, Hyunwhan Joe, Caterina Gazzotti, Valerio Frascolla, Adela Nedisan Videsjorden, Phu Nguyen

Track

CAIN 2025 Research and Experience Papers

Time Zone

Times are shown in the conference time zone: (GMT-04:00) Eastern Time (US & Canada).

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Sun 27 Apr 2025 11:55 - 12:05 at 208 - Engineering AI systems with LLMs. Chair(s): Justus Bogner

Abstract

The rapid advancement of Large Language Models (LLMs) has opened new possibilities for intelligent multi-agent systems capable of autonomously performing complex tasks. To build such multi-agent systems, developers can leverage LLMs for task-solving, tool interaction, and code generation, but they must manage the models' costs and unpredictability. This experience paper introduces COPMA, a model-based approach to enabling continuous human-LLM co-programming of multi-agent LLM applications. COPMA uses "feature-block" models to track application features and their implementations as agents and code blocks. Supported by co-programming patterns, it guides developers in constructing, refining, and refactoring feature implementations through trial and error with LLM agents, leveraging their feedback, suggestions, and code examples. The patterns guide the shift of feature implementations between agents and code to balance flexibility, predictability, and cost. Our experience in developing LLM agents for collecting and reviewing medical research papers demonstrates that human-LLM co-programming can reduce development effort and achieve stable behavior, enabling rapid prototyping of multi-agent LLM applications.

Hui Song

SINTEF Digital

Norway

Arda Goknil

SINTEF Digital

Xiaojun Jiang

Oslo University Hospital

Espen Melum

Oslo University Hospital

Hyunwhan Joe

Seoul National University

Caterina Gazzotti

University of Modena

Valerio Frascolla

Intel

Adela Nedisan Videsjorden

SINTEF

Phu Nguyen