Automated Exploration of Conversational Agents for the Synthesis of Testing Profiles (ICTSS 2025 - General Track)

Who

Iván Sotillo del Horno, Alejandro del Pozzo, Esther Guerra, Juan de Lara

Track

ICTSS 2025 General Track

Time Zone

The program is currently displayed in (GMT+03:00) Athens.

Use conference time zone: (GMT+03:00) AthensSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 19 Sep 2025 14:30 - 15:00 at Atrium C - LLMs and Agent-Based Testing Chair(s): Jørn Eirik Betten

Abstract

Conversational agents – or chatbots – are increasingly being used to access all sorts of services, like citizen services in city halls, customer support, or shopping. Moreover, recent advances in generative artificial intelligence are prompting the integration of conversational assistants into many applications, like programming IDEs, office automation software, or operating systems. Given the prominence of these agents, their correctness is a rising concern. However, automated and robust testing techniques for conversational systems are still needed.

In this paper, we present a technique for extracting a model of a deployed chatbot (i.e., treated as a black-box) through the automated exploration of its functionality via Large Language Models. This model is used for automated testing by generating testing conversation profiles, which a user simulator employs to conduct focused conversations with the chatbot-under-test. We describe our tool support, and report on an evaluation showing that our exploration technique can accurately model the chatbot-under-test, and the subsequent testing can discover existing errors in the chatbot.

Link to Preprint

https://miso.es/pubs/ICTSS25.pdf

Iván Sotillo del Horno

Universidad Autónoma de Madrid

Spain

Alejandro del Pozzo

Universidad Autónoma de Madrid

Spain

Esther Guerra

Universidad Autónoma de Madrid

Spain

Juan de Lara

Autonomous University of Madrid

Spain

Deployed tool

Time Zone

The program is currently displayed in (GMT+03:00) Athens.

Use conference time zone: (GMT+03:00) AthensSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Fri 19 Sep
Displayed time zone: Athens change

14:00 - 15:30	LLMs and Agent-Based TestingGeneral Track at Atrium C Chair(s): Jørn Eirik Betten Simula Research Laboratory; Oslo Metropolitan University

14:00 30m Talk		Reverse Engineering for Input Modeling: Input Parameter Model Inference from Network Traces General Track Manuel Leithner SBA Research, Salzburg University of Applied Sciences, Dimitris E. Simos Salzburg University of Applied Sciences, Paris LodronUniversity of Salzburg
14:30 30m Talk		Automated Exploration of Conversational Agents for the Synthesis of Testing Profiles General Track Iván Sotillo del Horno Universidad Autónoma de Madrid, Alejandro del Pozzo Universidad Autónoma de Madrid, Esther Guerra Universidad Autónoma de Madrid, Juan de Lara Autonomous University of Madrid Pre-print Media Attached
15:00 30m Talk		Extracting Threats from System Descriptions with LLMs - Comparing One and Two Agents Strategies General Track Leonid Zelenskiy Innopolis University, Andrey Sadovykh Softeam