Coverage-based Strategies for the Automated Synthesis of Test Scenarios for Conversational Agents
Conversational agents – or chatbots – are increasingly used as the user interface to many software services. While open-domain chatbots like ChatGPT excel in their ability to chat about any topic, task-oriented conversational agents are designed to perform goaloriented tasks (e.g., booking or shopping) guided by a dialoguebased user interaction, which is explicitly designed. Like any kind of software system, task-oriented conversational agents need to be properly tested to ensure their quality. For this purpose, some tools permit defining and executing conversation test cases. However, there are currently no established means to assess the coverage of the design of a task-oriented agent by a test suite, or mechanisms to automate quality test case generation ensuring the agent coverage.
To attack this problem, we propose test coverage criteria for task-oriented conversational agents, and define coverage-based strategies to synthesise test scenarios, some oriented to test case reduction. We provide an implementation of the criteria and the strategies that is independent of the agent development platform. Finally, we report on their evaluation on open-source Dialogflow and Rasa agents, and a comparison against a state-of-the-art testing tool. The experiment shows benefits in terms of test generation correctness, increased coverage and reduced testing time.
Mon 15 AprDisplayed time zone: Lisbon change
11:00 - 12:30 | Session 1: Test CoverageAST 2024 at Amália Rodrigues Chair(s): Gilles Perrouin Fonds de la Recherche Scientifique - FNRS & University of Namur | ||
11:00 20mFull-paper | Mutation Coverage is not Strongly Correlated with Mutation Coverage AST 2024 | ||
11:20 20mFull-paper | Running a Red Light: An Investigation into Why Software Engineers (Occasionally) Ignore Coverage Checks AST 2024 Alexander Sterk Delft University of Technology, Mairieli Wessel Radboud University, Eli Hooten Sentry.io, Andy Zaidman Delft University of Technology DOI Pre-print | ||
11:40 20mFull-paper | Coverage-based Strategies for the Automated Synthesis of Test Scenarios for Conversational Agents AST 2024 Pablo C Canizares Autonomous University of Madrid, Spain, Daniel Ávila Autonomous University of Madrid, Sara Perez-Soler Universidad Autónoma de Madrid, Esther Guerra Universidad Autónoma de Madrid, Juan de Lara Autonomous University of Madrid Pre-print | ||
12:00 20mFull-paper | WallMauer: Robust Code Coverage Instrumentation for Android Apps AST 2024 Michael Auer University of Passau, Iván Arcuschin Moreno University of Buenos Aires, Argentina, Gordon Fraser University of Passau |