ICTSS 2025
Wed 17 - Fri 19 September 2025 Limassol, Cyprus
co-located with ECSA 2025
Fri 19 Sep 2025 11:00 - 11:30 at Atrium C - Reinforcement Learning and Generative Testing Chair(s): Li Huang

Reinforcement learning (RL) agents show great promise in solving sequential decision-making tasks. However, validating the reli- ability and performance of the agent policies’ behavior for deployment remains challenging. Most reinforcement learning policy testing methods produce test suites tailored to the agent policy being tested, and their relevance to other policies is unclear. This work presents Multi-Policy Test Case Selection (MPTCS), a novel automated test suite selection method for RL environments, designed to extract test cases generated by any policy testing framework based on their solvability, diversity, and general difficulty. MPTCS uses a set of policies to select a diverse col- lection of reusable policy-agnostic test cases that reveal typical flaws in the agents’ behavior. The set of policies selects test cases from a candi- date pool, which can be generated by any policy testing method, based on a difficulty score. We assess the effectiveness of the difficulty score and how the method’s effectiveness and cost depend on the number of policies in the set. Additionally, a method for promoting diversity in the test suite, a discretized general test case descriptor surface inspired by quality-diversity algorithms, is examined to determine how it covers the state space and which policies it triggers to produce faulty behaviors.

Fri 19 Sep

Displayed time zone: Athens change

11:00 - 12:30
Reinforcement Learning and Generative TestingGeneral Track at Atrium C
Chair(s): Li Huang Constructor Institute Schaffhausen
11:00
30m
Talk
Reusable Test Suites for Reinforcement Learning
General Track
Jørn Eirik Betten Simula Research Laboratory; Oslo Metropolitan University, Quentin Mazouni Simula Research Laboratory, Dennis Gross Simula Research Laboratory, Pedro Lind Oslo Metropolitan University; School of Economics,Innovation and Technology, Kristiania University of AppliedSciences, Helge Spieker Simula Research Laboratory
11:30
30m
Talk
Test Generation for Deep Reinforcement Learning Using LRP-Guided Mutation of Classified Configurations
General Track
Brice Tchuenkam Université du Québec en Outaouais, Omer Nguena Timo Université du Québec en Outaouais
12:00
30m
Talk
Test Amplification for REST APIs via Single and Multi-Agent LLM Systems
General Track
Robbe Nooyens University of Antwerp, Tolgahan Bardakci University of Antwerp and Flanders Make, Mutlu Beyazıt University of Antwerp and Flanders Make vzw, Serge Demeyer University of Antwerp and Flanders Make vzw