SSBSE 2025
Sun 16 Nov 2025 Seoul, South Korea
co-located with ASE 2025

Testing deep reinforcement learning (DRL) agents in safety-critical domains requires discovering diverse failure scenarios. Existing tools such as INDAGO rely on single-objective optimization focused solely on maximizing failure counts, but this does not ensure that the discovered scenarios are diverse or that they reveal distinct error types. We introduce INDAGO-Nexus, a multi-objective search approach that jointly optimizes for failure likelihood and test scenario diversity using multi-objective evolutionary algorithms with multiple diversity metrics and Pareto front selection strategies. We evaluated INDAGO-Nexus on three DRL agents: a humanoid walker, a self-driving car, and a parking agent. On average, INDAGO-Nexus discovers up to 50% more unique failures (test effectiveness) than INDAGO while reducing time-to-failure by up to 52% across all agents.
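The sketch below illustrates the core idea of Pareto-based selection over the two objectives named in the abstract (failure likelihood and scenario diversity). It is a minimal, generic illustration, not INDAGO-Nexus's actual implementation: the scenario encoding, the two objective functions, and the archive-based diversity metric are all assumptions introduced for this example.

    # Minimal sketch of multi-objective (Pareto) selection over two objectives:
    # failure likelihood and scenario diversity. All names and functions here
    # are illustrative assumptions, not the INDAGO-Nexus API.
    import random
    from typing import List, Tuple

    def failure_likelihood(scenario: List[float]) -> float:
        # Placeholder surrogate: in practice this score would come from
        # executing the DRL agent in the scenario and measuring how close
        # it came to a failure (e.g., a fall or a collision).
        return sum(scenario) / len(scenario)

    def diversity(scenario: List[float], archive: List[List[float]]) -> float:
        # Placeholder diversity metric: Euclidean distance to the nearest
        # previously archived scenario (larger = more novel).
        if not archive:
            return float("inf")
        return min(sum((a - b) ** 2 for a, b in zip(scenario, other)) ** 0.5
                   for other in archive)

    def dominates(a: Tuple[float, float], b: Tuple[float, float]) -> bool:
        # a dominates b if it is no worse on both objectives and strictly
        # better on at least one (both objectives are maximized).
        return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

    def pareto_front(population: List[List[float]],
                     archive: List[List[float]]) -> List[List[float]]:
        # Keep only the non-dominated scenarios of the current population.
        scored = [(s, (failure_likelihood(s), diversity(s, archive))) for s in population]
        return [s for s, obj in scored
                if not any(dominates(other, obj) for _, other in scored if other != obj)]

    if __name__ == "__main__":
        random.seed(0)
        population = [[random.random() for _ in range(4)] for _ in range(20)]
        archive: List[List[float]] = []
        front = pareto_front(population, archive)
        print(f"Selected {len(front)} non-dominated scenarios for the next generation")

In a full evolutionary loop, the selected front would seed crossover and mutation for the next generation, and executed scenarios would be added to the archive so that the diversity objective keeps pushing the search toward unexplored regions of the scenario space.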