The Pursuit of Diversity: Multi-Objective Testing of Deep Reinforcement Learning Agents
Testing deep reinforcement learning (DRL) agents in safety-critical domains requires discovering diverse failure scenarios. Existing tools such as INDAGO rely on single-objective optimization focused solely on maximizing failure counts, but this does not ensure discovered scenarios are diverse or reveal distinct error types. We introduce INDAGO-Nexus, a multi-objective search approach that jointly optimizes for failure likelihood and test scenario diversity using multi-objective evolutionary algorithms with multiple diversity metrics and Pareto front selection strategies. We evaluated INDAGO-Nexus on three DRL agents: humanoid walker, self-driving car, and parking agent. On average, INDAGO-Nexus discovers up to 50% more unique failures (test effectiveness) than INDAGO while reducing time-to-failure by up to 52% across all agents.
Sun 16 NovDisplayed time zone: Seoul change
08:30 - 10:00 | |||
08:30 10mTalk | Opening Keynote Shin Hong Chungbuk National University | ||
08:40 20mTalk | Search-based Hyperparameter Tuning for Python Unit Test Generation Research Papers Pre-print | ||
09:00 20mTalk | Constraint-Guided Unit Test Generation for Machine Learning Libraries Research Papers Lukas Krodinger University of Passau, Altin Hajdari University of Passau, Stephan Lukasczyk JetBrains Research, Gordon Fraser University of Passau Pre-print | ||
09:20 20mTalk | LLM-Guided Fuzzing for Pathological Input Generation Research Papers | ||
09:40 20mTalk | The Pursuit of Diversity: Multi-Objective Testing of Deep Reinforcement Learning Agents Research Papers Antony Bartlett TU Delft, The Netherlands, Cynthia C. S. Liem Delft University of Technology, Annibale Panichella Delft University of Technology | ||