Metamorphic Testing of Deep Reinforcement Learning Agents with MDPMORPH (ASE 2025 - Tool Demonstration Track)

Who

Jiapeng Li, Zheng Zheng, Yuning Xing, Daixu Ren, Steven Cho, Valerio Terragni

Track

ASE 2025 Tool Demonstration Track

This program is tentative and subject to change.

Time Zone

The program is currently displayed in (GMT+09:00) Seoul.

Use conference time zone: (GMT+09:00) SeoulSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 17 Nov 2025 15:00 - 18:00 at Walker Hall - Tools - Testing & Analysis

Abstract

We present MDPMORPH, a tool for metamorphic testing of Deep Reinforcement Learning (DRL) agents. MDPMORPH is based on the Markov Decision Process (MDP) and targets the core reasoning properties of DRL agents to automatically uncover potential faults. It can generate metamorphic test suites and corresponding mutants directly from the DRL system under test. MDPMORPH uses a subset of the metamorphic test suite and models to train the thresholds of the nine proposed Metamorphic Relations (MRs) using Stochastic Gradient Descent. These MRs are based on the temporal characteristics of the Markov Decision Process (MDP), and the training aims to determine the optimal threshold for each MR. After obtaining the optimal threshold, MDPMORPH leverages the MRs to compare the execution results of different metamorphic test suites on the model under test and reports whether each test passes or fails. Finally, by collecting the execution results, MDPMORPH calculates the mutant detection rate of MR to validate its effectiveness. Experimental results show that MDPMORPH and the proposed MRs are highly effective in automatically detecting seeded faults (mutants).

Jiapeng Li

Beihang University

Zheng Zheng

Beihang University

China

Yuning Xing

University of Auckland

Daixu Ren

Beihang University

Steven Cho

The University of Auckland, New Zealand

Valerio Terragni

University of Auckland

New Zealand

This program is tentative and subject to change.

Time Zone

The program is currently displayed in (GMT+09:00) Seoul.

Use conference time zone: (GMT+09:00) SeoulSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 17 Nov
Displayed time zone: Seoul change

15:00 - 18:00	Tools - Testing & AnalysisTool Demonstration Track at Walker Hall

15:00 3h Demonstration		Towards Context-aware Mobile Privacy Notice: Implementation of A Deployable Contextual Privacy Policies Generator Tool Demonstration Track Haochen Gong Australian National University, Zhen Tao Technical University of Munich, Shidong Pan Columbia University & New York University, Zhenchang Xing CSIRO's Data61, Xiaoyu Sun Australian National University, Australia
15:00 3h Demonstration		Metamorphic Testing of Deep Reinforcement Learning Agents with MDPMORPH Tool Demonstration Track Jiapeng Li Beihang University, Zheng Zheng Beihang University, Yuning Xing University of Auckland, Daixu Ren Beihang University, Steven Cho The University of Auckland, New Zealand, Valerio Terragni University of Auckland
15:00 3h Demonstration		FlowStrider: Low-friction Continuous Threat Modeling Tool Demonstration Track Bernd Gruner German Aerospace Center (DLR), Institute of Data Science, Noah Erthel German Aerospace Center (DLR), Clemens-Alexander Brust German Aerospace Center (DLR)
15:00 3h Demonstration		ReFuzzer: Feedback-Driven Approach to Enhance Validity of LLM-Generated Test Programs Tool Demonstration Track Iti Shree King's College London, Karine Even-Mendoza King’s College London, Tomasz Radzik King's College London
15:00 3h Demonstration		DESIGNATOR: a Toolset for Automated GAN-enhanced Search-based Testing and Retraining of DNNs in Martian Environments Tool Demonstration Track Mohammed Attaoui University of Luxembourg, Fabrizio Pastore University of Luxembourg Pre-print
15:00 3h Demonstration		Chrysalis: A Lightweight Framework for Metamorphic Testing in Python Tool Demonstration Track Jai Parera University of California, Los Angeles, Nathan Huey University of California, Los Angeles, Ben Limpanukorn University of California, Los Angeles, Miryung Kim UCLA and Amazon Web Services
15:00 3h Demonstration		AndroFL: Evolutionary-Driven Fault Localization for Android Apps Tool Demonstration Track Vishal Singh Indian Institute of Technology Kanpur, Ravi Shankar Das Indian Institute of Technology Kanpur, Prajwal H G InMobi, Subhajit Roy IIT Kanpur DOI
15:00 3h Demonstration		XRintTest: An Automated Framework for User Interaction Testing in Extended Reality Applications Tool Demonstration Track Ruizhen Gu University of Sheffield, José Miguel Rojas University of Sheffield, Donghwan Shin University of Sheffield Pre-print
15:00 3h Demonstration		Training-Control-as-Code: Towards a declarative solution to control training Tool Demonstration Track Padmanabha V. Seshadri IBM India Research Lab, Harikrishnan Balagopal IBM India Research Lab, Mehant Kammakomati IBM India Research Lab, Ashok Pon Kumar IBM Research - India, Dushyant Behl IBM Research Media Attached
15:00 3h Demonstration		VUSC: An Extensible Research Platform for Java-Based Static Analysis Tool Demonstration Track Marc Miltenberger Fraunhofer SIT; ATHENE, Steven Arzt Fraunhofer SIT; ATHENE
15:00 3h Demonstration		BASHIRI: Learning Failure Oracles from Execution Features Tool Demonstration Track Marius Smytzek CISPA Helmholtz Center for Information Security, Martin Eberlein Humboldt-Universtität zu Berlin, Tural Mammadov CISPA Helmholtz Center for Information Security, Lars Grunske Humboldt-Universität zu Berlin, Andreas Zeller CISPA Helmholtz Center for Information Security
15:00 3h Demonstration		FETT: Fault Injection as an Educational and Training Tool in Cybersecurity Tool Demonstration Track Anaé De Baets University of Namur, Guillaume Nguyen University of Namur, Xavier Devroey University of Namur, Fabian Gilson University of Canterbury Pre-print