Write a Blog >>
ICSE 2023
Sun 14 - Sat 20 May 2023 Melbourne, Australia
Fri 19 May 2023 12:00 - 12:15 at Meeting Room 101 - AI testing 2 Chair(s): Gunel Jahangirova

Deep Neural Networks (DNNs) have been widely used to perform real-world tasks in cyber-physical systems such as Autonomous Diving Systems (ADS). Ensuring the correct behavior of such DNN-Enabled Systems (DES) is a crucial topic. Online testing is one of the promising modes for testing such systems with their application environments (simulated or real) in a closed loop taking into account the continuous interaction between the systems and their environments. However, the environmental variables (e.g., lighting conditions) that might change during the systems’ operation in the real world, causing the DES to violate requirements (safety, functional), are often kept constant during the execution of an online test scenario due to the two major challenges: (1) the space of all possible scenarios to explore would become even larger if they changed and (2) there are typically many requirements to test simultaneously.

In this paper, we present MORLOT (Many-Objective Reinforcement Learning for Online Testing), a novel online testing approach to address these challenges by combining Reinforcement Learning (RL) and many-objective search. MORLOT leverages RL to incrementally generate sequences of environmental changes while relying on many-objective search to determine the changes so that they are more likely to achieve any of the uncovered objectives. We empirically evaluate MORLOT using CARLA, a high-fidelity simulator widely used for autonomous driving research, integrated with Transfuser, a DNN-enabled ADS for end-to-end driving. The evaluation results show that MORLOT is significantly more effective and efficient than alternatives with a large effect size. In other words, MORLOT is a good option to test DES with dynamically changing environments while accounting for multiple safety requirements.

Fri 19 May

Displayed time zone: Hobart change

11:00 - 12:30
AI testing 2Technical Track / Journal-First Papers at Meeting Room 101
Chair(s): Gunel Jahangirova USI Lugano, Switzerland
11:00
15m
Talk
Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation
Technical Track
Qiang Hu University of Luxembourg, Yuejun GUo University of Luxembourg, Xiaofei Xie Singapore Management University, Maxime Cordy University of Luxembourg, Luxembourg, Lei Ma University of Alberta, Mike Papadakis University of Luxembourg, Luxembourg, Yves Le Traon University of Luxembourg, Luxembourg
Pre-print
11:15
15m
Talk
Testing the Plasticity of Reinforcement Learning Based Systems
Journal-First Papers
Matteo Biagiola UniversitĂ  della Svizzera italiana, Paolo Tonella USI Lugano
Link to publication DOI Pre-print
11:30
15m
Talk
CC: Causality-Aware Coverage Criterion for Deep Neural Networks
Technical Track
Zhenlan Ji The Hong Kong University of Science and Technology, Pingchuan Ma HKUST, Yuanyuan Yuan The Hong Kong University of Science and Technology, Shuai Wang Hong Kong University of Science and Technology
11:45
15m
Talk
Balancing Effectiveness and Flakiness of Non-Deterministic Machine Learning Tests
Technical Track
Chunqiu Steven Xia University of Illinois at Urbana-Champaign, Saikat Dutta University of Illinois at Urbana-Champaign, Sasa Misailovic University of Illinois at Urbana-Champaign, Darko Marinov University of Illinois at Urbana-Champaign, Lingming Zhang University of Illinois at Urbana-Champaign
12:00
15m
Talk
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems
Technical Track
Fitash ul haq , Donghwan Shin The University of Sheffield, Lionel Briand University of Luxembourg; University of Ottawa
Pre-print
12:15
15m
Talk
Reliability Assurance for Deep Neural Network Architectures Against Numerical Defects
Technical Track
Linyi Li University of Illinois at Urbana-Champaign, Yuhao Zhang University of Wisconsin-Madison, Luyao Ren Peking University, China, Yingfei Xiong Peking University, Tao Xie Peking University
Pre-print