Curiosity-Driven Testing for Sequential Decision-Making Process
Sequential decision-making processes (SDPs) are essential in addressing complex real-world challenges, such as autonomous driving, robotic control, and traffic management. While recent advances in Deep Learning (DL) have led to mature solutions for solving these complex problems, SDMs remain vulnerable to learning unsafe behaviors, posing significant risks in safety-critical applications. The state-of-the-art testing framework for SDMs primarily concentrates on identifying crash-triggering scenarios but neglects the diversity aspect of the crashes. As a result, redundant crash-triggering scenarios may be generated. This can reduce the overall testing performance and introduce additional analysis costs. To address this, we propose CureFuzz, a novel curiosity-driven black-box fuzz testing approach for SDMs. CureFuzz proposes a curiosity mechanism that allows a fuzzer to effectively explore novel and diverse scenarios, leading to improved detection of crash-triggering scenarios. Additionally, we introduce a multi-objective seed selection technique to balance the exploration of novel scenarios and the generation of crash-triggering scenarios, thereby optimizing the fuzzing process. We evaluate CureFuzz on various SDMs and sequential decision-making problems, such as autonomous driving and video game playing. Experimental results demonstrate that CureFuzz outperforms the state-of-the-art method by a substantial margin in the total number of faults and distinct types of crash-triggering scenarios. We also demonstrate that the crash-triggering scenarios found by CureFuzz can repair SDMs, which highlights CureFuzz as a valuable tool for testing SDMs and optimizing their performance.
Thu 18 AprDisplayed time zone: Lisbon change
14:00 - 15:30 | Testing 4Research Track / Journal-first Papers at Grande Auditório Chair(s): Shiva Nejati University of Ottawa | ||
14:00 15mTalk | Concrete Constraint Guided Symbolic Execution Research Track Yue Sun Institute of Information Engineering, CAS, China, Guowei Yang University of Queensland, Shichao Lv College of Cyberspace Security, Chinese Academy of Sciences, Zhi Li Institute of Information Engineering, Chinese Academy of Sciences, China, Limin Sun Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences, Pre-print | ||
14:15 15mTalk | Improving Testing Behavior by Gamifying IntelliJ Research Track DOI Pre-print | ||
14:30 15mTalk | SCTrans: Constructing a Large Public Scenario Dataset for Simulation Testing of Autonomous Driving Systems Research Track Jiarun Dai Fudan University, Bufan Gao Fudan University, Mingyuan Luo Fudan University, Zongan Huang Fudan University, Zhongrui Li Fudan University, Yuan Zhang Fudan University, Min Yang Fudan University | ||
14:45 15mTalk | Curiosity-Driven Testing for Sequential Decision-Making Process Research Track Junda He Singapore Management University, Zhou Yang Singapore Management University, Jieke Shi Singapore Management University, Chengran Yang Singapore Management University, Singapore, Kisub Kim Singapore Management University, Singapore, Bowen Xu North Carolina State University, Xin Zhou Singapore Management University, Singapore, David Lo Singapore Management University | ||
15:00 15mTalk | Detecting Logic Bugs in Graph Database Management Systems via Injective and Surjective Graph Pattern Transformation Research Track Yuancheng Jiang National University of Singapore, Jiahao Liu National University of Singapore, Jinsheng Ba National University of Singapore, Roland H. C. Yap National University of Singapore, Singapore, Zhenkai Liang National University of Singapore, Manuel Rigger National University of Singapore DOI Pre-print | ||
15:15 7mTalk | Testing Causality in Scientific Modelling Software Journal-first Papers Andrew Graham Clark The University of Sheffield, Michael Foster The University of Sheffield, Neil Walkinshaw The University of Sheffield, Robert Hierons The University of Sheffield, Benedikt Prifling Ulm University, Volker Schmidt Ulm University, Robert D. Turner The University of Sheffield | ||
15:22 7mTalk | HybridCISave: A Combined Build and Test Selection Approach in Continuous Integration Journal-first Papers |