Write a Blog >>
ICST 2020
Sat 24 - Wed 28 October 2020 Porto, Portugal

There is a growing body of research on developing testing techniques for Deep Neural Networks (DNNs). We distinguish two general modes of testing for DNNs: Offline testing where DNNs are tested as individual units based on test datasets obtained independently from the DNNs under test, and online testing where DNNs are embedded into a specific application and tested in a close-loop mode in interaction with the application environment. In addition, we identify two sources for generating test datasets for DNNs: Datasets obtained from real-life and datasets generated by simulators. While offline testing can be used with datasets obtained from either sources, online testing is largely confined to using simulators since online testing within real-life applications can be time consuming, expensive and dangerous. In this paper, we study the following two important questions aiming to compare test datasets and testing modes for DNNs: First, can we use simulator-generated data as a reliable substitute to real-world data for the purpose of DNN testing? Second, how do online and offline testing results differ and complement each other? Though these questions are generally relevant to all autonomous systems, we study them in the context of automated driving systems where, as study subjects, we use DNNs automating end-to-end control of cars’ steering actuators. Our results show that simulator-generated datasets are able to yield DNN prediction errors that are similar to those obtained by testing DNNs with real-life datasets. Further, offline testing is more optimistic than online testing as many safety violations identified by online testing could not be identified by offline testing, while large prediction errors generated by offline testing always led to severe safety violations detectable by online testing.

Sun 25 Oct

Displayed time zone: Lisbon change

11:00 - 12:30
RT3 - Testing Deep Learning and Robotic SystemsResearch Papers at Farfetch (D. Maria) +11h
Chair(s): Antonio Filieri Imperial College London
11:00
30m
Talk
An Empirical Evaluation of Mutation Operators for Deep Learning SystemsDistinguished Paper Award
Research Papers
Gunel Jahangirova USI Lugano, Switzerland, Paolo Tonella Università della Svizzera Italiana (USI)
Link to publication DOI
11:30
30m
Talk
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study
Research Papers
Fitash Ul Haq University of Luxembourg, Donghwan Shin University of Luxembourg, Shiva Nejati University of Luxembourg, Lionel Briand University of Luxembourg, University of Ottawa
Link to publication DOI
12:00
30m
Talk
A Study on Challenges of Testing Robotic Systems
Research Papers
Afsoon Afzal Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA, Christopher Steven Timperley Carnegie Mellon University
Link to publication DOI
22:00 - 23:30
RT3 - Testing Deep Learning and Robotic SystemsResearch Papers at Farfetch (D. Maria)
Chair(s): João Pascoal Faria Faculty of Engineering, University of Porto and INESC TEC
22:00
30m
Talk
An Empirical Evaluation of Mutation Operators for Deep Learning SystemsDistinguished Paper Award
Research Papers
Gunel Jahangirova USI Lugano, Switzerland, Paolo Tonella Università della Svizzera Italiana (USI)
Link to publication DOI
22:30
30m
Talk
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study
Research Papers
Fitash Ul Haq University of Luxembourg, Donghwan Shin University of Luxembourg, Shiva Nejati University of Luxembourg, Lionel Briand University of Luxembourg, University of Ottawa
Link to publication DOI
23:00
30m
Talk
A Study on Challenges of Testing Robotic Systems
Research Papers
Afsoon Afzal Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA, Christopher Steven Timperley Carnegie Mellon University
Link to publication DOI