Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study (ICST 2020 - Research Papers)

Who

Fitash Ul Haq, Donghwan Shin, Shiva Nejati, Lionel Briand

Track

ICST 2020 Research Papers

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 25 Oct 2020 11:30 - 12:00 at Farfetch (D. Maria) - RT3 - Testing Deep Learning and Robotic Systems Chair(s): Antonio Filieri
Sun 25 Oct 2020 22:30 - 23:00 at Farfetch (D. Maria) - RT3 - Testing Deep Learning and Robotic Systems Chair(s): João Pascoal Faria

Abstract

There is a growing body of research on developing testing techniques for Deep Neural Networks (DNNs). We distinguish two general modes of testing for DNNs: Offline testing where DNNs are tested as individual units based on test datasets obtained independently from the DNNs under test, and online testing where DNNs are embedded into a specific application and tested in a close-loop mode in interaction with the application environment. In addition, we identify two sources for generating test datasets for DNNs: Datasets obtained from real-life and datasets generated by simulators. While offline testing can be used with datasets obtained from either sources, online testing is largely confined to using simulators since online testing within real-life applications can be time consuming, expensive and dangerous. In this paper, we study the following two important questions aiming to compare test datasets and testing modes for DNNs: First, can we use simulator-generated data as a reliable substitute to real-world data for the purpose of DNN testing? Second, how do online and offline testing results differ and complement each other? Though these questions are generally relevant to all autonomous systems, we study them in the context of automated driving systems where, as study subjects, we use DNNs automating end-to-end control of cars’ steering actuators. Our results show that simulator-generated datasets are able to yield DNN prediction errors that are similar to those obtained by testing DNNs with real-life datasets. Further, offline testing is more optimistic than online testing as many safety violations identified by online testing could not be identified by offline testing, while large prediction errors generated by offline testing always led to severe safety violations detectable by online testing.

Link to Publication

https://doi.org/10.1109/ICST46399.2020.00019

DOI

https://doi.org/10.1109/ICST46399.2020.00019

Fitash Ul Haq

University of Luxembourg

Luxembourg

Donghwan Shin

University of Luxembourg

Luxembourg

Shiva Nejati

University of Luxembourg

Luxembourg

Lionel Briand

University of Luxembourg, University of Ottawa

Luxembourg

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 25 Oct
Displayed time zone: Lisbon change

11:00 - 12:30	RT3 - Testing Deep Learning and Robotic SystemsResearch Papers at Farfetch (D. Maria) +11h Chair(s): Antonio Filieri Imperial College London

11:00 30m Talk		An Empirical Evaluation of Mutation Operators for Deep Learning Systems Research Papers Gunel Jahangirova USI Lugano, Switzerland, Paolo Tonella Università della Svizzera Italiana (USI) Link to publication DOI
11:30 30m Talk		Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study Research Papers Fitash Ul Haq University of Luxembourg, Donghwan Shin University of Luxembourg, Shiva Nejati University of Luxembourg, Lionel Briand University of Luxembourg, University of Ottawa Link to publication DOI
12:00 30m Talk		A Study on Challenges of Testing Robotic Systems Research Papers Afsoon Afzal Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA, Christopher Steven Timperley Carnegie Mellon University Link to publication DOI

22:00 - 23:30	RT3 - Testing Deep Learning and Robotic SystemsResearch Papers at Farfetch (D. Maria) Chair(s): João Pascoal Faria Faculty of Engineering, University of Porto and INESC TEC

22:00 30m Talk		An Empirical Evaluation of Mutation Operators for Deep Learning Systems Research Papers Gunel Jahangirova USI Lugano, Switzerland, Paolo Tonella Università della Svizzera Italiana (USI) Link to publication DOI
22:30 30m Talk		Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study Research Papers Fitash Ul Haq University of Luxembourg, Donghwan Shin University of Luxembourg, Shiva Nejati University of Luxembourg, Lionel Briand University of Luxembourg, University of Ottawa Link to publication DOI
23:00 30m Talk		A Study on Challenges of Testing Robotic Systems Research Papers Afsoon Afzal Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA, Christopher Steven Timperley Carnegie Mellon University Link to publication DOI