Adaptive Deep Learning Model Selection on Embedded Systems (LCTES 2018)

Who

Ben Taylor, Vicent Sanz Marco, Willy Wolff, Yehia Elkhatib, Zheng Wang

Track

LCTES 2018

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 19 Jun 2018 14:00 - 14:25 at Discovery AB - Full paper session on Adaptation and Hardware

Abstract

The recent ground-breaking advances in deep learning networks (DNNs) make them attractive for embedded systems. However, it can take a long time for DNNs to make an inference on resource-limited embedded devices. Offloading the computation into the cloud is often infeasible due to privacy concerns, high latency, or the lack of connectivity. As such, there is a critical need to find a way to effectively execute the DNN models locally on the devices. This paper presents an adaptive scheme to determine which DNN model to use for a given input, by considering the desired accuracy and inference time. Our approach employs machine learning to develop a predictive model to quickly select a pre-trained DNN to use for a given input and the optimization constraint. We achieve this by first training off-line a predictive model, and then use the learnt model to select a DNN model to use for new, unseen inputs. We apply our approach to the image classification task and evaluate it on a Jetson TX2 embedded deep learning platform using the ImageNet ILSVRC 2012 validation dataset. We consider a range of influential DNN models. Experimental results show that our approach achieves a 7.52% improvement in inference accuracy, and a 1.8x reduction in inference time over the most-capable single DNN model.

Ben Taylor

Lancaster University, UK

United Kingdom

Vicent Sanz Marco

Lancaster University

Willy Wolff

Lancaster University

Yehia Elkhatib

Lancaster University

Zheng Wang

Lancaster University

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 19 Jun
Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:40	Full paper session on Adaptation and HardwareLCTES 2018 at Discovery AB

14:00 25m Full-paper		Adaptive Deep Learning Model Selection on Embedded Systems LCTES 2018 Ben Taylor Lancaster University, UK, Vicent Sanz Marco Lancaster University, Willy Wolff Lancaster University, Yehia Elkhatib Lancaster University, Zheng Wang Lancaster University
14:25 25m Full-paper		Optimizing RAID/SSD Controllers with Lifetime Extension for Flash-based SSD Array LCTES 2018 Lei Han , Zhaoyan Shen The Hong Kong Polytechnic University, Zili Shao The Hong Kong Polytechnic University, Tao Li University of Florida
14:50 25m Full-paper		Verification of Coarse-Grained Reconfigurable Arrays through Random Test Programs LCTES 2018 Bernhard Egger Seoul National University, Eunjin Song Seoul National University, Hochan Lee Seoul National University, Daeyoung Shin Seoul National University
15:15 25m Full-paper		Decoupling Address Generation from Loads and Stores to Improve Data Access Energy Efficiency LCTES 2018 Michael Stokes Florida State University, Ryan Baird Florida State University, Zhaoxiang Jin Michigan Technological University, David B. Whalley , Soner Onder Michigan Technological University