Iterative Assessment and Improvement of DNN Operational Accuracy (ICSE 2023 - NIER - New Ideas and Emerging Results)

Who

Antonio Guerriero, Roberto Pietrantuono, Stefano Russo

Track

ICSE 2023 NIER - New Ideas and Emerging Results

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

When

Wed 17 May 2023, 15:07 - 15:15, at Meeting Room 104 (session: AI systems engineering). Chair(s): Xin Peng

Abstract

Deep Neural Networks (DNNs) are now widely adopted in many application domains thanks to their human-like, or even superhuman, performance in specific tasks. However, due to unpredictable or unconsidered operating conditions, unexpected failures show up in the field, making the performance of a DNN in operation very different from the one estimated prior to release.

In the life cycle of DNN systems, the assessment of accuracy is typically addressed in two ways: offline, via sampling of operational inputs, or online, via pseudo-oracles. The former is considered more expensive due to the need for manual labeling of the sampled inputs. The latter is automatic but less accurate.

We believe that emerging iterative, industrial-strength life cycle models for Machine Learning systems, like MLOps, offer the possibility to leverage inputs observed in operation not only to provide faithful estimates of a DNN's accuracy, but also to improve it through remodeling/retraining actions.

To this end, we propose DAIC (DNN Assessment and Improvement Cycle), an approach that combines “low-cost” pseudo-oracles and “high-cost” sampling techniques to estimate and improve the operational accuracy of a DNN within iterations of its life cycle. Preliminary results show the benefits of combining online and offline approaches and integrating them in the DNN life cycle.
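The two assessment styles contrasted in the abstract can be illustrated with a minimal sketch. This is not the DAIC method itself, whose details are not given here. It assumes simple random sampling for the offline estimate and, purely for illustration, a confidence threshold as the pseudo-oracle. The function names, the 0.8 threshold, and the synthetic data are all hypothetical.

```python
import math
import random


def sampled_accuracy(labels, predictions, sample_size, rng):
    """Offline estimate: label a simple random sample of operational inputs.

    `labels` stands in for the manual labeling step, which is the
    expensive part in practice. Returns the estimated accuracy and a
    normal-approximation 95% half-width.
    """
    idx = rng.sample(range(len(predictions)), sample_size)
    correct = sum(predictions[i] == labels[i] for i in idx)
    p = correct / sample_size
    half_width = 1.96 * math.sqrt(p * (1 - p) / sample_size)
    return p, half_width


def pseudo_oracle_accuracy(confidences, threshold=0.8):
    """Online estimate: treat a prediction as correct when the model's
    confidence reaches `threshold` (an assumed, illustrative pseudo-oracle)."""
    return sum(c >= threshold for c in confidences) / len(confidences)


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic operational data: binary labels, predictions that agree
    # with the label about 85% of the time, and random confidences.
    labels = [rng.randint(0, 1) for _ in range(1000)]
    predictions = [y if rng.random() < 0.85 else 1 - y for y in labels]
    confidences = [rng.uniform(0.5, 1.0) for _ in predictions]

    p, hw = sampled_accuracy(labels, predictions, sample_size=100, rng=rng)
    print(f"sampling estimate: {p:.2f} +/- {hw:.2f} (100 labels needed)")
    print(f"pseudo-oracle estimate: {pseudo_oracle_accuracy(confidences):.2f} (no labels needed)")
```

The sampling estimate pays for each label but comes with an error bound. The pseudo-oracle estimate is free to compute but is only as good as the assumed correctness signal, which mirrors the cost/accuracy trade-off described above.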

Link to Preprint

https://arxiv.org/abs/2303.01295

Antonio Guerriero

Università di Napoli Federico II

Roberto Pietrantuono

Università di Napoli Federico II

Stefano Russo

Università di Napoli Federico II

Session Program

Wed 17 May
Displayed time zone: Hobart

13:45 - 15:15	AI systems engineering (SEIP - Software Engineering in Practice / Technical Track / NIER - New Ideas and Emerging Results / Journal-First Papers) at Meeting Room 104. Chair(s): Xin Peng, Fudan University

13:45 15m Talk	FedDebug: Systematic Debugging for Federated Learning Applications
	Track: Technical Track
	Authors: Waris Gill (Virginia Tech), Ali Anwar (University of Minnesota), Muhammad Ali Gulzar (Virginia Tech)

14:00 15m Talk	Practical and Efficient Model Extraction of Sentiment Analysis APIs
	Track: Technical Track
	Authors: Weibin Wu (Sun Yat-sen University), Jianping Zhang (The Chinese University of Hong Kong), Victor Junqiu Wei (The Hong Kong Polytechnic University), Xixian Chen (Tencent), Zibin Zheng (School of Software Engineering, Sun Yat-sen University), Irwin King (The Chinese University of Hong Kong), Michael Lyu (The Chinese University of Hong Kong)

14:15 15m Talk	CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
	Track: Technical Track
	Authors: Changan Niu (Software Institute, Nanjing University), Chuanyi Li (Nanjing University), Vincent Ng (Human Language Technology Research Institute, University of Texas at Dallas, Richardson, TX 75083-0688), Bin Luo (Nanjing University)
	Pre-print available

14:30 15m Talk	Challenges in Adopting Artificial Intelligence Based User Input Verification Framework in Reporting Software Systems
	Track: SEIP - Software Engineering in Practice
	Authors: Dong Jae Kim (Concordia University), Tse-Hsun (Peter) Chen (Concordia University), Steve Sporea, Andrei Toma (ERA Environmental Management Solutions), Laura Weinkam, Sarah Sajedi (ERA Environmental Management Solutions)

14:45 7m Talk	Towards Understanding Quality Challenges of the Federated Learning for Neural Networks: A First Look from the Lens of Robustness
	Track: Journal-First Papers
	Authors: Amin Eslami Abyane (University of Calgary), Derui Zhu (Technical University of Munich), Roberto Souza (University of Calgary), Lei Ma (University of Alberta), Hadi Hemmati (York University)

14:52 7m Talk	An Empirical Study of the Impact of Hyperparameter Tuning and Model Optimization on the Performance Properties of Deep Neural Networks
	Track: Journal-First Papers
	Authors: Lizhi Liao (Concordia University), Heng Li (Polytechnique Montréal), Weiyi Shang (University of Waterloo), Lei Ma (University of Alberta)

15:00 7m Talk	Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction and Clustering
	Track: Journal-First Papers
	Authors: Mohammed Attaoui (University of Luxembourg), Hazem FAHMY (University of Luxembourg), Fabrizio Pastore (University of Luxembourg), Lionel Briand (University of Luxembourg; University of Ottawa)
	Link to publication and pre-print available

15:07 7m Talk	Iterative Assessment and Improvement of DNN Operational Accuracy
	Track: NIER - New Ideas and Emerging Results
	Authors: Antonio Guerriero (Università di Napoli Federico II), Roberto Pietrantuono (Università di Napoli Federico II), Stefano Russo (Università di Napoli Federico II)
	Pre-print: https://arxiv.org/abs/2303.01295