DeepStability: A Study of Unstable Numerical Methods and Their Solutions in Deep Learning
Fri 13 May 2022 11:15 - 11:20 at ICSE room 1-odd hours - Reliability and Safety 6 Chair(s): Pasqualina Potena
Wed 25 May 2022 09:45 - 09:50 at Room 304+305 - Papers 3: Reliability and Safety Chair(s): Cristian Cadar
Wed 25 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 1
Deep learning (DL) has become an integral part of solutions to various important problems, which is why ensuring the quality of DL systems is essential. One of the challenges of achieving reliability and robustness of DL software is to ensure that algorithm implementations are numerically stable. DL algorithms require a large amount and a wide variety of numerical computations. A naive implementation of numerical computation can lead to errors that may result incorrect or inaccurate learning and results. A numerical algorithm or a mathematical formula can have several implementations that are mathematically equivalent, but have different numerical stability properties. Designing numerically stable algorithm implementations is challenging, because it requires an interdisciplinary knowledge of software engineering, DL, and numerical analysis. In this paper, we study two mature DL libraries PyTorch and Tensorflow with the goal of identifying unstable numerical methods and their solutions. Specifically, we investigate which DL algorithms are numerically unstable and conduct an in-depth analysis of the root cause, manifestation, and patches to numerical instabilities. Based on these findings, we launch DeepStability, the first database of numerical stability issues and solutions in DL. Our findings and DeepStability provide future references to developers and tool builders to prevent, detect, localize and fix numerically unstable algorithm implementations. To demonstrate that, using DeepStability we have located numerical stability issues in Tensorflow, and submitted a fix which has been accepted and merged in.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
20:00 - 21:00 | Reliability and Safety 5Technical Track / SEIP - Software Engineering in Practice at ICSE room 1-even hours Chair(s): David Lo Singapore Management University | ||
20:00 5mTalk | When Cyber-Physical Systems Meet AI: A Benchmark, an Evaluation, and a Way Forward SEIP - Software Engineering in Practice Jiayang Song University of Alberta, Deyun Lyu Kyushu university, Zhenya Zhang Nanyang Technological University, Zhijie Wang University of Alberta, Tianyi Zhang Purdue University, Lei Ma University of Alberta DOI Pre-print Media Attached | ||
20:05 5mTalk | Multi-Intention-Aware Configuration Selection for Performance Tuning Technical Track Haochen He National University of Defense Technology, Zhouyang Jia National University of Defense Technology, Shanshan Li National University of Defense Technology, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Chenglong Zhou National University of Defense Technology, Qing Liao Harbin Institute of Technology, Ji Wang National University of Defense Technology, Liao Xiangke National University of Defense Technology Pre-print Media Attached | ||
20:10 5mTalk | DeepStability: A Study of Unstable Numerical Methods and Their Solutions in Deep Learning Technical Track Pre-print Media Attached | ||
20:15 5mTalk | If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components Technical Track Boyue Caroline Hu University of Toronto, Lina Marsso University of Toronto, Krzysztof Czarnecki University of Waterloo, Canada, Rick Salay University of Toronto, Huakun Shen University of Toronto, Marsha Chechik University of Toronto DOI Pre-print Media Attached |