DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs
Thu 12 May 2022 04:15 - 04:20 at ICSE room 1-even hours - Machine Learning with and for SE 3 Chair(s): Antinisca Di Marco
As Deep Learning (DL) systems are widely deployed for mission-critical applications, debugging such systems becomes essential. Most existing works identify and repair suspicious neurons on the trained Deep Neural Network (DNN), which, unfortunately, might be a detour. Specifically, several existing studies have reported that many unsatisfactory behaviors are actually originated from the faults residing in DL programs. Besides, locating faulty neurons is not actionable for developers, while locating the faulty statements in DL programs can provide developers with more useful information for debugging. Though a few recent studies were proposed to pinpoint the faulty statements in DL programs or the training settings (e.g. too large learning rate), they were mainly designed based on predefined rules, leading to many false alarms or false negatives, especially when the faults are beyond their capabilities.
In view of these limitations, in this paper, we proposed DeepFD, a learning-based fault diagnosis and localization framework which maps the fault localization task to a learning problem. In particular, it infers the suspicious fault types via monitoring the runtime features extracted during DNN model training, and then locates the diagnosed faults in DL programs. It overcomes the limitations by identifying the root causes of faults in DL programs instead of neurons, and diagnosing the faults by a learning approach instead of a set of hard-coded rules. The evaluation exhibits the potential of DeepFD. It correctly diagnoses 52% of cases, compared with half (27%) achieved by the best state-of-the-art works. Besides, for fault localization, DeepFD also outperforms the existing works, reaching 42%, which almost doubles the best result (23%) achieved by the existing works.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
21:00 - 22:00 | Machine Learning with and for SE 6Technical Track at ICSE room 3-odd hours Chair(s): Ali Ouni ETS Montreal, University of Quebec | ||
21:00 5mTalk | DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs Technical Track Jialun Cao Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Meiziniu LI Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Xiao Chen Huazhong University of Science and Technology, Ming Wen Huazhong University of Science and Technology, Yongqiang Tian University of Waterloo, Bo Wu MIT-IBM Watson AI Lab in Cambridge, Shing-Chi Cheung Hong Kong University of Science and Technology DOI Pre-print Media Attached | ||
21:05 5mTalk | Fast Changeset-based Bug Localization with BERT Technical Track Agnieszka Ciborowska Virginia Commonwealth University, Kostadin Damevski Virginia Commonwealth University Pre-print Media Attached | ||
21:10 5mTalk | Multilingual training for Software Engineering Technical Track Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis DOI Pre-print Media Attached | ||
21:15 5mTalk | NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification Technical Track haibin zheng Zhejiang University of Technology, Zhiqing Chen Zhejiang University of Technology, Tianyu Du Zhejiang University, Xuhong Zhang Zhejiang University, Yao Cheng Huawei International, Shouling Ji Zhejiang University, Jingyi Wang Zhejiang University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Jinyin Chen College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China DOI Pre-print Media Attached | ||
21:20 5mTalk | Type4Py: Practical Deep Similarity Learning-Based Type Inference for Python Technical Track Amir Mir Delft University of Technology, Evaldas Latoskinas Delft University of Technology, Sebastian Proksch Delft University of Technology, Netherlands, Georgios Gousios Endor Labs & Delft University of Technology DOI Pre-print Media Attached | ||
21:25 5mTalk | Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules Technical Track Pre-print Media Attached |
Thu 12 MayDisplayed time zone: Eastern Time (US & Canada) change
04:00 - 05:00 | Machine Learning with and for SE 3Technical Track / Journal-First Papers / SEIP - Software Engineering in Practice at ICSE room 1-even hours Chair(s): Antinisca Di Marco University of L'Aquila | ||
04:00 5mTalk | In-IDE Code Generation from Natural Language: Promise and Challenges Journal-First Papers Frank Xu Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Graham Neubig Carnegie Mellon University | ||
04:05 5mTalk | Active Learning of Discriminative Subgraph Patterns for API Misuse Detection Journal-First Papers Pre-print Media Attached File Attached | ||
04:10 5mTalk | Dependency Tracking for Risk Mitigation in Machine Learning (ML) Systems SEIP - Software Engineering in Practice Xiwei (Sherry) Xu CSIRO Data61, Chen Wang CSIRO DATA61, Zhen Wang CSIRO Data61, Qinghua Lu CSIRO’s Data61, Liming Zhu CSIRO’s Data61; UNSW Media Attached | ||
04:15 5mTalk | DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs Technical Track Jialun Cao Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Meiziniu LI Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Xiao Chen Huazhong University of Science and Technology, Ming Wen Huazhong University of Science and Technology, Yongqiang Tian University of Waterloo, Bo Wu MIT-IBM Watson AI Lab in Cambridge, Shing-Chi Cheung Hong Kong University of Science and Technology DOI Pre-print Media Attached | ||
04:20 5mTalk | What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code Technical Track Yao Wan Huazhong University of Science and Technology, Wei Zhao Huazhong University of Science and Technology, Hongyu Zhang University of Newcastle, Yulei Sui University of Technology Sydney, Guandong Xu University of Technology, Sydney, Hai Jin Huazhong University of Science and Technology Pre-print Media Attached | ||
04:25 5mTalk | A Universal Data Augmentation Approach for Fault Localization Technical Track Huan Xie Chongqing University, Yan Lei School of Big Data & Software Engineering, Chongqing University, Meng Yan Chongqing University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Xin Xia Huawei Software Engineering Application Technology Lab, Xiaoguang Mao National University of Defense Technology DOI Pre-print Media Attached | ||
04:30 5mTalk | DeepState: Selecting Test Suites to Enhance the Robustness of Recurrent Neural Networks Technical Track Zixi Liu Nanjing University, Yang Feng Nanjing University, Yining Yin Nanjing University, China, Zhenyu Chen Nanjing University DOI Pre-print Media Attached |