Write a Blog >>
ICSE 2021
Mon 17 May - Sat 5 June 2021

Deep neural networks (DNNs) are becoming an integral part of most software systems. Previous work has shown that DNNs have bugs. Unfortunately, existing debugging techniques don’t support localizing DNN bugs because of the lack of understanding of model behaviors. The entire DNN model appears as a black box. To address these problems, we propose an approach and a tool that automatically determines whether the model is buggy or not, and identifies the root causes for DNN errors. Our key insight is that historic trends in values propagated between layers can be analyzed to identify faults, and also localize faults. To that end, we first enable dynamic analysis of deep learning applications: by converting it into an imperative representation and alternatively using a callback mechanism. Both mechanisms allows us to insert probes that enable dynamic analysis over the traces produced by the DNN while it is being trained on the training data. We then conduct dynamic analysis over the traces to identify the faulty layer or hyperparameter that causes the error. We propose an algorithm for identifying root causes by capturing any numerical error and monitoring the model during training and finding the relevance of every layer/parameter on the DNN outcome. We have collected a benchmark containing 40 buggy models and patches that contain real errors in deep learning applications from Stack Overflow and GitHub. Our benchmark can be used to evaluate automated debugging tools and repair techniques. We have evaluated our approach using this DNN bug-and-patch benchmark, and the results showed that our approach is much more effective than the existing debugging approach used in the state-of-the-practice Keras library. For 34/40 cases, our approach was able to detect faults whereas the best debugging approach provided by Keras detected 32/40 faults. Our approach was able to localize 21/40 bugs whereas Keras did not localize any faults.

Tue 25 May

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

19:35 - 20:55
1.5.1. Deep Neural Networks: General IssuesTechnical Track / Journal-First Papers / SEIP - Software Engineering in Practice at Blended Sessions Room 1 +12h
Chair(s): Ignacio Panach Universidad de Valencia
19:35
20m
Paper
Asset Management in Machine Learning: A SurveySEIP
SEIP - Software Engineering in Practice
Samuel Idowu Chalmers | University of Gothenburg, Daniel Strüber Radboud University Nijmegen, Thorsten Berger Chalmers | University of Gothenburg
Pre-print Media Attached
19:55
20m
Paper
An Empirical Study of Refactorings and Technical Debt in Machine Learning SystemsTechnical Track
Technical Track
Yiming Tang City University of New York (CUNY) Graduate Center, Raffi Khatchadourian CUNY Hunter College, Mehdi Bagherzadeh Oakland University, Rhia Singh City University of New York (CUNY) Macaulay Honors College, Ajani Stewart City University of New York (CUNY) Hunter College, Anita Raja City University of New York (CUNY) Hunter College
Pre-print Media Attached
20:15
20m
Paper
Logram: Efficient Log Parsing Using n-Gram DictionariesJournal-First
Journal-First Papers
Hetong Dai Concordia University, Heng Li Polytechnique Montréal, Che-Shao Chen Concordia University, Weiyi Shang Concordia University, Tse-Hsun (Peter) Chen Concordia University
DOI Pre-print Media Attached
20:35
20m
Paper
DeepLocalize: Fault Localization for Deep Neural NetworksTechnical Track
Technical Track
Mohammad Wardat Dept. of Computer Science, Iowa State University, Wei Le Dept. of Computer Science, Iowa State University, Hridesh Rajan Dept. of Computer Science, Iowa State University
Pre-print Media Attached

Wed 26 May

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

07:35 - 08:55
07:35
20m
Paper
Asset Management in Machine Learning: A SurveySEIP
SEIP - Software Engineering in Practice
Samuel Idowu Chalmers | University of Gothenburg, Daniel Strüber Radboud University Nijmegen, Thorsten Berger Chalmers | University of Gothenburg
Pre-print Media Attached
07:55
20m
Paper
An Empirical Study of Refactorings and Technical Debt in Machine Learning SystemsTechnical Track
Technical Track
Yiming Tang City University of New York (CUNY) Graduate Center, Raffi Khatchadourian CUNY Hunter College, Mehdi Bagherzadeh Oakland University, Rhia Singh City University of New York (CUNY) Macaulay Honors College, Ajani Stewart City University of New York (CUNY) Hunter College, Anita Raja City University of New York (CUNY) Hunter College
Pre-print Media Attached
08:15
20m
Paper
Logram: Efficient Log Parsing Using n-Gram DictionariesJournal-First
Journal-First Papers
Hetong Dai Concordia University, Heng Li Polytechnique Montréal, Che-Shao Chen Concordia University, Weiyi Shang Concordia University, Tse-Hsun (Peter) Chen Concordia University
DOI Pre-print Media Attached
08:35
20m
Paper
DeepLocalize: Fault Localization for Deep Neural NetworksTechnical Track
Technical Track
Mohammad Wardat Dept. of Computer Science, Iowa State University, Wei Le Dept. of Computer Science, Iowa State University, Hridesh Rajan Dept. of Computer Science, Iowa State University
Pre-print Media Attached