Wed 11 May 2022 12:05 - 12:10 at ICSE room 1-even hours - Machine Learning with and for SE 11 Chair(s): Ipek Ozkaya
Crash localization, an important step in debugging crashes, is challenging when dealing with an extremely large number of diverse applications and platforms and underlying root causes. Large-scale error reporting systems, e.g., Windows Error Reporting (WER), commonly rely on manually developed rules and heuristics to localize blamed frames causing the crashes. As new applications and features are routinely introduced and existing applications are run under new environments, developing new rules and maintaining existing ones become extremely challenging.
We propose a data-driven solution to address the problem. We start with the first large-scale empirical study of 362K crashes and their blamed methods reported to WER by tens of thousands of applications running in the field. The analysis provides valuable insights on where and how the crashes happen and what methods to blame for the crashes. These insights enable us to develop DeepAnalyze, a novel multi-task sequence labeling approach for identifying blamed frames in stack traces. We evaluate our model with over a million real-world crashes from four popular Microsoft applications and show that DeepAnalyze, trained with crashes from one set of applications, not only accurately localizes crashes of the same applications, but also bootstrap crash localization for other applications with zero to very little training data.
Wed 11 MayDisplayed time zone: Eastern Time (US & Canada) change
12:00 - 13:00 | Machine Learning with and for SE 11Journal-First Papers / Technical Track at ICSE room 1-even hours Chair(s): Ipek Ozkaya Carnegie Mellon Software Engineering Institute | ||
12:00 5mTalk | Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection Journal-First Papers Nadia Daoudi SnT, University of Luxembourg, Kevin Allix University of Luxembourg, Tegawendé F. Bissyandé SnT, University of Luxembourg, Jacques Klein University of Luxembourg Link to publication Pre-print Media Attached | ||
12:05 5mTalk | DeepAnalyze: Learning to Localize Crashes at Scale Technical Track Manish Shetty Microsoft Research, India, Chetan Bansal Microsoft Research, Suman Nath Microsoft Corporation, Sean Bowles Microsoft, Henry Wang Microsoft, Ozgur Arman Microsoft, Siamak Ahari Microsoft Pre-print Media Attached | ||
12:10 5mTalk | EREBA: Black-box Energy Testing of Adaptive Neural Networks Technical Track Mirazul Haque UT Dallas, Yaswanth Yadlapalli University of Texas at Dallas, Wei Yang University of Texas at Dallas, Cong Liu University of Texas at Dallas, USA Pre-print Media Attached | ||
12:15 5mTalk | Fast Changeset-based Bug Localization with BERT Technical Track Agnieszka Ciborowska Virginia Commonwealth University, Kostadin Damevski Virginia Commonwealth University Pre-print Media Attached | ||
12:20 5mTalk | Multilingual training for Software Engineering Technical Track Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis DOI Pre-print Media Attached | ||
12:25 5mTalk | Using Pre-Trained Models to Boost Code Review Automation Technical Track Rosalia Tufano Università della Svizzera Italiana, Simone Masiero Software Institute @ Università della Svizzera Italiana, Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Denys Poshyvanyk William and Mary, Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached |