The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects (ICSE 2022 - Journal-First Papers)

Sun 8 - Fri 27 May 2022

Who

Davide Falessi, Aalok Ahluwalia, Massimiliano Di Penta

Track

ICSE 2022 Journal-First Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Tue 10 May 2022 04:00 - 04:05 at ICSE room 2 - Software Testing 1 Chair(s): Ajitha Rajan
Wed 11 May 2022 21:05 - 21:10 at ICSE room 4 - Software Testing 7 Chair(s): Upsorn Praphamontripong
Thu 26 May 2022 09:30 - 09:35 at Room 304+305 - Papers 11: Release Engineering and DevOps Chair(s): Andy Zaidman

Abstract

Defect prediction models can help prioritize testing, analysis, or code review activities, and have been the subject of substantial effort in academia and of some applications in industrial contexts. A necessary precondition for creating a defect prediction model is the availability of defect data from the history of projects. If this data is noisy, the resulting defect prediction model could be unreliable. One cause of noise in defect datasets is the presence of “dormant defects,” i.e., defects discovered several releases after their introduction. A dormant defect can cause a class to be labeled as defect-free when it is not; such a class is said to be “snoring.” In this article, we investigate the impact of snoring on classifiers’ accuracy and the effectiveness of a possible countermeasure, i.e., dropping data that is too recent from the training set. We analyze the accuracy of 15 machine learning defect prediction classifiers on data from more than 4,000 defects and 600 releases of 19 open source projects from the Apache ecosystem. Our results show that, on average across projects, (i) the presence of dormant defects decreases the recall of defect prediction classifiers, and (ii) removing from the training set the classes that are labeled as not defective in the last release significantly improves the accuracy of the classifiers. In summary, this article provides insights on how to create defect datasets that mitigate the negative effect of dormant defects on defect prediction.
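
The countermeasure described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `ClassRecord` type, the `drop_recent_negatives` function, and the `last_n` parameter are hypothetical names introduced here for illustration, and the sketch assumes releases are identified by increasing integers. It drops from a training set every record that belongs to the most recent release(s) and is labeled as not defective, since those labels are the ones most likely to hide dormant defects.

```python
from typing import List, NamedTuple


class ClassRecord(NamedTuple):
    """One labeled class in one release (hypothetical schema)."""
    release: int     # release index; a higher value means a more recent release
    name: str        # class name
    defective: bool  # label as observed when the dataset was built


def drop_recent_negatives(records: List[ClassRecord], last_n: int = 1) -> List[ClassRecord]:
    """Drop records from the last `last_n` releases that are labeled not defective.

    Records labeled defective are always kept, as are all records
    from older releases.
    """
    if not records:
        return []
    newest = max(r.release for r in records)
    cutoff = newest - last_n  # releases strictly above the cutoff count as recent
    return [r for r in records if r.release <= cutoff or r.defective]


training = [
    ClassRecord(1, "Parser", False),
    ClassRecord(1, "Lexer", True),
    ClassRecord(2, "Parser", False),  # recent and not defective: possibly snoring
    ClassRecord(2, "Cache", True),
]
filtered = drop_recent_negatives(training)
```

With the default `last_n=1`, only the not-defective record from release 2 is removed; the defective record from release 2 and both records from release 1 remain in `filtered`.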

Link to Publication

https://dl.acm.org/doi/10.1145/3467895

DOI

https://doi.org/10.1145/3467895

Davide Falessi

University of Rome Tor Vergata, Italy

Aalok Ahluwalia

California Polytechnic State University

Massimiliano Di Penta

University of Sannio, Italy

Session Program

Tue 10 May
Displayed time zone: Eastern Time (US & Canada)

04:00 - 05:00	Software Testing 1 [Technical Track / Journal-First Papers] at ICSE room 2. Chair(s): Ajitha Rajan (University of Edinburgh)

5m Talk		The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects [Journal-First Papers] Davide Falessi (University of Rome Tor Vergata, Italy), Aalok Ahluwalia (California Polytechnic State University), Massimiliano Di Penta (University of Sannio, Italy)
5m Talk		Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects [Journal-First Papers] Steffen Herbold (TU Clausthal), Tobias Haar (University of Goettingen)
5m Talk		RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems [Journal-First Papers] Jianmin Guo (Tsinghua University), Quan Zhang (Tsinghua University), Yue Zhao (Huawei Technologies Co., Ltd.), Heyuan Shi (Central South University), Yu Jiang (Tsinghua University), Jia-Guang Sun
5m Talk		Adaptive Test Selection for Deep Neural Networks [Technical Track] Xinyu Gao (Nanjing University), Yang Feng (Nanjing University), Yining Yin (Nanjing University, China), Zixi Liu (Nanjing University), Zhenyu Chen (Nanjing University), Baowen Xu (Nanjing University)
5m Talk		Evaluating and Improving Neural Program-Smoothing-based Fuzzing [Technical Track] Mingyuan Wu (Southern University of Science and Technology), Ling Jiang (Southern University of Science and Technology), Jiahong Xiang (Southern University of Science and Technology), Yuqun Zhang (Southern University of Science and Technology), Guowei Yang (The University of Queensland), Huixin Ma (Tencent Security Keen Lab), Sen Nie (Keen Security Lab, Tencent), Shi Wu (Tencent Security Keen Lab), Heming Cui (University of Hong Kong), Lingming Zhang (University of Illinois at Urbana-Champaign)
5m Talk		Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing [Technical Track] Jiazhen Gu (Fudan University, China), Xuchuan Luo (Fudan University), Yangfan Zhou (Fudan University), Xin Wang (Fudan University)

Wed 11 May
Displayed time zone: Eastern Time (US & Canada)

21:00 - 22:00	Software Testing 7 [Journal-First Papers / Technical Track] at ICSE room 4. Chair(s): Upsorn Praphamontripong (Computer Science, University of Virginia)

5m Talk		A Family of Experiments on Test-Driven Development [Journal-First Papers] Adrian Santos Parrilla (University of Oulu), Sira Vegas (Universidad Politecnica de Madrid), Oscar Dieste (Universidad Politécnica de Madrid), Fernando Uyaguari (ETAPA Telecommunications Company), Ayse Tosun (Istanbul Technical University), Davide Fucci (Blekinge Institute of Technology), Burak Turhan (University of Oulu), Giuseppe Scanniello (University of Basilicata), Simone Romano (University of Bari), Itir Karac (University of Oulu), Marco Kuhrmann (Reutlingen University), Vladimir Mandić (Faculty of Technical Sciences, University of Novi Sad), Robert Ramač (Faculty of Technical Sciences, University of Novi Sad), Dietmar Pfahl (University of Tartu), Christian Engblom (Ericsson), Jarno Kyykka (Ericsson), Kerli Rungi (Testlio), Carolina Palomeque (ETAPA Telecommunications Company), Jaroslav Spisak (PAF), Markku Oivo (University of Oulu), Natalia Juristo (Universidad Politecnica de Madrid)
5m Talk		The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects [Journal-First Papers] Davide Falessi (University of Rome Tor Vergata, Italy), Aalok Ahluwalia (California Polytechnic State University), Massimiliano Di Penta (University of Sannio, Italy)
5m Talk		RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems [Journal-First Papers] Jianmin Guo (Tsinghua University), Quan Zhang (Tsinghua University), Yue Zhao (Huawei Technologies Co., Ltd.), Heyuan Shi (Central South University), Yu Jiang (Tsinghua University), Jia-Guang Sun
5m Talk		DeepState: Selecting Test Suites to Enhance the Robustness of Recurrent Neural Networks [Technical Track] Zixi Liu (Nanjing University), Yang Feng (Nanjing University), Yining Yin (Nanjing University, China), Zhenyu Chen (Nanjing University)
5m Talk		Evaluating and Improving Neural Program-Smoothing-based Fuzzing [Technical Track] Mingyuan Wu (Southern University of Science and Technology), Ling Jiang (Southern University of Science and Technology), Jiahong Xiang (Southern University of Science and Technology), Yuqun Zhang (Southern University of Science and Technology), Guowei Yang (The University of Queensland), Huixin Ma (Tencent Security Keen Lab), Sen Nie (Keen Security Lab, Tencent), Shi Wu (Tencent Security Keen Lab), Heming Cui (University of Hong Kong), Lingming Zhang (University of Illinois at Urbana-Champaign)
5m Talk		Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing [Technical Track] Jiazhen Gu (Fudan University, China), Xuchuan Luo (Fudan University), Yangfan Zhou (Fudan University), Xin Wang (Fudan University)

Thu 26 May
Displayed time zone: Eastern Time (US & Canada)

09:00 - 10:30	Papers 11: Release Engineering and DevOps [Technical Track / Journal-First Papers] at Room 304+305. Chair(s): Andy Zaidman (Delft University of Technology)

09:00 5m Talk		An Empirical Study on Release Notes Patterns of Popular Apps in the Google Play Store [Journal-First Papers] Aidan Z.H. Yang (Carnegie Mellon University), Safwat Hassan (Thompson Rivers University), Ying Zou (Queen's University, Kingston, Ontario), Ahmed E. Hassan (Queen's University)
09:05 5m Talk		Within-project Defect Prediction of Infrastructure-as-Code Using Product and Process Metrics [Journal-First Papers] Stefano Dalla Palma (Tilburg University), Dario Di Nucci (University of Salerno), Fabio Palomba (University of Salerno), Damian Andrew Tamburri (TU/e)
09:10 5m Talk		Change Is the Only Constant: Dynamic Updates for Workflows [Technical Track, Best Artifact Award] Daniel Sokolowski (University of St. Gallen), Pascal Weisenburger (University of St. Gallen), Guido Salvaneschi (University of St. Gallen)
09:15 5m Talk		GitHub Discussions: An exploratory study of early adoption [Journal-First Papers] Hideaki Hata (Shinshu University), Nicole Novielli (University of Bari), Sebastian Baltes (SAP SE & University of Adelaide), Raula Gaikovina Kula (Nara Institute of Science and Technology), Christoph Treude (University of Melbourne)
09:20 5m Talk		"Did You Miss My Comment or What?" Understanding Toxicity in Open Source Discussions [Technical Track, Distinguished Paper Award] Courtney Miller (Carnegie Mellon University), Sophie Cohen (Wesleyan University), Daniel Klug (Carnegie Mellon University), Bogdan Vasilescu (Carnegie Mellon University, USA), Christian Kästner (Carnegie Mellon University)
09:25 5m Talk		"This Is Damn Slick!" Estimating the Impact of Tweets on Open Source Project Popularity and New Contributors [Technical Track, Distinguished Paper Award] Hongbo Fang (Carnegie Mellon University), Hemank Lamba (Carnegie Mellon University), Jim Herbsleb (Carnegie Mellon University), Bogdan Vasilescu (Carnegie Mellon University, USA)
09:30 5m Talk		The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects [Journal-First Papers] Davide Falessi (University of Rome Tor Vergata, Italy), Aalok Ahluwalia (California Polytechnic State University), Massimiliano Di Penta (University of Sannio, Italy)
09:35 5m Talk		Continuously Managing NFRs: Opportunities and Challenges in Practice [Journal-First Papers] Colin Werner (University of Victoria), Ze Shi (Zane) Li (University of Victoria, Canada), Derek Lowlind (University of Victoria), Omar Elazhary (University of Victoria), Neil Ernst (University of Victoria), Daniela Damian (University of Victoria)

Information for Participants

Tue 10 May 2022 04:00 - 05:00 at ICSE room 2 - Software Testing 1 Chair(s): Ajitha Rajan

Wed 11 May 2022 21:00 - 22:00 at ICSE room 4 - Software Testing 7 Chair(s): Upsorn Praphamontripong
