The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects
Wed 11 May 2022 21:05 - 21:10 at ICSE room 4-odd hours - Software Testing 7 Chair(s): Upsorn Praphamontripong
Thu 26 May 2022 09:30 - 09:35 at Room 304+305 - Papers 11: Release Engineering and DevOps Chair(s): Andy Zaidman
Defect prediction models can be beneficial to prioritize testing, analysis, or code review activities, and has been the subject of a substantial effort in academia, and some applications in industrial contexts. A necessary precondition when creating a defect prediction model is the availability of defect data from the history of projects. If this data is noisy, the resulting defect prediction model could result to be unreliable. One of the causes of noise for defect datasets is the presence of “dormant defects,” i.e., of defects discovered several releases after their introduction. This can cause a class to be labeled as defect-free while it is not, and is, therefore “snoring.” In this article, we investigate the impact of snoring on classifiers’ accuracy and the effectiveness of a possible countermeasure, i.e., dropping too recent data from a training set. We analyze the accuracy of 15 machine learning defect prediction classifiers, on data from more than 4,000 defects and 600 releases of 19 open source projects from the Apache ecosystem. Our results show that on average across projects (i) the presence of dormant defects decreases the recall of defect prediction classifiers, and (ii) removing from the training set the classes that in the last release are labeled as not defective significantly improves the accuracy of the classifiers. In summary, this article provides insights on how to create defects datasets by mitigating the negative effect of dormant defects on defect prediction.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
04:00 - 05:00 | Software Testing 1Technical Track / Journal-First Papers at ICSE room 2-even hours Chair(s): Ajitha Rajan University of Edinburgh | ||
04:00 5mTalk | The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects Journal-First Papers Davide Falessi University of Rome Tor Vergata, Italy, Aalok Ahluwalia California Polytechnic State University, Massimiliano Di Penta University of Sannio, Italy Link to publication DOI Media Attached | ||
04:05 5mTalk | Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects Journal-First Papers DOI Media Attached | ||
04:10 5mTalk | RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems Journal-First Papers Jianmin Guo Tsinghua University, Quan Zhang Tsinghua University, Yue Zhao Huawei Technologies Co., Ltd., Heyuan Shi Central South University, Yu Jiang Tsinghua University, Jia-Guang Sun Link to publication DOI Pre-print Media Attached | ||
04:15 5mTalk | Adaptive Test Selection for Deep Neural Networks Technical Track Xinyu Gao Nanjing University, Yang Feng Nanjing University, Yining Yin Nanjing University, China, Zixi Liu Nanjing University, Zhenyu Chen Nanjing University, Baowen Xu Nanjing University Pre-print Media Attached | ||
04:20 5mTalk | Evaluating and Improving Neural Program-Smoothing-based Fuzzing Technical Track Mingyuan Wu Southern University of Science and Technology, Ling Jiang Southern University of Science and Technology, Jiahong Xiang Southern University of Science and Technology, Yuqun Zhang Southern University of Science and Technology, Guowei Yang The University of Queensland, Huixin Ma Tencent Security Keen Lab, Sen Nie Keen Security Lab, Tencent, Shi Wu Tencent Security Keen Lab, Heming Cui University of Hong Kong, Lingming Zhang University of Illinois at Urbana-Champaign DOI Pre-print Media Attached | ||
04:25 5mTalk | Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing Technical Track Jiazhen Gu Fudan University, China, Xuchuan Luo Fudan University, Yangfan Zhou Fudan University, Xin Wang Fudan University Pre-print Media Attached |
Wed 11 MayDisplayed time zone: Eastern Time (US & Canada) change
21:00 - 22:00 | Software Testing 7Journal-First Papers / Technical Track at ICSE room 4-odd hours Chair(s): Upsorn Praphamontripong Computer Science, University of Virginia | ||
21:00 5mTalk | A Family of Experiments on Test-Driven Development Journal-First Papers Adrian Santos Parrilla University of Oulu, Sira Vegas Universidad Politecnica de Madrid, Oscar Dieste Universidad Politécnica de Madrid, Fernando Uyaguari ETAPA Telecommunications Company, Ayse Tosun Istanbul Technical University, Davide Fucci Blekinge Institute of Technology, Burak Turhan University of Oulu, Giuseppe Scanniello University of Basilicata, Simone Romano University of Bari, Itir Karac University of Oulu, Marco Kuhrmann Reutlingen University, Vladimir Mandić Faculty of Technical Sciences, University of Novi Sad, Robert Ramač Faculty of Technical Sciences, University of Novi Sad, Dietmar Pfahl University of Tartu, Christian Engblom Ericsson, Jarno Kyykka Ericsson, Kerli Rungi Testlio, Carolina Palomeque ETAPA Telecommunications Company, Jaroslav Spisak PAF, Markku Oivo University of Oulu, Natalia Juristo Universidad Politecnica de Madrid Link to publication DOI Pre-print Media Attached | ||
21:05 5mTalk | The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects Journal-First Papers Davide Falessi University of Rome Tor Vergata, Italy, Aalok Ahluwalia California Polytechnic State University, Massimiliano Di Penta University of Sannio, Italy Link to publication DOI Media Attached | ||
21:10 5mTalk | RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems Journal-First Papers Jianmin Guo Tsinghua University, Quan Zhang Tsinghua University, Yue Zhao Huawei Technologies Co., Ltd., Heyuan Shi Central South University, Yu Jiang Tsinghua University, Jia-Guang Sun Link to publication DOI Pre-print Media Attached | ||
21:15 5mTalk | DeepState: Selecting Test Suites to Enhance the Robustness of Recurrent Neural Networks Technical Track Zixi Liu Nanjing University, Yang Feng Nanjing University, Yining Yin Nanjing University, China, Zhenyu Chen Nanjing University DOI Pre-print Media Attached | ||
21:20 5mTalk | Evaluating and Improving Neural Program-Smoothing-based Fuzzing Technical Track Mingyuan Wu Southern University of Science and Technology, Ling Jiang Southern University of Science and Technology, Jiahong Xiang Southern University of Science and Technology, Yuqun Zhang Southern University of Science and Technology, Guowei Yang The University of Queensland, Huixin Ma Tencent Security Keen Lab, Sen Nie Keen Security Lab, Tencent, Shi Wu Tencent Security Keen Lab, Heming Cui University of Hong Kong, Lingming Zhang University of Illinois at Urbana-Champaign DOI Pre-print Media Attached | ||
21:25 5mTalk | Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing Technical Track Jiazhen Gu Fudan University, China, Xuchuan Luo Fudan University, Yangfan Zhou Fudan University, Xin Wang Fudan University Pre-print Media Attached |