Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects
Tue 10 May 2022, 12:00 - 12:05, at ICSE room 2-even hours - Software Testing 11. Chair(s): Jonathan Bell
Machine learning is nowadays a standard technique for data analysis within software applications. Software engineers need quality assurance techniques that are suitable for these new kinds of systems. Within this article, we discuss the question of whether standard software testing techniques that have been part of textbooks for decades are also useful for testing machine learning software. Concretely, we try to determine generic and simple smoke tests that can be used to assert that basic functions can be executed without crashing. We found that we can derive such tests using techniques similar to equivalence classes and boundary value analysis. Moreover, we found that these concepts can also be applied to hyperparameters, to further improve the quality of the smoke tests. Even though our approach is almost trivial, we were able to find bugs in all three machine learning libraries that we tested, and severe bugs in two of the three libraries. This demonstrates that common software testing techniques are still valid in the age of machine learning, and that considering how they can be adapted to this new context can help find and prevent severe bugs, even in mature machine learning libraries.
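To make the idea concrete, here is a minimal sketch (not the authors' artifact) of such a smoke test in Python. The choices of library, estimator, and specific equivalence classes and boundary values (scikit-learn's LogisticRegression, degenerate and extreme data, extreme values of the regularization strength C) are illustrative assumptions; the pattern is simply to run the basic fit/predict cycle over these cases and flag any crash.

```python
# A minimal sketch of the smoke-testing idea: exercise a learner's basic
# train/predict cycle on tiny inputs drawn from equivalence classes and
# boundary values for both the data and a hyperparameter, asserting only
# that nothing crashes. Library and estimator are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Equivalence classes / boundary values for the input data.
data_cases = {
    "typical": np.array([[0.0, 1.0], [1.0, 0.0], [0.5, 0.5], [1.0, 1.0]]),
    "all_zero": np.zeros((4, 2)),           # boundary: degenerate features
    "huge_values": np.full((4, 2), 1e300),  # boundary: near float overflow
    "tiny_values": np.full((4, 2), 1e-300), # boundary: near float underflow
}
labels = np.array([0, 1, 0, 1])

# Boundary values for a hyperparameter (regularization strength C > 0).
hyperparameter_cases = {"minimal": 1e-10, "default": 1.0, "maximal": 1e10}

failures = []
for data_name, X in data_cases.items():
    for hp_name, c in hyperparameter_cases.items():
        try:
            model = LogisticRegression(C=c)
            model.fit(X, labels)   # smoke test: training must not crash ...
            model.predict(X)       # ... and neither must prediction
        except Exception as exc:   # any crash is a smoke-test failure
            failures.append((data_name, hp_name, exc))

for case in failures:
    print("smoke test failed:", case)
```

Because the test only asserts "does not crash", the same loop can be wrapped around any estimator that exposes a train/predict interface, which is what makes this kind of smoke test generic across libraries.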
Tue 10 May. Displayed time zone: Eastern Time (US & Canada).
04:00 - 05:00 | Software Testing 1 | Technical Track / Journal-First Papers | ICSE room 2-even hours | Chair(s): Ajitha Rajan (University of Edinburgh)

04:00 (5m talk) | The Impact of Dormant Defects on Defect Prediction: a Study of 19 Apache Projects | Journal-First Papers | Davide Falessi (University of Rome Tor Vergata, Italy), Aalok Ahluwalia (California Polytechnic State University), Massimiliano Di Penta (University of Sannio, Italy)

04:05 (5m talk) | Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects | Journal-First Papers

04:10 (5m talk) | RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems | Journal-First Papers | Jianmin Guo (Tsinghua University), Quan Zhang (Tsinghua University), Yue Zhao (Huawei Technologies Co., Ltd.), Heyuan Shi (Central South University), Yu Jiang (Tsinghua University), Jia-Guang Sun

04:15 (5m talk) | Adaptive Test Selection for Deep Neural Networks | Technical Track | Xinyu Gao (Nanjing University), Yang Feng (Nanjing University), Yining Yin (Nanjing University, China), Zixi Liu (Nanjing University), Zhenyu Chen (Nanjing University), Baowen Xu (Nanjing University)

04:20 (5m talk) | Evaluating and Improving Neural Program-Smoothing-based Fuzzing | Technical Track | Mingyuan Wu (Southern University of Science and Technology), Ling Jiang (Southern University of Science and Technology), Jiahong Xiang (Southern University of Science and Technology), Yuqun Zhang (Southern University of Science and Technology), Guowei Yang (The University of Queensland), Huixin Ma (Tencent Security Keen Lab), Sen Nie (Keen Security Lab, Tencent), Shi Wu (Tencent Security Keen Lab), Heming Cui (University of Hong Kong), Lingming Zhang (University of Illinois at Urbana-Champaign)

04:25 (5m talk) | Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing | Technical Track | Jiazhen Gu (Fudan University, China), Xuchuan Luo (Fudan University), Yangfan Zhou (Fudan University), Xin Wang (Fudan University)