BuildSheriff: Change-Aware Test Failure Triage for Continuous Integration Builds (ICSE 2022 - Technical Track)

Write a Blog >>

Sun 8 - Fri 27 May 2022

Who

Chen Zhang, Bihuan Chen, Xin Peng, Wenyun Zhao

Track

ICSE 2022 Technical Track

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 10 May 2022 05:20 - 05:25 at ICSE room 3 - Software Testing 2 Chair(s): Aldeida Aleti
Tue 10 May 2022 20:25 - 20:30 at ICSE room 2 - Software Testing 5 Chair(s): Leonardo Sousa

Abstract

Test failures are one of the most common reasons for broken builds in continuous integration. It is expensive to diagnose all test failures in a build. As test failures are usually caused by a few underlying faults, triaging test failures with respect to their underlying root causes can reduce test failure diagnose cost. Existing failure triage methods are mostly developed for triaging crash or bug reports, and hence not applicable in the context of test failure triage in continuous integration. In this paper, we first present a large-scale empirical study on 163,371 broken builds caused by test failures to characterize test failures in real-world Java projects. Then, motivated by our study, we propose a new change-aware approach, BuildSheriff, to triage test failures in each continuous integration build such that test failures with the same root cause are put in the same cluster. Specifically, we design a pipeline of three strategies based on complexity of code changes, change-aware stack trace similarity, and exception message similarity to triage one type of test failures, i.e., exception failures, and a pipeline of two strategies based on complexity of code changes and change-aware test code similarity to triage the other type, i.e., assertion failures. Our evaluation on 200 broken builds has demonstrated that BuildSheriff can significantly improve the state-of-the-art methods.

Link to Preprint

https://chenbihuan.github.io/paper/icse22-zhang-buildsheriff.pdf

Chen Zhang

Fudan University

Bihuan Chen

Fudan University, China

China

Xin Peng

Fudan University

China

Wenyun Zhao

Fudan University, China

BuildSheriff: Change-Aware Test Failure Triage for Continuous Integration Builds

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 10 May
Displayed time zone: Eastern Time (US & Canada) change

05:00 - 06:00	Software Testing 2SEIP - Software Engineering in Practice / Technical Track / Journal-First Papers at ICSE room 3 Chair(s): Aldeida Aleti Monash University

5m Talk		Reinforcement Learning for Test Case Prioritization Journal-First Papers Mojtaba Bagherzadeh University of Ottawa, Nafiseh Kahani , Lionel Briand University of Luxembourg; University of Ottawa Link to publication DOI Pre-print Media Attached
5m Talk		Build System Aware Multi-language Regression Test Selection in Continuous Integration SEIP - Software Engineering in Practice Daniel Elsner TU Munich, Roland Würsching Technical University of Munich, Markus Schnappinger , Alexander Pretschner TU Munich, Maria Graber IVU Traffic Technologies, René Dammer IVU Traffic Technologies, Silke Reimer IVU Traffic Technologies DOI Pre-print Media Attached
5m Talk		Testing Machine Learning Systems in Industry: An Empirical Study SEIP - Software Engineering in Practice Shuyue Li Xi'an Jiaotong University, Jiaqi Guo Xi'an Jiaotong University, Jian-Guang Lou Microsoft Research, Ming Fan Xi'an Jiaotong University, Ting Liu Xi'an Jiaotong University, Dongmei Zhang Microsoft Research DOI Pre-print Media Attached
5m Talk		GIFdroid: Automated Replay of Visual Bug Reports for Android Apps Technical Track Sidong Feng Monash University, Chunyang Chen Monash University DOI Pre-print Media Attached
5m Talk		BuildSheriff: Change-Aware Test Failure Triage for Continuous Integration Builds Technical Track Chen Zhang Fudan University, Bihuan Chen Fudan University, China, Xin Peng Fudan University, Wenyun Zhao Fudan University, China Pre-print Media Attached
5m Talk		Natural Attack for Pre-trained Models of Code Technical Track Zhou Yang Singapore Management University, Jieke Shi Singapore Management University, Junda He Singapore Management University, David Lo Singapore Management University DOI Pre-print Media Attached

20:00 - 21:00	Software Testing 5SEIP - Software Engineering in Practice / Journal-First Papers / Technical Track at ICSE room 2 Chair(s): Leonardo Sousa

5m Talk		Demystifying the Challenges and Benefits of Analyzing User-Reported Logs in Bug Reports Journal-First Papers An Ran Chen Concordia University, Tse-Hsun (Peter) Chen Concordia University, Shaowei Wang University of Manitoba Link to publication Media Attached
5m Talk		Reinforcement Learning for Test Case Prioritization Journal-First Papers Mojtaba Bagherzadeh University of Ottawa, Nafiseh Kahani , Lionel Briand University of Luxembourg; University of Ottawa Link to publication DOI Pre-print Media Attached
5m Talk		Surveying the Developer Experience of Flaky Tests SEIP - Software Engineering in Practice Owain Parry The University of Sheffield, Gregory Kapfhammer Allegheny College, Michael Hilton Carnegie Mellon University, USA, Phil McMinn University of Sheffield Pre-print Media Attached
5m Talk		The Impact of Flaky Tests on Historical Test Prioritization on Chrome SEIP - Software Engineering in Practice Emad Fallahzadeh Concordia University, Peter Rigby Concordia University, Montreal, Canada Pre-print Media Attached
5m Talk		Demystifying the Dependency Challenge in Kernel Fuzzing Technical Track Yu Hao University of California at Riverside, USA, Hang Zhang Georgia Institute of Technology, Guoren Li UC Riverside, Xingyun Du UC Riverside, Zhiyun Qian University of California at Riverside, USA, Ardalan Amiri Sani UC Irvine Pre-print Media Attached
5m Talk		BuildSheriff: Change-Aware Test Failure Triage for Continuous Integration Builds Technical Track Chen Zhang Fudan University, Bihuan Chen Fudan University, China, Xin Peng Fudan University, Wenyun Zhao Fudan University, China Pre-print Media Attached

Information for Participants

Tue 10 May 2022 05:00 - 06:00 at ICSE room 3 - Software Testing 2 Chair(s): Aldeida Aleti

Info for room ICSE room 3-odd hours:

Click here to go to the room on Midspace

Tue 10 May 2022 20:00 - 21:00 at ICSE room 2 - Software Testing 5 Chair(s): Leonardo Sousa

Info for room ICSE room 2-even hours:

Click here to go to the room on Midspace