Write a Blog >>
ICSE 2023
Sun 14 - Sat 20 May 2023 Melbourne, Australia
Fri 19 May 2023 14:15 - 14:30 at Meeting Room 110 - Issue reporting and reproduction Chair(s): Daniel Russo

Software defect datasets are crucial to facilitating the evaluation and comparison of techniques in fields such as fault localization, test generation, and automated program repair. However, the reproducibility of software defect artifacts is not immune to breakage. In this paper, we conduct a study on the reproducibility of software defect artifacts. First, we study five state-of-the-art Java defect datasets. Despite the multiple strategies applied by dataset maintainers to ensure reproducibility, all datasets are prone to breakages. Second, we conduct a case study in which we systematically test the reproducibility of 1,795 software artifacts during a 13-month period. We find that 62.6% of the artifacts break at least once, and 15.3% artifacts break multiple times. We manually investigate the root causes of breakages and handcraft 10 patches, which are automatically applied to 1,055 distinct artifacts in 2,948 fixes. Based on the nature of the root causes, we propose automated dependency caching and artifact isolation to prevent further breakage. In particular, we show that isolating artifacts to eliminate external dependencies increases reproducibility to 95% or higher, which is on par with the level of reproducibility exhibited by the most reliable manually curated dataset.

Fri 19 May

Displayed time zone: Hobart change

13:45 - 15:15
Issue reporting and reproductionTechnical Track / DEMO - Demonstrations at Meeting Room 110
Chair(s): Daniel Russo Department of Computer Science, Aalborg University
13:45
15m
Talk
Incident-aware Duplicate Ticket Aggregation for Cloud Systems
Technical Track
Jinyang Liu The Chinese University of Hong Kong, Shilin He Microsoft Research, Zhuangbin Chen Chinese University of Hong Kong, China, Liqun Li Microsoft Research, Yu Kang Microsoft Research, Xu Zhang Microsoft Research, Pinjia He Chinese University of Hong Kong at Shenzhen, Hongyu Zhang The University of Newcastle, Qingwei Lin Microsoft Research, Zhangwei Xu Microsoft Azure, Saravan Rajmohan Microsoft 365, Dongmei Zhang Microsoft Research, Michael Lyu The Chinese University of Hong Kong
14:00
15m
Talk
Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction
Technical Track
Sungmin Kang KAIST, Juyeon Yoon Korea Advanced Institute of Science and Technology, Shin Yoo KAIST
Pre-print
14:15
15m
Talk
On the Reproducibility of Software Defect Datasets
Technical Track
Hao-Nan Zhu University of California, Davis, Cindy Rubio-González University of California at Davis
14:30
15m
Talk
Context-aware Bug Reproduction for Mobile Apps
Technical Track
Yuchao Huang , Junjie Wang Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Zhe Liu Institute of Software, Chinese Academy of Sciences, Song Wang York University, Chunyang Chen Monash University, Mingyang Li Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Qing Wang Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences
14:45
15m
Talk
Read It, Don't Watch It: Captioning Bug Recordings Automatically
Technical Track
Sidong Feng Monash University, Mulong Xie Australian National University, Yinxing Xue University of Science and Technology of China, Chunyang Chen Monash University
Pre-print
15:00
7m
Talk
BURT: A Chatbot for Interactive Bug Reporting
DEMO - Demonstrations
Yang Song College of William and Mary, Junayed Mahmud George Mason University, Nadeeshan De Silva William & Mary, Ying Zhou University of Texas at Dallas, Oscar Chaparro College of William and Mary, Kevin Moran George Mason University, Andrian Marcus University of Texas at Dallas, Denys Poshyvanyk College of William and Mary