ISSTA 2019
Mon 15 - Fri 19 July 2019 Beijing, China
Thu 18 Jul 2019 11:22 - 11:45 at Grand Ballroom - Regression Testing Chair(s): Dan Hao

Mutation testing is widely used in research as a metric for evaluating the quality of test suites. However, traditional mutation testing assumes tests to exhibit deterministic behavior, in terms of their coverage and the outcome of a test (not) killing a certain mutant. Such an assumption does not hold in the presence of flaky tests, whose outcomes can non-deterministically differ even when run on the same code under test. Almost all modern software projects have some flaky tests. Without reliable test outcomes, mutation testing can result in unreliable results, e.g., in our experiments, mutation scores vary by 5 percentage points on average between repeated executions, and the difference in mutant-test pairs was 10 percentage points on average. We propose an advanced technique that better controls for flakiness throughout the mutation testing process. We implement our techniques by modifying the popular open-source tool, PIT. We evaluate our modifications on 30 open-source projects, finding that our technique can increase developers’ confidence in mutation results in the presence of flaky tests by almost entirely eliminating the number of “unknown” (flaky) mutants.

Thu 18 Jul

Displayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change

11:00 - 12:30
Regression TestingTechnical Papers at Grand Ballroom
Chair(s): Dan Hao Peking University
11:00
22m
Talk
Root Causing Flaky Tests in a Large-scale Industrial Setting
Technical Papers
Wing Lam University of Illinois at Urbana-Champaign, Patrice Godefroid Microsoft Research, Suman Nath Microsoft Corporation, Anirudh Santhiar Indian Institute of Science, Suresh Thummalapenta
11:22
22m
Talk
Mitigating the Effects of Flaky Tests on Mutation Testing
Technical Papers
August Shi University of Illinois at Urbana-Champaign, Jonathan Bell George Mason University, Darko Marinov University of Illinois at Urbana-Champaign
Pre-print Media Attached
11:45
22m
Talk
Assessing the State and Improving the Art of Parallel Testing for CArtifacts ReusableArtifacts Functional
Technical Papers
Oliver Schwahn TU Darmstadt, Nicolas Coppik TU Darmstadt, Stefan Winter TU Darmstadt, Neeraj Suri
12:07
22m
Talk
Failure Clustering Without Coverage
Technical Papers