PROFES 2023
Sun 10 - Wed 13 December 2023 Dornbirn, Austria
Wed 13 Dec 2023 11:00 - 11:10 at W211 - Software Testing and Quality Assurance Chair(s): Dietmar Pfahl

Background: ROC (Receiver Operating Characteristic) curves are widely used to represent the performance (i.e., degree of correctness) of fault proneness models. AUC, the Area Under the ROC Curve is a quite popular performance metric, which summarizes into a single number the goodness of the predictions represented by the ROC curve. Alternative techniques have been proposed for evaluating the performance represented by a ROC curve: among these are RRA (Ratio of Relevant Areas) and φ (alias Matthews Correlation Coefficient).

Objectives: In this paper, we aim at evaluating AUC as a performance metric, also with respect to alternative proposals.

Method: We carry out an empirical study by replicating a previously published fault prediction study and measuring the performance of the obtained faultiness models using AUC, RRA, and a recently proposed way of relating a specific kind of ROC curves to φ, based on iso-φ ROC curves, i.e., ROC curves with constant φ. We take into account prevalence, i.e., the proportion of faulty modules in the dataset that is the object of predictions.

Results: AUC appears to provide indications that are concordant with φ for fairly balanced datasets, while it is much more optimistic than φ for quite imbalanced datasets. RRA’s indications appear to be moderately affected by the degree of balance in a dataset. In addition, RRA appears to agree with φ.

Conclusions: Based on the collected evidence, AUC does not seem to be suitable for evaluating the performance of fault proneness models when used with imbalanced datasets. In these cases, using RRA can be a better choice.

Wed 13 Dec

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

11:00 - 12:20
Software Testing and Quality AssuranceResearch Papers / Organization / Short Papers and Posters / Industry Papers at W211
Chair(s): Dietmar Pfahl University of Tartu
11:00
10m
Research paper
An Experience in the Evaluation of Fault Prediction
Research Papers
Luigi Lavazza Università degli Studi dell'Insubria, Sandro Morasca Università degli Studi dell'Insubria, Gabriele Rotoloni
11:10
10m
Industry talk
Is It the Best Solution? Testing An Optimisation Algorithm with Metamorphic Testing
Industry Papers
Alejandra Duque-Torres University of Tartu, Claus Klammer Software Competence Center Hagenberg, Stefan Fischer Software Competence Center Hagenberg, Dietmar Pfahl University of Tartu
11:20
10m
Short-paper
Impacts of Program Structures on Code Coverage of Generated Test Suites
Short Papers and Posters
Ryoga Watanabe Osaka University, Yoshiki Higo Osaka University, Shinji Kusumoto Osaka University
11:30
10m
Research paper
Anomaly Detection Through Container Testing: A Survey of Company Practices
Research Papers
Salla Timonen University of Jyväskylä, Maha Sroor University of Jyväskylä, Rahul Mohanani University of Jyväskylä, Tommi Mikkonen University of Jyvaskyla
11:40
10m
Short-paper
The Effects of Soft Assertion on Spectrum-based Fault Localization
Short Papers and Posters
Kouhei Mihara Osaka University, Shinsuke Matsumoto Osaka University, Shinji Kusumoto Osaka University
11:50
10m
Industry talk
Characterizing Requirements Smells
Industry Papers
Emanuele Gentili Mbda Italy S.p.a., Falessi Davide University of Rome Tor Vergata
12:00
10m
Research paper
Do Exceptional Behavior Tests Matter on Spectrum-based Fault Localization?
Research Papers
Haruka Yoshioka Osaka University, Yoshiki Higo Osaka University, Shinsuke Matsumoto Osaka University, Shinji Kusumoto Osaka University, Shinji Itoh Hitachi, Ltd., Research &Development Group, Phan Thi Thanh Huyen Hitachi, Ltd., Research &Development Group
12:10
10m
Research paper
On Deprecated API Usages: an Exploratory Study of Top-starred Projects on GitHub
Research Papers
Pietro Cassieri University of Salerno, Simone Romano University of Salerno, Giuseppe Scanniello University of Salerno