New convex-based metamorphic relations and large-scale machine learning model evaluation
Machine learning (ML) models are victims of the oracle problem, i.e. it is not possible to know with absolute confidence an output for a given input. This prevents them from being evaluated using conventional software engineering techniques. However, there is an approach called “metamorphic relation” that reduces the oracle problem and helps to evaluate ML models. Unlike conventional tests, a metamorphic relation does not check if an input produces a specific output, but checks if a relationship between inputs and outputs is respected. Naturally, metamorphic relations have already been proposed in the literature, either to evaluate the behavior of a specific ML model, or to evaluate the general behavior of any ML model. The purpose of this paper is to propose new metamorphic relations to complement those of the literature, in order to propose a more complete methodology for evaluating ML models. So, in order to challenge this methodology, all these metamorphic relations are used to evaluate 21 different machine learning algorithms.
Wed 17 SepDisplayed time zone: Athens change
14:00 - 15:30 | Metrics and Human-Centric Approaches to TestingGeneral Track at Atrium C Chair(s): Nina Yevtushenko Ivannikov Institute for System Programming of the RussianAcademy of Sciences | ||
14:00 30mTalk | Time for Quiescence: Modelling Quiescent Behaviour in Testing via Time-outs in Timed Automata General Track Laura Brandán Briones Universidad Nacional de Córdoba, Marcus Gerhold University of Twente, The Netherlands, Petra van den Bos University of Twente, The Netherlands, Marielle Stoelinga University of Twente and Radboud University, Nijmegen | ||
14:30 30mTalk | Enhancing Path Testing with Eye-Tracking: A Human-Centric Approach to Functional Software Testing General Track Angelos Fotopoulos University of Patras, Fezo Metsi University of Patras, Michalis Xenos University of Patras | ||
15:00 30mTalk | New convex-based metamorphic relations and large-scale machine learning model evaluation General Track Jessy Colonval Université Marie et Louis Pasteur, CNRS, institut FEMTO-ST(UMR 6174), F-25000, Fabrice Bouquet University of Bourgogne Franche-Comté | ||