Pose estimation systems are used in a variety of fields, from sports analytics to livestock care. Given their potential impact, it is paramount to systematically test their behaviour and potential for failure. This is a complex task due to the oracle problem and the high cost of manual labelling necessary to build ground truth keypoints. This problem is, in turn, exacerbated by the fact that different applications will require systems to focus on different subjects (e.g., human versus animal) or landmarks (e.g., only extremities versus whole body and face), which makes labelled test data rarely reusable. To combat these problems we propose MeT-Pose, a metamorphic testing framework for pose estimation systems that bypasses the need for manual annotation while assessing the performance of these systems under different circumstances. MeT-Pose thus allows users of pose estimation systems to assess the systems in conditions that more closely relate to their application without having to label an ad-hoc test dataset or to rely only on available datasets, which may not be adapted to their application domain. While we define MeT-Pose in general terms, we also present a non-exhaustive list of metamorphic rules that represent common challenges in computer vision applications, as well as a specific way to evaluate these rules. We then experimentally show the effectiveness of MeT-Pose by applying it to Mediapipe Holistic, a state of the art human pose estimation system, with the FLIC and PHOENIX datasets. With these experiments, we outline numerous ways in which the outputs of MeT-Pose can uncover faults in pose estimation systems at a similar or higher rate than classic testing using hand labelled data, and show that users can tailor the rule set they use to the faults and level of accuracy relevant to their application.
Wed 2 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
14:00 - 15:30 | Test OracleJournal-First Papers / Research Papers / Industry at Aula Magna (AM) Chair(s): Mike Papadakis University of Luxembourg | ||
14:00 15mTalk | Differential Testing of Concurrent Classes Research Papers Valerio Terragni University of Auckland, Shing-Chi Cheung Hong Kong University of Science and Technology | ||
14:15 15mTalk | Hybrid Equivalence/Non-Equivalence Testing Research Papers Laboni Sarker University of California at Santa Barbara, Tevfik Bultan University of California at Santa Barbara | ||
14:30 15mTalk | Metamorphic Testing for Pose Estimation Systems Research Papers Matias Duran Trinity College Dublin, Ireland, Thomas Laurent Lero@Trinity College Dublin, Ellen Rushe School of Computing Dublin City University, Anthony Ventresque Trinity College Dublin & Lero, Ireland | ||
14:45 15mTalk | Mutation-Guided Metamorphic Testing of Optimality in AI Planning Journal-First Papers Quentin Mazouni Simula Research Laboratory, Arnaud Gotlieb Simula Research Laboratory, Helge Spieker Simula Research Laboratory, Mathieu Acher University of Rennes - Inria - CNRS - IRISA, Benoit Combemale University of Rennes, Inria, CNRS, IRISA Link to publication DOI | ||
15:00 15mTalk | LLMs in the Heart of Differential Testing: A Case Study on a Medical Rule Engine Industry Erblin Isaku Simula Research Laboratory, and University of Oslo (UiO), Christoph Laaber Simula Research Laboratory, Hassan Sartaj Simula Research Laboratory, Shaukat Ali Simula Research Laboratory and Oslo Metropolitan University, Thomas Schwitalla Cancer Registry of Norway, Jan F. Nygård Cancer Registry of Norway |