Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems
Modern software systems are highly configurable, offering a vast number of configuration options that can be customized to meet specific functional and non-functional requirements. To support the configuration process, several automated software approaches based on machine learning have been proposed in the literature. These approaches aim to assist developers by predicting non-functional properties based on configuration settings. A recent study demonstrated the potential of leveraging a subset of configuration options (a.k.a. features) to achieve accurate performance predictions in the Linux kernel. The promise of learning over a reduced set of features – instead of all features – is to obtain performance models that are faster to compute, simpler to interpret, and still accurate. Despite the encouraging results of the original study, several questions remain unresolved: Can the findings be generalized to other configurable systems other than Linux? Which learning algorithms deliver the most efficient results when working with a reduced number of features? What are the most effective sampling strategies for building accurate and efficient models? In this work, we extend the original study by conducting an in-depth analysis across eight configurable systems. We evaluate the impact of sampling strategies and learning algorithms on model accuracy and training efficiency. Our goal is to understand whether there is a dominant sampling strategy and learning algorithm for varying systems and performance targets. Our results reveal variability in optimal strategies across systems and advocate for tailored approaches rather than universal solutions.
preprint (ICSR_Bessa2025_.pdf) | 525KiB |
Sun 27 AprDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:30 | Session 3: Micro-services and Configurable SystemsICSR at 204 Chair(s): Tommi Mikkonen University of Jyväskylä | ||
14:00 30mPaper | MONO2REST: Identification and exposition of micro-services: a reusable RESTification approach ICSR Matthéo Lecrivain Nantes Université, Hanifa Barry Université de Montréal, Dalila Tamzalit Nantes Université, Houari Sahraoui DIRO, Université de Montréal Pre-print | ||
14:30 30mPaper | Semantic Dependency in Microservice Architecture ICSR Amr Elsayed The University of Arizona, Kari E Cordes University of Arizona, Austin Medina University of Arizona, Tomas Cerny University of Arizona DOI Pre-print | ||
15:00 30mPaper | Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems ICSR João Marcello Bessa Rodrigues Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Millena Cavalcanti Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Mathieu Acher Univ Rennes, Inria, CNRS, IRISA, Markus Endler Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Juliana Alves Pereira PUC-Rio File Attached |