Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems
Modern software systems are highly configurable, offering a vast number of configuration options that can be customized to meet specific functional and non-functional requirements. To support the configuration process, several automated software approaches based on machine learning have been proposed in the literature. These approaches aim to assist developers by predicting non-functional properties based on configuration settings. A recent study demonstrated the potential of leveraging a subset of configuration options (a.k.a. features) to achieve accurate performance predictions in the Linux kernel. The promise of learning over a reduced set of features – instead of all features – is to obtain performance models that are faster to compute, simpler to interpret, and still accurate. Despite the encouraging results of the original study, several questions remain unresolved: Can the findings be generalized to other configurable systems other than Linux? Which learning algorithms deliver the most efficient results when working with a reduced number of features? What are the most effective sampling strategies for building accurate and efficient models? In this work, we extend the original study by conducting an in-depth analysis across eight configurable systems. We evaluate the impact of sampling strategies and learning algorithms on model accuracy and training efficiency. Our goal is to understand whether there is a dominant sampling strategy and learning algorithm for varying systems and performance targets. Our results reveal variability in optimal strategies across systems and advocate for tailored approaches rather than universal solutions.
| preprint (ICSR_Bessa2025_.pdf) | 525KiB |
Sun 27 AprDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:30 | Session 3: Micro-services and Configurable SystemsICSR at 204 Chair(s): Tommi Mikkonen University of Jyväskylä | ||
14:00 30mPaper | MONO2REST: Identification and exposition of micro-services: a reusable RESTification approach ICSR Matthéo Lecrivain Nantes Université, Hanifa Barry Université de Montréal, Dalila Tamzalit Nantes Université, Houari Sahraoui DIRO, Université de Montréal Pre-print | ||
14:30 30mPaper | Semantic Dependency in Microservice Architecture ICSR Amr Elsayed The University of Arizona, Kari E Cordes University of Arizona, Austin Medina University of Arizona, Tomas Cerny University of Arizona DOI Pre-print | ||
15:00 30mPaper | Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems ICSR João Marcello Bessa Rodrigues Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Millena Cavalcanti Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Mathieu Acher Univ Rennes, Inria, CNRS, IRISA, Markus Endler Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Juliana Alves Pereira PUC-Rio File Attached | ||