Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems (ICSR 2025 - 22nd International Conference on Systems and Software Reuse)

Who

João Marcello Bessa Rodrigues, Millena Cavalcanti, Mathieu Acher, Markus Endler, Juliana Alves Pereira

Track

ICSR 2025

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 27 Apr 2025 15:00 - 15:30 at 204 - Session 3: Micro-services and Configurable Systems Chair(s): Tommi Mikkonen

Abstract

Modern software systems are highly configurable, offering a vast number of configuration options that can be customized to meet specific functional and non-functional requirements. To support the configuration process, several automated software approaches based on machine learning have been proposed in the literature. These approaches aim to assist developers by predicting non-functional properties based on configuration settings. A recent study demonstrated the potential of leveraging a subset of configuration options (a.k.a. features) to achieve accurate performance predictions in the Linux kernel. The promise of learning over a reduced set of features – instead of all features – is to obtain performance models that are faster to compute, simpler to interpret, and still accurate. Despite the encouraging results of the original study, several questions remain unresolved: Can the findings be generalized to other configurable systems other than Linux? Which learning algorithms deliver the most efficient results when working with a reduced number of features? What are the most effective sampling strategies for building accurate and efficient models? In this work, we extend the original study by conducting an in-depth analysis across eight configurable systems. We evaluate the impact of sampling strategies and learning algorithms on model accuracy and training efficiency. Our goal is to understand whether there is a dominant sampling strategy and learning algorithm for varying systems and performance targets. Our results reveal variability in optimal strategies across systems and advocate for tailored approaches rather than universal solutions.

File attachments

preprint (ICSR_Bessa2025_.pdf)	525KiB

João Marcello Bessa Rodrigues

Pontifical Catholic University of Rio de Janeiro (PUC-Rio)

Millena Cavalcanti

Pontifical Catholic University of Rio de Janeiro (PUC-Rio)

Mathieu Acher

Univ Rennes, Inria, CNRS, IRISA

France

Markus Endler

Pontifical Catholic University of Rio de Janeiro (PUC-Rio)

Juliana Alves Pereira

PUC-Rio

Brazil

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 27 Apr
Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:30	Session 3: Micro-services and Configurable SystemsICSR at 204 Chair(s): Tommi Mikkonen University of Jyväskylä

14:00 30m Paper		MONO2REST: Identification and exposition of micro-services: a reusable RESTification approach ICSR Matthéo Lecrivain Nantes Université, Hanifa Barry Université de Montréal, Dalila Tamzalit Nantes Université, Houari Sahraoui DIRO, Université de Montréal Pre-print
14:30 30m Paper		Semantic Dependency in Microservice Architecture ICSR Amr Elsayed The University of Arizona, Kari E Cordes University of Arizona, Austin Medina University of Arizona, Tomas Cerny University of Arizona DOI Pre-print
15:00 30m Paper		Unveiling the Impact of Sampling on Feature Selection for Performance Prediction in Configurable Systems ICSR João Marcello Bessa Rodrigues Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Millena Cavalcanti Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Mathieu Acher Univ Rennes, Inria, CNRS, IRISA, Markus Endler Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Juliana Alves Pereira PUC-Rio File Attached