Selective Regression Testing based on Big Data: Comparing Feature Extraction Techniques (NEXTA 2020)

Who

Khaled Al-Sabbagh, Miroslaw Staron, Regina Hebig, Miroslaw Ochodek, Wilhelm Meding

Track

NEXTA 2020

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sat 24 Oct 2020 11:30 - 12:00 at D. Luis - Session: Optimisation Chair(s): Adnan Causevic

Abstract

Regression testing is a necessary activity in continuous integration (CI) since it provides confidence that modified parts of the system are correct at each integration cycle. CI provides large volumes of data which can be used to support regression testing activities. By using machine learning, patterns about faulty changes in the modified program can be induced, allowing test orchestrators to make inferences about test cases that need to be executed at each CI cycle. However, one challenge in using learning models lies in finding a suitable way for characterizing source code changes and preserving important information. In this paper, we empirically evaluate the effect of three feature extraction algorithms on the performance of an existing ML-based selective regression testing technique. We designed and performed an experiment to empirically investigate the effect of Bag of Words (BoW), Word Embeddings (WE), and content-based feature extraction (CBF). We used stratified cross validation on the space of features generated by the three FE techniques and evaluated the performance of three machine learning models using the precision and recall metrics. The results from this experiment showed a significant difference between the models’ precision and recall scores, suggesting that the BoWfed model outperforms the other two models with respect to precision, whereas a CBF-fed model outperforms the rest with respect to recall.

Link to Publication

https://doi.org/10.1109/ICSTW50294.2020.00058

DOI

https://doi.org/10.1109/ICSTW50294.2020.00058

Khaled Al-Sabbagh

University of Gothenburg

Sweden

Miroslaw Staron

University of Gothenburg

Sweden

Regina Hebig

Chalmers | Gothenburg University

Sweden

Miroslaw Ochodek

Poznan University of Technology

Poland

Wilhelm Meding

Ericsson

Sweden

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sat 24 Oct
Displayed time zone: Lisbon change

11:00 - 12:30	Session: OptimisationNEXTA 2020 at D. Luis Chair(s): Adnan Causevic Mälardalen University

11:00 30m Full-paper		Optimization of automated executions based on integration test configurations of embedded software NEXTA 2020 Masashi Mizoguchi Hitachi Ltd., Takahiro Iida Hitachi Automotive Systems Ltd., Toru Irie Hitachi Automotive Systems Ltd. Link to publication DOI
11:30 30m Full-paper		Selective Regression Testing based on Big Data: Comparing Feature Extraction Techniques NEXTA 2020 Khaled Al-Sabbagh University of Gothenburg, Miroslaw Staron University of Gothenburg, Regina Hebig Chalmers \| Gothenburg University, Miroslaw Ochodek Poznan University of Technology, Wilhelm Meding Ericsson Link to publication DOI
12:00 20m Full-paper		Runtime Prioritization with the Classification Tree Method for Test Automation NEXTA 2020 Barbara Jung Expleo Germany, Peter M. Kruse Expleo Group Link to publication DOI