Requirements Classification for Traceability Link Recovery (Requirements Engineering 2024 - Research Papers)

Who

Tobias Hey, Sophie Corallo, Jan Keim

Track

Requirements Engineering 2024 Research Papers

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 28 Jun 2024 10:45 - 11:15 at V201 - Traceability Chair(s): Fatma Başak Aydemir

Abstract

Being aware of and understanding the relations between the requirements of a software system to its other artifacts is crucial for their successful development, maintenance and evolution. There are approaches to automatically recover this traceability information, but they fail to identify the actual relevant parts of the requirements. Recent large language model-based requirements classification approaches have shown to be able to identify aspects and concerns of requirements with promising accuracy. Therefore, we investigate the potential of those classification approaches for identifying irrelevant requirement parts for traceability link recovery between requirements and code.

We train the large language model-based requirements classification approach NoRBERT on a new dataset of requirements and their entailed aspects and concerns. We use the results of the classification to filter irrelevant parts of the requirements before recovering trace links with the fine-grained word embedding-based FTLR approach.

Two empirical studies show promising results regarding the quality of classification and the impact on traceability link recovery. NoRBERT can identify functional and user-related aspects in the requirements with an F₁-score of 84%. With the classification and requirements filtering, the performance of FTLR could be improved significantly and FTLR performs better than state-of-the-art unsupervised traceability link recovery approaches.

Link to Preprint

https://doi.org/10.5445/IR/1000170117

Tobias Hey

Karlsruhe Institute of Technology (KIT)

Germany

Sophie Corallo

Karlsruhe Institute of Technology (KIT)

Jan Keim

Karlsruhe Institute of Technology (KIT)

Germany

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Fri 28 Jun
Displayed time zone: (UTC) Coordinated Universal Time change

10:45 - 12:15	TraceabilityResearch Papers / RE@Next! Papers / Industrial Innovation Papers at V201 Chair(s): Fatma Başak Aydemir Utrecht University

10:45 30m Paper		Requirements Classification for Traceability Link Recovery Research Papers Tobias Hey Karlsruhe Institute of Technology (KIT), Sophie Corallo Karlsruhe Institute of Technology (KIT), Jan Keim Karlsruhe Institute of Technology (KIT) Pre-print
11:15 30m Paper		Enhancing Traceability Graphs using Machine Learning: Real time use-cases and lessons learnt from ‘impakt’ research project (Presentation Only) Industrial Innovation Papers Sruthi Radhakrishnan , Andreas Graf
11:45 30m Paper		Code Gradients: Automated Traceability of LLM-Generated Code RE@Next! Papers Marc North , Nelly Bencomo Durham University, Amir Atapour-Abarghouei