Fri 28 Jun 2024 10:45 - 11:15 at V201 - Traceability Chair(s): Fatma Başak Aydemir

Being aware of and understanding the relations between the requirements of a software system and its other artifacts is crucial for its successful development, maintenance and evolution. There are approaches to automatically recover this traceability information, but they fail to identify the actually relevant parts of the requirements. Recent large language model-based requirements classification approaches have been shown to identify aspects and concerns of requirements with promising accuracy. Therefore, we investigate the potential of those classification approaches for identifying irrelevant requirement parts for traceability link recovery between requirements and code.

We train the large language model-based requirements classification approach NoRBERT on a new dataset of requirements and their entailed aspects and concerns. We use the results of the classification to filter irrelevant parts of the requirements before recovering trace links with the fine-grained word embedding-based FTLR approach.
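The filter-then-recover idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the sentence labels are assigned by hand instead of being predicted by NoRBERT, plain token overlap (Jaccard similarity) stands in for FTLR's word embedding-based similarity, and all names, label strings, and the threshold are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    text: str
    label: str  # hypothetical label; a trained classifier would predict this


# Assumption: only these (hypothetical) labels mark parts relevant for tracing.
RELEVANT_LABELS = {"functional", "user-related"}


def filter_requirement(sentences):
    """Keep only the text of sentences whose label is considered relevant."""
    return [s.text for s in sentences if s.label in RELEVANT_LABELS]


def tokens(text):
    """Lowercased word set with surrounding punctuation stripped."""
    stripped = (w.strip(".,;:()").lower() for w in text.split())
    return {w for w in stripped if w}


def jaccard(a, b):
    """Token-overlap similarity; a simple stand-in for embedding similarity."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def recover_links(requirements, code_elements, threshold=0.2):
    """Link each filtered requirement to code elements above the threshold."""
    links = []
    for req_id, sentences in requirements.items():
        req_tokens = set()
        for text in filter_requirement(sentences):
            req_tokens |= tokens(text)
        for code_id, code_text in code_elements.items():
            if jaccard(req_tokens, tokens(code_text)) >= threshold:
                links.append((req_id, code_id))
    return links


# Hypothetical toy data for illustration only.
requirements = {
    "R1": [
        Sentence("The user can reset the password.", "functional"),
        Sentence("This was requested in the last meeting.", "other"),
    ]
}
code_elements = {
    "PasswordService.reset": "reset password user",
    "MeetingNotes.export": "export meeting notes",
}

links = recover_links(requirements, code_elements)
print(links)
```

In this toy setup, the sentence labeled "other" is dropped before matching, so its words never contribute to the comparison with code elements.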

Two empirical studies show promising results regarding the quality of classification and the impact on traceability link recovery. NoRBERT can identify functional and user-related aspects in the requirements with an F1-score of 84%. With the classification and requirements filtering, the performance of FTLR could be improved significantly and FTLR performs better than state-of-the-art unsupervised traceability link recovery approaches.

Fri 28 Jun

Displayed time zone: (UTC) Coordinated Universal Time

10:45 - 12:15
10:45 (30m) Paper: Requirements Classification for Traceability Link Recovery [Research Papers]
    Tobias Hey Karlsruhe Institute of Technology (KIT), Sophie Corallo Karlsruhe Institute of Technology (KIT), Jan Keim Karlsruhe Institute of Technology (KIT)
11:15 (30m) Paper: Enhancing Traceability Graphs using Machine Learning: Real time use-cases and lessons learnt from ‘impakt’ research project (Presentation Only) [Industrial Innovation Papers]
11:45 (30m) Paper: Code Gradients: Automated Traceability of LLM-Generated Code [RE@Next! Papers]