Applications of natural language processing in software traceability: A systematic mapping study (EASE 2023 - Journal First)

Who

Zaki Pauzi, Andrea Capiluppi

Track

EASE 2023 Journal First

Time Zone

The program is currently displayed in (GMT+03:00) Athens.

Use conference time zone: (GMT+03:00) AthensSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 14 Jun 2023 16:30 - 16:40 at Aurora Hall - Methodology and Secondary Studies Chair(s): Thomas Fehlmann

Abstract

A key part of software evolution and maintenance is the continuous integration from collaborative efforts, often resulting in complex traceability challenges between software artifacts: features and modules remain scattered in the source code, and traceability links become harder to recover. In this paper, we perform a systematic mapping study dealing with recent research recovering these links through information retrieval, with a particular focus on natural language processing (NLP).

Our search strategy gathered a total of 96 papers in focus of our study, covering a period from 2013 to 2021. We conducted trend analysis on NLP techniques and tools involved, and traceability efforts (applying NLP) across the software development life cycle (SDLC). Based on our study, we have identified the following key issues, barriers, and setbacks: syntax convention, configuration, translation, explainability, properties representation, tacit knowledge dependency, scalability, and data availability.

Based on these, we consolidated the following open challenges: representation similarity across artifacts, the effectiveness of NLP for traceability, and achieving scalable, adaptive, and explainable models. To address these challenges, we recommend a holistic framework for NLP solutions to achieve effective traceability and efforts in achieving interoperability and explainability in NLP models for traceability.

Link to Publication

https://doi.org/10.1016/j.jss.2023.111616

DOI

https://doi.org/10.1016/j.jss.2023.111616

File attachments

Presentation slides (2023-06-15_JSS_J1C2.pptx)	1.20MiB

Zaki Pauzi

University of Groningen, BP plc

United Kingdom

Andrea Capiluppi

University of Groningen

Netherlands