Practical considerations and solutions in NLP-based analysis of code review comments - An experience report
Context: Automated analysis of code review comments (CRCs) can help surface issues that reviewers frequently discuss across large repositories. However, CRCs mix natural language text with code references, so topic modeling approaches must be selected carefully.
Objective: This work discusses the challenges observed while evaluating two topic modeling methods for the analysis of CRCs.
Method: We evaluated GSDMM and BERTopic for identifying frequently discussed themes in 5,560 CRCs, followed by a domain expert's evaluation of the quality of the generated themes.
Results: We report several observations and challenges in improving the quality of the generated themes, including choices of pre-processing, topic modeling parameters, embedding model, and objective measures, all of which affect the interpretability of the generated topics.
Conclusions: This work raises important questions about how CRCs should be analyzed and offers potential avenues and suggestions for further exploration. Future studies can use the technical demonstrator to explore the interpretability of the topics generated from CRCs.
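For readers unfamiliar with GSDMM (the Gibbs Sampling Dirichlet Multinomial Mixture model for short texts), the core procedure can be sketched in pure Python. This is a minimal illustrative sampler with toy parameters, not the authors' implementation or any particular library; each document is assigned to exactly one cluster, and cluster assignments are resampled in proportion to how well the cluster's word counts explain the document.

```python
import math
import random
from collections import defaultdict

def gsdmm(docs, K=8, alpha=0.1, beta=0.1, iters=15, seed=0):
    """Cluster tokenized short texts (e.g. code review comments) with GSDMM.

    docs  : list of token lists
    K     : upper bound on the number of clusters (GSDMM tends to empty some)
    alpha : prior weight toward popular clusters
    beta  : prior weight toward clusters sharing the document's words
    Returns (assignments, per-cluster word counts).
    """
    rng = random.Random(seed)
    V = len({w for d in docs for w in d})          # vocabulary size
    m = [0] * K                                    # documents per cluster
    n = [0] * K                                    # total words per cluster
    nw = [defaultdict(int) for _ in range(K)]      # word counts per cluster

    # Random initial assignment of every document to a cluster.
    z = [rng.randrange(K) for _ in docs]
    for d, k in zip(docs, z):
        m[k] += 1
        n[k] += len(d)
        for w in d:
            nw[k][w] += 1

    for _ in range(iters):
        for i, d in enumerate(docs):
            # Remove document i from its current cluster's statistics.
            k = z[i]
            m[k] -= 1
            n[k] -= len(d)
            for w in d:
                nw[k][w] -= 1

            # Log-probability of each cluster generating document i.
            logp = []
            for c in range(K):
                lp = math.log(m[c] + alpha)
                seen = defaultdict(int)
                for w in d:
                    lp += math.log(nw[c][w] + beta + seen[w])
                    seen[w] += 1
                for t in range(len(d)):
                    lp -= math.log(n[c] + V * beta + t)
                logp.append(lp)

            # Sample a new cluster from the normalized probabilities.
            mx = max(logp)
            ps = [math.exp(l - mx) for l in logp]
            r = rng.random() * sum(ps)
            for c, p in enumerate(ps):
                r -= p
                if r <= 0:
                    break

            # Add document i to the sampled cluster's statistics.
            z[i] = c
            m[c] += 1
            n[c] += len(d)
            for w in d:
                nw[c][w] += 1
    return z, nw

# Toy usage on hypothetical pre-tokenized review comments.
docs = [["rename", "variable", "style"], ["style", "variable", "naming"],
        ["null", "check", "missing"], ["missing", "test", "null"]]
assignments, word_counts = gsdmm(docs, K=4, iters=20, seed=1)
```

The choices the paper highlights (pre-processing, number of topics, priors) map directly onto `docs`, `K`, `alpha`, and `beta` here; BERTopic replaces this count-based sampler with embedding-based clustering and so introduces the additional embedding-model choice.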