PROFES 2024
Mon 2 - Wed 4 December 2024 Tartu, Estonia

This program is tentative and subject to change.

Context: Automated analysis of code review comments (CRCs) can aid in highlighting frequently discussed issues by reviewers from large repositories. However, CRCs contain natural language text and code references; thus, topic modeling approaches must be carefully selected. Objective: This work aims to discuss the various challenges observed while evaluating two topic modeling methods for the analysis of CRCs. Method: We evaluated GSDMM and BERTopic to analyze frequently discussed themes in CRCs. We utilized 5,560 CRCs, followed by an evaluation of the quality of themes from a domain expert. Results: We report several observations and challenges in improving the quality of the generated themes, including choices regarding the pre-processing, topic modeling parameters, embedding model, and objective measures used, which impact the interpretability of the generated topics. Conclusions: This work raises important questions regarding the approach for analysis of CRCs and provides potential avenues and suggestions for further exploration. Future studies can utilize the technical demonstrator to explore the interpretability of the generated topics from CRCs.

This program is tentative and subject to change.

Wed 4 Dec

Displayed time zone: Athens change

14:00 - 15:30
PROFES Session 9: AI and ML for Software EngineeringShort Papers and Posters / Research Papers at UT Library - Room 2
14:00
12m
Short-paper
Practical considerations and solutions in NLP-based analysis of code review comments - An experience report
Short Papers and Posters
14:12
12m
Short-paper
Towards Automated Recovery of Links Between Code Commits and Requirements – Initial Results
Short Papers and Posters
Risha Parveen , Ali Mehraj Tampere University, Zheying Zhang Tampere University, Kari Systa Tampere University, Terhi Kilamo Tampere University
14:24
18m
Research paper
Towards Enhancing Task Prioritization in Software Development Through Transformer-Based Issues Classification
Research Papers
Kristian Marison Haugerud University of Oslo, Karthik Shivashankar University of Oslo, Antonio Martini University of Oslo, Norway
14:42
12m
Short-paper
The Effects of Semantic Information on LLM-based Program Repair
Short Papers and Posters
Shota Hori Osaka University, Shinsuke Matsumoto Osaka University, Shinji Kusumoto Osaka University, Yoshiki Higo Osaka University, Kazuya Yasuda Hitachi, Ltd., Shinji Itoh Hitachi, Ltd., Research &Development Group, Phan Thi Thanh Huyen Hitachi, Ltd., Research &Development Group
14:54
36m
Talk
Session 9 Discussion
Research Papers