PROFES 2024
Mon 2 - Wed 4 December 2024 Tartu, Estonia

Prioritizing tasks is extremely beneficial but also difficult for software development teams. Assigning priority to tasks is also time-consuming, especially in projects with a high volume of new issues. Consequently, many issues in GitHub are not labelled. An effective priority tool can streamline this process by suggesting priority labels, saving developers’ time and enabling faster identification of high-impact product improvements. In this paper, we investigate the application of text classification using Transformer models to automatically assign priority labels to software development issues. We used data from large GitHub and Jira datasets to develop state-of-the-art machine learning models (Transformers) that automatically classify the priority of textual issues. We thoroughly evaluated the generalizability of our models by using issues that are self-tagged by developers in projects that were not part of the training (Out-of-Distribution), and we adapted our models to specific projects by incorporating part of their issues in the training (fine-tuning) to improve performance. Our experiments show that results vary but can reach a performance of correctly labeling 80% of high-priority issues in a project. Our results indicate that Transformers have the potential to assist developers in (semi-)automatically assigning priority labels to their issues, thereby reducing overhead. We find that fine-tuning significantly improves performance by adapting the machine learning models to specific projects, but further research is needed to optimize this approach.

Wed 4 Dec

Displayed time zone: Athens

14:00 - 15:30
PROFES Session 9: AI and ML for Software Engineering
Short Papers and Posters / Research Papers at UT Library - Room 2 (Seminar Room Tõstamaa)
Chair(s): Giuseppe Scanniello University of Basilicata
14:00
12m
Short-paper
Practical considerations and solutions in NLP-based analysis of code review comments - An experience report
Short Papers and Posters
14:12
12m
Short-paper
Towards Automated Recovery of Links Between Code Commits and Requirements – Initial Results
Short Papers and Posters
Risha Parveen, Ali Mehraj Tampere University, Zheying Zhang Tampere University, Kari Systa Tampere University, Terhi Kilamo Tampere University
14:24
18m
Research paper
Towards Enhancing Task Prioritization in Software Development Through Transformer-Based Issues Classification
Research Papers
Kristian Marison Haugerud University of Oslo, Karthik Shivashankar University of Oslo, Antonio Martini University of Oslo, Norway
14:42
12m
Short-paper
The Effects of Semantic Information on LLM-based Program Repair
Short Papers and Posters
Shota Hori Osaka University, Shinsuke Matsumoto Osaka University, Shinji Kusumoto Osaka University, Yoshiki Higo Osaka University, Kazuya Yasuda Hitachi, Ltd., Shinji Itoh Hitachi, Ltd., Research & Development Group, Phan Thi Thanh Huyen Hitachi, Ltd., Research & Development Group
14:54
36m
Talk
Session 9 Discussion
Research Papers