ESEIW 2022
Sun 18 - Fri 23 September 2022 Helsinki, Finland
Fri 23 Sep 2022 14:15 - 14:30 at Sonck - Session 5B - Development & Testing & Behavioral 2 Chair(s): Sheila Reinehr

Background. Software effort can be measured by story point \cite{10.1145/2639490.2639503}. Story point estimation is important in software projects’ planning. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem. These approaches require expensive embedding models and confront challenges that the sequence of text might not be an efficient representation for software issues which can be the combination of text and code.

Aims. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We use the dataset proposed by Choetkiertikul et al. \cite{DBLP:journals/corr/ChoetkiertikulD16} for evaluation. We select GPT2SP \cite{9732669} and Deep-SE \cite{DBLP:journals/corr/ChoetkiertikulD16} as the baselines for comparison.

Method. First, from the analysis of the story point dataset \cite{DBLP:journals/corr/ChoetkiertikulD16}, we conclude that software issues are actually a mixture of natural language sentences with quoted code snippets and have problems related to large-size vocabulary. Second, we provide a module to normalize the input text including words and code tokens of the software issues. Third, we design an algorithm to convert an input software issue to a graph with different types of nodes and edges. Fourth, we construct a heterogeneous graph neural networks model with the support of fastText \cite{bojanowski2017enriching} for constructing initial node embedding to learn and predict the story points of new issues.

Results. We did the comparison over three scenarios of estimation, including within project, cross-project within the repository, and cross-project cross repository with our baseline approaches. We achieve the average Mean Absolute Error (MAE) as 2.38, 2.61, and 2.63 for three scenarios. We outperform GPT2SP in 2/3 of the scenarios while outperforming Deep-SE in the most challenging scenario with significantly less amount of running time. We also compare our approaches with different homogeneous graph neural network models and the results show that the heterogeneous graph neural networks model outperforms the homogeneous models in story point estimation. For time performance, we achieve about 570 seconds as the time performance in both three processes: node embedding initialization, model construction, and story point estimation.

Conclusion. HeteroSP, a heterogeneous graph neural networks model for story point estimation, achieved good accuracy and efficient running time.

Fri 23 Sep

Displayed time zone: Athens change

13:30 - 15:00
Session 5B - Development & Testing & Behavioral 2ESEM Technical Papers at Sonck
Chair(s): Sheila Reinehr Pontifícia Universidade Católica do Paraná (PUCPR)
13:30
15m
Full-paper
Potential Technical Debt and Its Resolution in Code Reviews: An Exploratory Study of the OpenStack and Qt Communities
ESEM Technical Papers
Liming Fu Wuhan University, Peng Liang Wuhan University, China, Zeeshan Rasheed Wuhan University, Zengyang Li Central China Normal University, Amjed Tahir Massey University, Xiaofeng Han Wuhan University, China
Link to publication DOI Pre-print
13:45
15m
Full-paper
MMF3: Neural Code Summarization Based on Multi-Modal Fine-Grained Feature Fusion
ESEM Technical Papers
Zheng Ma Shandong Normal University, Yuexiu Gao Shandong Normal University, Lei Lyu Shandong Normal University, Chen Lyu Shandong Normal University
14:00
15m
Full-paper
PG-VulNet: Detect Supply Chain Vulnerabilities in IoT Devices using Pseudo-code and Graphs
ESEM Technical Papers
Xin Liu Lanzhou University, Yixiong Wu Institute for Network Science and Cyberspace of Tsinghua University, Qingchen Yu Zhejiang University, Shangru Song Beijing Institute of Technology, Yue Liu Southeast University; Qi An Xin Group Corp., Qingguo Zhou Lanzhou University, Jianwei Zhuge Tsinghua University
14:15
15m
Full-paper
Heterogeneous Graph Neural Networks for Software Effort Estimation
ESEM Technical Papers
Hung Phan Iowa State University, Ali Jannesari Iowa State University
Pre-print
14:30
15m
Full-paper
Meetings and Mood - Related or Not? Insights from Student Software Projects
ESEM Technical Papers
Jil Klünder Leibniz Universität Hannover, Oliver Karras TIB - Leibniz Information Centre for Science and Technology
Pre-print
14:45
15m
Full-paper
A Tale of Two Tasks: Automated Issue Priority Prediction with Deep Multi-task Learning
ESEM Technical Papers
Yingling Li , Xing Che , Yuekai Huang Institute of Software, Chinese Academy of Sciences, Junjie Wang Institute of Software at Chinese Academy of Sciences, Song Wang York University, Yawen Wang Institute of Software, Chinese Academy of Sciences, Qing Wang Institute of Software at Chinese Academy of Sciences