HANTracer: Leveraging Heterogeneous Graph Attention Network for Large-Scale Requirements-Code Traceability Link Recovery
In the task of requirements-to-code traceability link recovery, the continues growth of software scale has led to diminishing differences between indices and more complex nonlinear relationships within the data. This results in the performance decline of the most widely used information retrieval methods and machine learning methods when handling this task. Therefore, we propose a requirement traceability method based on heterogeneous graph attention networks, named HANTracer. The model integrates the high-dimensional vectors generated by a pre-trained model as node features to deepen the differentiation between nodes and enhances node representations with contextual information learned from the graph structure. Additionally, it utilizes the properties of heterogeneous edges to construct various edge features, such as code calling, code inheritance, and text similarity relationships, aiding the model in understanding and utilizing the relationships between different types of data elements. By incorporating average pooling layers and multiple fully connected layers, the HANTracer model is further improved to enhance its ability to extract nonlinear features. Experimental results indicate that HANTracer achieves an average F1 performance higher than the state-of-the-art methods TAROT by 100.30% and DF4RT by 46.27% on seven real-world open source software (OSS) datasets, demonstrating significant performance advantages in large-scale and complex data environments.
Thu 5 DecDisplayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change
16:00 - 17:30 | Session (14)ERA - Early Research Achievements / Technical Track at Room 3 (Xiangquan Ballroom) Chair(s): Jun Sun Singapore Management University | ||
16:00 30mTalk | Improving Code Search with Hard Negative Sampling Based on Fine-tuning Technical Track Hande Dong International Digital Economy Academy, Jiayi Lin International Digital Economy Academy, Yanlin Wang Sun Yat-sen University, Yichong Leng University of Science and Technology of China, Jiawei Chen Zhejiang University, Yutao Xie International Digital Economy Academy | ||
16:30 30mTalk | HANTracer: Leveraging Heterogeneous Graph Attention Network for Large-Scale Requirements-Code Traceability Link Recovery Technical Track Zhiyuan Zou , Bangchao Wang Wuhan Textile University, Hongyan Wan Wuhan Textile University, Huan Jin Wuhan Textile University, Xiaoxiao Li School of Computer Science and Artificial Intelligence, Wuhan Textile University, Yukun Cao School of Computer Science and Artificial Intelligence, Wuhan Textile University | ||
17:00 20mTalk | Enhancing Source Code Comment Generation via Retrieval-Augmented Generation with Design Document Term Dictionary ERA - Early Research Achievements Kazu Nishikawa Hitachi, Ltd. Research & Development Group., Genta Koreki Hitachi, Ltd. Research & Development Group., Hideyuki Kanuka Hitachi, Ltd. |