EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery
Issue-commit links, as a type of software traceability links, play a vital role in various software development and maintenance tasks. However, they are typically deficient, as developers often forget or fail to create tags when making commits. Existing studies have deployed deep learning techniques, including pre-trained models, to improve automatic issue-commit link recovery. Despite their promising performance, we argue that previous approaches have four main problems, hindering them from recovering links in large software projects. To overcome these problems, we propose an efficient and accurate pre-trained framework called EALink for issue-commit link recovery. EALink requires much fewer model parameters than existing pre-trained methods, bringing efficient training and recovery. Moreover, we design various techniques to improve the recovery accuracy of EALink. We construct a large-scale dataset and conduct extensive experiments to demonstrate the power of EALink. Results show that EALink outperforms the state-of-the-art methods by a large margin (15.23%-408.65%) on various evaluation metrics. Meanwhile, its training and inference overhead is orders of magnitude lower than existing methods. We provide our implementation and data at https://github.com/KDEGroup/EALink.
Tue 12 SepDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
10:30 - 12:00 | Open Source and Software Ecosystems 1Research Papers / Tool Demonstrations at Room E Chair(s): Denys Poshyvanyk William & Mary | ||
10:30 12mTalk | An Empirical Study of Malicious Code In PyPI Ecosystem Research Papers Wenbo Guo School of Cyber Science and Engineering, Sichuan University, Zhengzi Xu Nanyang Technological University, Chengwei Liu Nanyang Technological University, Cheng Huang School of Cyber Science and Engineering, Sichuan University, Yong Fang School of Cyber Science and Engineering, Sichuan University, Yang Liu Nanyang Technological University Pre-print | ||
10:42 12mTalk | Understanding and Remediating Open-Source License Incompatibilities in the PyPI Ecosystem Research Papers Weiwei Xu Peking University, Hao He Carnegie Mellon University, Kai Gao University of Science and Technology Beijing, Minghui Zhou Peking University Pre-print | ||
10:54 12mTalk | Mitigating Persistence of Open-Source Vulnerabilities in Maven Ecosystem Research Papers Lyuye Zhang Nanyang Technological University, Chengwei Liu Nanyang Technological University, Sen Chen Tianjin University, Zhengzi Xu Nanyang Technological University, Lingling Fan Nankai University, Lida Zhao Nanyang Technological University, Yiran Zhang Nanyang Technological University, Yang Liu Nanyang Technological University | ||
11:06 12mTalk | Bus Factor Explorer Tool Demonstrations Egor Klimov JetBrains Research, Muhammad Umair Ahmed Bilkent University, Nikolai Sviridov JetBrains Research, Pouria Derakhshanfar JetBrains Research, Eray Tüzün Bilkent University, Vladimir Kovalenko JetBrains Research Media Attached | ||
11:30 12mTalk | EALink: An Efficient and Accurate Pre-Trained Framework for Issue-Commit Link Recovery Research Papers Chenyuan Zhang Xiamen University, Yanlin Wang Sun Yat-sen University, Zhao Wei Tencent, Yong Xu Tencent, Juhong Wang Tencent, Hui Li Xiamen University, Rongrong Ji Xiamen University Pre-print Media Attached | ||
11:42 12mTalk | Fork Entropy: Assessing the Diversity of Open Source Software Projects' ForksRecorded talk Research Papers Liang Wang Nanjing University, Zhiwen Zheng State Key Laboratory for Novel Software Technology, Nanjing University, Xiangchen Wu State Key Laboratory for Novel Software Technology, Nanjing University, Baihui Sang State Key Laboratory for Novel Software Technology, Nanjing University, Jierui Zhang Nanjing University, Xianping Tao Nanjing University Media Attached |