OCoR: An Overlapping-Aware Code Retriever (ASE 2020 - Research Papers)

Who

Qihao Zhu, Zeyu Sun, Xiran Liang, Yingfei Xiong, Lu Zhang

Track

ASE 2020 Research Papers

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Thu 24 Sep 2020 02:40 - 03:00 at Wombat - AI for Software Engineering (4) Chair(s): Hoa Khanh Dam

Abstract

Code retrieval helps developers reuse the code snippet in the open-source projects. Given a natural language description, code retrieval aims to search for the most relevant code among a set of code. Existing state-of-the-art approaches apply neural networks to code retrieval. However, these approaches still fail to capture an important feature: overlaps. The overlaps between different names used by different people indicate that two different names may be potentially related (e.g., message'' andmsg''), and the overlaps between identifiers in code and words in natural language descriptions indicate that the code snippet and the description may potentially be related. To address these problems, we propose a novel neural architecture named OCoR, where we introduce two specifically-designed components to capture overlaps: the first embeds identifiers by character to capture the overlaps between identifiers, and the second introduces a novel overlap matrix to represent the degrees of overlaps between each natural language word and each identifier.

The evaluation was conducted on two established datasets. The experimental results show that OCoR significantly outperforms the existing state-of-the-art approaches and achieves 13.1% to 22.3% improvements. Moreover, we also conducted several in-depth experiments to help understand the performance of different components in OCoR.

Qihao Zhu

Peking University

Zeyu Sun

Peking University

Xiran Liang

Peking University

Yingfei Xiong

Peking University, China

Lu Zhang

Peking University, China

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Thu 24 Sep
Displayed time zone: (UTC) Coordinated Universal Time change

02:20 - 03:20	AI for Software Engineering (4)Research Papers / NIER track at Wombat Chair(s): Hoa Khanh Dam University of Wollongong

02:20 20m Talk		Detecting and Explaining Self-Admitted Technical Debts with Attention-based Neural Networks Research Papers Xin Wang Wuhan University, Jin Liu Wuhan University, Li Li Monash University, Australia, Xiao Chen Monash University, Xiao Liu School of Information Technology, Deakin University, Hao Wu Yunnan University
02:40 20m Talk		OCoR: An Overlapping-Aware Code Retriever Research Papers Qihao Zhu Peking University, Zeyu Sun Peking University, Xiran Liang Peking University, Yingfei Xiong Peking University, China, Lu Zhang Peking University, China
03:00 10m Talk		BugPecker: Locating Faulty Methods with Deep Learning on Revision Graphs NIER track Junming Cao School of Software, Shanghai Jiao Tong University, Shouliang Yang School of Software, Shanghai Jiao Tong University, Wenhui Jiang School of Software, Shanghai Jiao Tong University, Hushuang Zeng School of Software, Shanghai Jiao Tong University, Beijun Shen School of Software, Shanghai Jiao Tong University, Hao Zhong Shanghai Jiao Tong University