Code to Comment "Translation": Data, Metrics, Baselining & Evaluation (ASE 2020 - Research Papers)

Who

David Gros, Hariharan Sezhiyan, Prem Devanbu, Zhou Yu

Track

ASE 2020 Research Papers

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 23 Sep 2020 17:10 - 17:30 at Koala - Empirical Software Engineering (1) Chair(s): Jinqiu Yang

Abstract

The relationship of comments to code, and in particular, the task of generating useful comments given the code, has long been of interest. The earliest approaches have been based on strong syntactic theories of comment-structures, and relied on textual templates. More recently, researchers have applied deep-learning methods to this task—specifically, trainable generative translation models which are known to work very well for Natural Language translation (e.g., from German to English). We carefully examine the underlying assumption here: that the task of generating comments sufficiently resembles the task of translating between natural languages, and so similar models and evaluation metrics could be used. We analyze several recent code-comment datasets for this task: CodeNN, DeepCom, FunCom, and DocString. We compare them with WMT19, a standard dataset frequently used to train state-of-the-art natural language translators. We found some interesting differences between the code-comment data and the WMT19 natural language data. Next, we describe and conduct some studies to calibrate BLEU (which is commonly used as a measure of comment quality). using "affinity pairs" of methods, from different projects, in the same project, in the same class, etc; Our study suggests that the current performance on some datasets might need to be improved substantially. We also argue that fairly naive information retrieval (IR) methods do well enough at this task to be considered a reasonable baseline. Finally, we make some suggestions on how our findings might be used in future research in this area.

David Gros

University of California, Davis

Hariharan Sezhiyan

University of California, Davis

Prem Devanbu

University of California

United States

Zhou Yu

University of California, Davis

Time Zone

The program is currently displayed in (UTC) Coordinated Universal Time.

Use conference time zone: (UTC) Coordinated Universal TimeSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 23 Sep
Displayed time zone: (UTC) Coordinated Universal Time change

17:10 - 18:10	Empirical Software Engineering (1)Research Papers / Journal-first Papers at Koala Chair(s): Jinqiu Yang Concordia University, Montreal, Canada

17:10 20m Talk		Code to Comment "Translation": Data, Metrics, Baselining & Evaluation Research Papers David Gros University of California, Davis, Hariharan Sezhiyan University of California, Davis, Prem Devanbu University of California, Zhou Yu University of California, Davis
17:30 20m Talk		Reproducing Performance Bug Reports in Server Applications: The Researchers' Experiences Journal-first Papers Xue Han University of Kentucky, Daniel Carroll University of Kentucky, Tingting Yu University of Kentucky Link to publication DOI
17:50 20m Talk		Exploring the Architectural Impact of Possible Dependencies in Python software Research Papers Wuxia Jin Xi'an Jiaotong University, Yuanfang Cai Drexel University, Rick Kazman University of Hawai‘i at Mānoa, Gang Zhang Emergent Design Inc, Qinghua Zheng Xi'an Jiaotong University, Ting Liu Xi'an Jiaotong University