EASE 2024
Tue 18 - Fri 21 June 2024, Salerno, Italy

Automatic code summarization, also known as code comment generation, has been shown to help developers better understand and maintain software projects. However, few studies have investigated the robustness of such models. Robustness requires that a model sustain the quality of its output summaries in the presence of perturbations to the inputs. In this paper, we provide an in-depth study of the robustness of code summarization models. We propose CREATE (Code summaRization modEl’s Adversarial aTtackEr), an approach for performing adversarial attacks against these models. CREATE generates adversarial samples that mislead the model and probe its robustness while remaining compilable and semantically similar to the original code. To evaluate the effectiveness and efficiency of our approach, we attack mainstream code summarization models on a large-scale, publicly available Java dataset. The experimental results indicate that CREATE’s attack effectiveness and efficiency surpass those of the baselines, reducing the quality of the generated comments by at least 40%. Furthermore, we investigate the magnitude of the perturbations that CREATE introduces during adversarial attacks: the similarity between the adversarial samples generated by CREATE and the input code is approximately 0.8, showing that CREATE induces smaller perturbations than the other baselines. Finally, we use CREATE for adversarial training of the models, and our experiments confirm that this training effectively enhances their robustness.