Is GitHub's Copilot as Bad As Humans at Introducing Vulnerabilities in Code?
\textit{Background/Context:} Several advances in deep learning have been successfully applied to the software development process. Of recent interest is the use of neural language models to build tools that assist in writing code. There is a growing body of work that evaluates these tools and their underlying language models. We aim to contribute to this line of research via a comparative empirical analysis of such tools and models from a security perspective. For the rest of this paper, we use CGT (Code Generation Tool) to refer both to language models and to tools, such as Copilot, that are built with language models.
\textit{Objective/Aim:} The aim of this study is to compare the performance of CGTs and human developers. Specifically, we investigate whether CGTs are just as likely to introduce the same software vulnerabilities as human developers.
\textit{Method:} We will use the Big-Vul dataset proposed by Fan et al.~\cite{fan_cc_2020}, a dataset of vulnerabilities introduced by human developers. For each entry in the dataset, we will recreate the scenario just before the bug was introduced and allow the CGT to generate a completion. The completions will be manually inspected and classified as (1) containing the same vulnerability introduced by the human, (2) containing a fix for the vulnerability, or (3) other. The \emph{other} category is a catch-all for scenarios that are out of scope for this project.
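To make the recreation step concrete, the following Python sketch outlines one plausible shape of the evaluation loop: truncate the pre-fix source at the vulnerable region to form a prompt, query the CGT, and record its completion alongside the human-written code for later manual classification. The field names (\texttt{func\_before}, \texttt{vuln\_line}, \texttt{cve\_id}) and the \texttt{generate} callable are illustrative assumptions, not the actual Big-Vul schema or a real CGT API.
\begin{verbatim}
# A minimal sketch of the planned evaluation loop. Field names
# ("func_before", "vuln_line", "cve_id") and the `generate`
# callable are assumptions for illustration only.

def build_prompt(pre_fix_source: str, vuln_start_line: int) -> str:
    """Recreate the scenario just before the bug was introduced by
    truncating the file at the start of the vulnerable region."""
    lines = pre_fix_source.splitlines()
    return "\n".join(lines[: vuln_start_line - 1])

def collect_completions(entries, generate):
    """Query the CGT once per dataset entry and keep its completion
    next to the human-written code for manual classification."""
    results = []
    for entry in entries:
        prompt = build_prompt(entry["func_before"], entry["vuln_line"])
        results.append({
            "cve_id": entry["cve_id"],
            "prompt": prompt,
            "human_code": entry["func_before"],
            "cgt_completion": generate(prompt),
            # One of "same_vulnerability", "fix", "other";
            # assigned later by a human inspector.
            "label": None,
        })
    return results
\end{verbatim}
In this sketch, \texttt{generate} is any wrapper around the CGT under evaluation, so the same loop can be reused across the different tools and language models being compared.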