Semantic-aware Source Code Modeling (ASE 2024 - Doctoral Symposium)

Sun 27 October - Fri 1 November 2024 Sacramento, California, United States

Track

ASE 2024 Doctoral Symposium

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 28 Oct 2024 10:30 - 11:00 at Bondi - DS: Student Presentations I

Abstract

Source code modeling represents a promising avenue for automating software development, such as code generation, bug repair, and program analysis. This research direction aims to train deep neural nets to learn the statistical predictability inherent in human-written programs to enhance developer productivity, code quality, and the overall software development life cycle.

Although existing code modeling approaches, particularly those underpinned by Transformer-based language models, have demonstrated effectiveness across various software engineering tasks, most of them have directly adopted learning schemes from natural language processing (e.g., data collection and processing, training objectives) to source code, primarily focusing on learning code text and syntax. However, such a direct transplant limits the models’ capability to capture deep program semantics, such as code functionality, dependencies, and program states during execution.

In this research proposal, we highlight the critical role of program semantics in source code modeling. We propose a range of innovative methodologies to bridge the gap between the text-based language models for large-scale code training and the requirement of deep semantic understanding to assist with software engineering tasks effectively. Furthermore, we showcase the efficacy of the proposed semantic-aware code modeling through a handful of published papers and preliminary results, with motivations to delve deeper into this avenue during doctoral research.

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 28 Oct
Displayed time zone: Pacific Time (US & Canada) change

10:30 - 12:00	DS: Student Presentations IDoctoral Symposium at Bondi

10:30 30m Talk		Semantic-aware Source Code Modeling Doctoral Symposium Yangruibo Ding Columbia University
11:00 30m Talk		Software Supply Chain Risk: Characterization, Measurement & Attenuation Doctoral Symposium Alexis Butler Royal Holloway University of London
11:30 30m Talk		Using AI to Automate the Modernization of Legacy Software Applications Doctoral Symposium Vikram Nitin Columbia University

Semantic-aware Source Code Modeling

Program Display Configuration

Program Display Configuration

Mon 28 OctDisplayed time zone: Pacific Time (US & Canada) change

Yangruibo Ding

Columbia University

Mon 28 Oct
Displayed time zone: Pacific Time (US & Canada) change