Beyond Accuracy: Evaluating Source Code Capabilities in Large Language Models for Software Engineering
This dissertation introduces interpretability techniques to comprehensively evaluate the performance of Large Language Models (LLMs) on software engineering tasks, beyond canonical metrics. In software engineering, Deep Learning techniques are widely employed across various domains, automating tasks such as code comprehension, bug fixing, code summarization, machine translation, and code generation. However, the prevalent use of accuracy-based metrics for evaluating Language Models trained on code often leads to an overestimation of their performance. Our work proposes novel and comprehensive interpretability techniques to evaluate source code capabilities and to provide a more nuanced understanding of LLMs' performance across downstream tasks.
Tue 16 Apr (time zone: Lisbon)
14:00 - 15:30 | Focus Group: AI/ML for SE (Doctoral Symposium) at Fernando Pessoa | Chair(s): Reyhaneh Jabbarvand (University of Illinois at Urbana-Champaign)
14:00 (90m) | Poster | Beyond Accuracy: Evaluating Source Code Capabilities in Large Language Models for Software Engineering | Doctoral Symposium | Alejandro Velasco (William & Mary)
14:00 (90m) | Poster | Towards Interpreting the Behavior of Large Language Models on Software Engineering Tasks | Doctoral Symposium | Atish Kumar Dipongkor (University of Central Florida)
14:00 (90m) | Poster | Programming Language Models in Multilingual Settings | Doctoral Symposium | Jonathan Katzy (Delft University of Technology)
14:00 (90m) | Poster | Beyond Accuracy and Robustness Metrics for Large Language Models for Code | Doctoral Symposium
14:00 (90m) | Poster | Towards Safe, Secure, and Usable LLMs4Code | Doctoral Symposium | Ali Al-Kaswan (Delft University of Technology, Netherlands)