Autorepairability of ChatGPT and Gemini: A Comparative Study
In recent years, Automated Program Repair (APR), which fixes defects in source code without human intervention, has become an active research topic in software engineering, and numerous repair techniques have been proposed. In this context, Lapvikai et al. introduced a software quality metric called "Autorepairability", which indicates how easily bugs in the target source code can be fixed by APR techniques.
By using Autorepairability, developers can check in advance whether APR techniques are likely to work effectively on the target software, and can refactor the code to improve its Autorepairability. Lapvikai et al. not only proposed the metric but also measured it with a traditional APR technique based on genetic programming. In the past few years, however, program repair using large language models (LLMs) has become widespread, and several studies have reported that LLMs outperform traditional APR techniques in repair capability.
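To make the metric concrete, the following is a minimal sketch, assuming Autorepairability is computed as the proportion of seeded (artificially injected) faults that a given APR technique successfully repairs. The function names, the fault-seeding step, and the repair oracle here are hypothetical illustrations, not Lapvikai et al.'s actual implementation.

```python
# Hypothetical sketch: Autorepairability as the fraction of seeded faults
# that an APR technique manages to repair. `apr_fix` and the set of seeded
# faults are placeholders, not the authors' tooling.

from typing import Callable, Iterable


def autorepairability(seeded_faults: Iterable[str],
                      apr_fix: Callable[[str], bool]) -> float:
    """Return the fraction of seeded faulty variants that `apr_fix` repairs.

    seeded_faults: identifiers of faulty program variants, e.g. produced by
                   mutating one location of the target source code.
    apr_fix:       runs one APR technique (kGenProg, an LLM prompt, ...) and
                   reports whether all tests pass after the attempted repair.
    """
    faults = list(seeded_faults)
    if not faults:
        return 0.0
    repaired = sum(1 for fault in faults if apr_fix(fault))
    return repaired / len(faults)


# Usage (illustrative): compare techniques on the same seeded faults.
# scores = {name: autorepairability(faults, fix)
#           for name, fix in techniques.items()}
```

Under this reading, comparing techniques amounts to computing the same ratio for each of them over an identical set of seeded faults, which is the kind of comparison carried out in this study.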
In this study, we use Autorepairability to compare the performance of multiple APR techniques. Specifically, we measured and compared Autorepairability using ChatGPT and Gemini, two representative LLMs, as well as kGenProg, a traditional APR technique. The results show that Gemini achieved higher repair capability than both ChatGPT and kGenProg.