Fri 2 May 2025 11:30 - 11:45 at 204 - Program Comprehension 3 Chair(s): Arie van Deursen

To keep up with changes in requirements, frameworks, and coding practices, software organizations might need to migrate code from one language to another. Source-to-source migration, or transpilation, is often a complex, manual process. Transpilation requires expertise in both the source and target languages, making it highly laborious and costly. Language models for code generation and transpilation are becoming increasingly popular. However, despite capturing code structure well, code generated by language models is often spurious and contains subtle problems. We propose BatFix, a novel approach that augments language models for transpilation by leveraging program repair and synthesis to fix the code generated by these models. BatFix takes as input the original program, the target program generated by the machine translation model, and a set of test cases, and outputs a repaired program that passes all test cases. Experimental results show that our approach is agnostic to language models and programming languages. BatFix can locate bugs spanning multiple lines and synthesize patches for syntax and semantic bugs for programs migrated from Java to C++ and Python to C++ from multiple language models, including OpenAI’s Codex.
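The input/output contract described in the abstract (original program, translated program, test cases in; a program that passes all tests out) can be illustrated with a small sketch. This is not BatFix's algorithm: the names `Program`, `passes_all`, and `repair` are hypothetical, programs are modelled as plain Python functions on integers, the original program is assumed to serve as the test oracle, and the candidate patches are supplied by hand rather than synthesized.

```python
from typing import Callable, Iterable, Optional, Sequence

# Assumption: a "program" is modelled as a function from int to int.
Program = Callable[[int], int]


def passes_all(candidate: Program, reference: Program, tests: Sequence[int]) -> bool:
    """Return True if the candidate agrees with the reference on every test input."""
    for x in tests:
        try:
            if candidate(x) != reference(x):
                return False
        except Exception:
            return False  # a crash counts as a failing test
    return True


def repair(reference: Program, translated: Program,
           patches: Iterable[Program], tests: Sequence[int]) -> Optional[Program]:
    """Return the translation if it already passes, else the first passing patch."""
    if passes_all(translated, reference, tests):
        return translated
    for patch in patches:
        if passes_all(patch, reference, tests):
            return patch
    return None  # no candidate passes all tests


# Toy example: the "source" computes x*x + 1; the "translation" drops the + 1.
source = lambda x: x * x + 1
translated = lambda x: x * x
candidates = [lambda x: x * x - 1, lambda x: x * x + 1]
fixed = repair(source, translated, candidates, tests=[0, 1, 2, 3])
```

Here `fixed` is the second candidate, the only one that agrees with `source` on every test input. A real repair tool would have to locate the fault and generate the candidates itself; the sketch only shows the test-based acceptance check.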

Fri 2 May

Displayed time zone: Eastern Time (US & Canada)

11:00 - 12:30
Program Comprehension 3 (Research Track / Journal-first Papers) at 204
Chair(s): Arie van Deursen TU Delft
11:00
15m
Talk
Automated Test Generation For Smart Contracts via On-Chain Test Case Augmentation and Migration
Blockchain
Research Track
Jiashuo Zhang Peking University, China, Jiachi Chen Sun Yat-sen University, John Grundy Monash University, Jianbo Gao Peking University, Yanlin Wang Sun Yat-sen University, Ting Chen University of Electronic Science and Technology of China, Zhi Guan Peking University, Zhong Chen
Pre-print
11:15
15m
Talk
Boosting Code-line-level Defect Prediction with Spectrum Information and Causality Analysis
Research Track
Shiyu Sun, Yanhui Li Nanjing University, Lin Chen Nanjing University, Yuming Zhou Nanjing University, Jianhua Zhao Nanjing University, China
11:30
15m
Talk
BatFix: Repairing language model-based transpilation
Journal-first Papers
Daniel Ramos Carnegie Mellon University, Ines Lynce INESC-ID/IST, Universidade de Lisboa, Vasco Manquinho INESC-ID; Universidade de Lisboa, Ruben Martins Carnegie Mellon University, Claire Le Goues Carnegie Mellon University
11:45
15m
Talk
Tracking the Evolution of Static Code Warnings: The State-of-the-Art and a Better Approach
Journal-first Papers
Junjie Li, Jinqiu Yang Concordia University
12:00
15m
Talk
PACE: A Program Analysis Framework for Continuous Performance Prediction
Journal-first Papers
Chidera Biringa University of Massachusetts, Gokhan Kul University of Massachusetts Dartmouth
12:15
15m
Talk
Mimicking Production Behavior With Generated Mocks
Journal-first Papers
Deepika Tiwari KTH Royal Institute of Technology, Martin Monperrus KTH Royal Institute of Technology, Benoit Baudry Université de Montréal