Translation of Low-Resource COBOL to Logically Correct and Readable Java leveraging High-Resource Java Refinement (LLM4Code 2024)

Who

Shubham Gandhi, Manasi Patwardhan, Jyotsana Khatri, Lovekesh Vig, Raveendra Kumar Medicherla

Track

LLM4Code 2024

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Sat 20 Apr 2024 11:50 - 12:00 at Luis de Freitas Branco - Session 2: Full Papers Chair(s): Yiling Lou

Abstract

Automated translation of legacy code to modern programming languages is urgently needed to modernize enterprise systems. This work specifically addresses automated COBOL-to-Java translation. Traditional rule-based tools perform statement-wise translation and overlook opportunities to modularize and refactor the source COBOL code into human-readable Java. Our investigation reveals that state-of-the-art code Large Language Models (LLMs) struggle with logical correctness and readability when directly translating low-resource COBOL code to Java. To address these challenges, we propose an LLM-based workflow that leverages temperature sampling and refinement-based strategies, not only to ensure logical correctness of the translation but also to maximize the readability of the target Java code. We exploit the fact that, owing to their extensive exposure to human-written Java code during pre-training, LLMs are better equipped to comprehend and refine translated Java code than to translate COBOL to Java. With a dataset sourced from CodeNet, we demonstrate that sequential refinement of the translated high-resource Java code, with execution-guided logic feedback followed by LLM-based readability feedback, yields better logical correctness (81.99% execution accuracy) and readability (0.610 score) than LLM-based translation with test cases and readability guidance (60.25% and 0.539) or refinement of the translation task itself (77.95% and 0.572).
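
The workflow named in the abstract (temperature sampling, then execution-guided logic refinement, then readability refinement) can be pictured roughly as below. This is a minimal Python sketch under stated assumptions, not the authors' implementation: the callables `generate`, `run_tests` and `readability`, the prompt wording, the temperature values and the round limits are all hypothetical, and accepting a readability rewrite only when the tests still pass is a choice made for this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence

# Hypothetical interfaces (assumptions, not taken from the paper):
Generate = Callable[[str, float], str]      # (prompt, temperature) -> Java source
RunTests = Callable[[str], Optional[str]]   # Java source -> None if all tests pass, else a failure message
Readability = Callable[[str], float]        # Java source -> score, higher is better


@dataclass
class Result:
    java: Optional[str]   # final Java source, or None if no candidate passed
    passed: bool
    readability: float


def translate(cobol: str, generate: Generate, run_tests: RunTests,
              readability: Readability,
              temperatures: Sequence[float] = (0.2, 0.6, 1.0),
              max_logic_rounds: int = 3, max_read_rounds: int = 2) -> Result:
    # Step 1: sample several candidate translations at different temperatures.
    candidates = [generate("Translate this COBOL to Java:\n" + cobol, t)
                  for t in temperatures]

    def refine_logic(java: str) -> Optional[str]:
        # Step 2: feed test failures back and ask for a fix, on the Java side.
        failure = run_tests(java)
        rounds = 0
        while failure is not None and rounds < max_logic_rounds:
            java = generate("Fix this Java so the failing test passes.\n"
                            "Failure: " + failure + "\n" + java, 0.0)
            failure = run_tests(java)
            rounds += 1
        return java if failure is None else None

    correct = None
    for candidate in candidates:
        correct = refine_logic(candidate)
        if correct is not None:
            break
    if correct is None:
        return Result(None, False, 0.0)

    # Step 3: readability refinement. A rewrite is kept only if the tests
    # still pass and the score improves (an assumption of this sketch).
    best, best_score = correct, readability(correct)
    for _ in range(max_read_rounds):
        revised = generate("Improve readability without changing behaviour:\n" + best, 0.0)
        if run_tests(revised) is None:
            score = readability(revised)
            if score > best_score:
                best, best_score = revised, score
    return Result(best, True, best_score)
```

In practice `generate` would wrap an LLM call, `run_tests` would compile and execute the Java against the test cases, and `readability` would be whatever scoring function is in use; none of these are specified by the abstract.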

Link to Preprint

https://llm4code.github.io/assets/pdf/papers/51.pdf

Shubham Gandhi, TCS Research, India

Manasi Patwardhan, TCS Research

Jyotsana Khatri, TCS Research

Lovekesh Vig, TCS Research, New Delhi, India

Raveendra Kumar Medicherla, TCS Research, Tata Consultancy Services, India

Session Program

Sat 20 Apr
Displayed time zone: Lisbon

11:00 - 12:30	Session 2: Full Papers (LLM4Code) at Luis de Freitas Branco. Chair(s): Yiling Lou, Fudan University

11:00 10m Talk		LLM-based and Retrieval-Augmented Control Code Generation (LLM4Code). Heiko Koziolek (ABB Corporate Research), Sten Grüner (ABB Corporate Research), Rhaban Amelung née Hark (ABB Research), Virendra Ashiwal (ABB Research), Sofia Linsbauer (ABB Research), Nafise Eskandani (ABB Corporate Research Center). Pre-print
11:10 10m Talk		Learn to Code Sustainably: An Empirical Study on Green Code Generation (LLM4Code). Tina Vartziotis (TWT Science and Innovation, National Technical University of Athens), Ippolyti Dellatolas (Massachusetts Institute of Technology), George Dasoulas (Harvard University), Maximilian Schmidt (TWT Science and Innovation), Florian Schneider (TWT Science and Innovation), Tim Hoffmann (Mercedes-Benz), Sotirios Kotsopoulos (National Technical University of Athens, Massachusetts Institute of Technology), Michael Keckeisen (TWT Science and Innovation)
11:20 10m Talk		Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions (LLM4Code). Federico Cassano (Northeastern University), Tao Li (Northeastern University), Akul Sethi (Northeastern University), Noah Shinn (Northeastern University), Abby Brennan-Jones (Wellesley College), Anton Lozhkov (Hugging Face), Carolyn Jane Anderson (Wellesley College), Arjun Guha (Northeastern University; Roblox). Pre-print
11:30 10m Talk		HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations (LLM4Code). Thai Minh Nguyen (Monash University), Nghi D. Q. Bui (Fulbright University, Viet Nam)
11:40 10m Talk		LLM-based Control Code Generation using Image Recognition (LLM4Code). Heiko Koziolek (ABB Corporate Research), Anne Koziolek (Karlsruhe Institute of Technology). Pre-print
11:50 10m Talk		Translation of Low-Resource COBOL to Logically Correct and Readable Java leveraging High-Resource Java Refinement (LLM4Code). Shubham Gandhi (TCS Research), Manasi Patwardhan (TCS Research), Jyotsana Khatri (TCS Research), Lovekesh Vig (TCS Research, New Delhi, India), Raveendra Kumar Medicherla (TCS Research, Tata Consultancy Services). Pre-print
12:00 10m Talk		Unit Test Generation using Generative AI: A Comparative Performance Analysis of Autogeneration Tools (LLM4Code). Shreya Bhatia (IIIT Delhi), Tarushi Gandhi (IIIT Delhi), Dhruv Kumar (Indraprastha Institute of Information Technology, Delhi), Pankaj Jalote (IIIT Delhi). Pre-print
12:10 10m Talk		StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code (LLM4Code, Best Presentation Award). Hannah McLean Babe (Oberlin College), Sydney Nguyen (Wellesley College), Yangtian Zi (Northeastern University), Arjun Guha (Northeastern University; Roblox), Molly Q Feldman (Oberlin College), Carolyn Jane Anderson (Wellesley College). Pre-print
12:20 10m Talk		PromptSet: A Programmer’s Prompting Dataset (LLM4Code). Kaiser Pister (University of Wisconsin-Madison), Dhruba Jyoti Paul (University of Wisconsin-Madison), Ishan Joshi (University of Wisconsin-Madison), Patrick Brophy (University of Wisconsin-Madison). Pre-print