EASE 2024
Tue 18 - Fri 21 June 2024, Salerno, Italy

Software Engineering (SE) researchers are extensively applying Large Language Models (LLMs) to address challenges in software engineering tasks such as code clone detection, code summarization, and program comprehension, among others. Despite promising results, LLMs must be fine-tuned and customized with task-specific datasets for optimal performance. However, much SE data is proprietary, and the lack of LLMs trained on non-open-source data remains an open problem. While prior work has applied Federated Learning (FL) to SE, the integration of FL with LLMs for software engineering is unexplored. Hence, in this paper, we propose FedLLM for the task of code summarization. We set up a federated learning architecture and fine-tune an LLM (Llama2 with 6.7 billion parameters) for code summarization using Parameter-Efficient Fine-Tuning (PEFT). This is achieved on a single NVIDIA A100 GPU with 40 GB of memory. Results show that the FL-trained LLM is as effective as a centrally trained one. We envision that leveraging non-open-source data via FedLLM for software engineering could be an interesting research direction.
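The abstract does not include implementation details, but the following minimal sketch illustrates how one federated round of such a setup might look: each client fine-tunes LoRA adapters locally (PEFT via the Hugging Face peft library) and the server aggregates only the adapter weights with FedAvg. The model name, LoRA hyperparameters, client datasets, and the train_locally helper are illustrative assumptions, not the authors' code.

```python
# Hypothetical sketch of one FedLLM round, assuming LoRA-based PEFT on Llama2
# and FedAvg aggregation of the (small) adapter weights only.
from copy import deepcopy

import torch
from transformers import AutoModelForCausalLM
from peft import (LoraConfig, get_peft_model,
                  get_peft_model_state_dict, set_peft_model_state_dict)


def fedavg(adapter_states, weights):
    """Weighted average of per-client LoRA adapter state dicts (FedAvg)."""
    total = sum(weights)
    avg = deepcopy(adapter_states[0])
    for key in avg:
        avg[key] = sum(w * s[key] for s, w in zip(adapter_states, weights)) / total
    return avg


def train_locally(model, dataset):
    # Placeholder for one epoch of standard causal-LM fine-tuning on the
    # client's (code, summary) pairs, e.g. with the transformers Trainer.
    pass


# Assumed base model and LoRA configuration (not taken from the abstract).
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf",
                                            torch_dtype=torch.float16)
lora_cfg = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                      target_modules=["q_proj", "v_proj"],
                      task_type="CAUSAL_LM")
model = get_peft_model(base, lora_cfg)

clients = [["client-0 data"], ["client-1 data"], ["client-2 data"]]  # placeholders
global_adapter = get_peft_model_state_dict(model)

for round_idx in range(3):  # number of federated rounds is illustrative
    client_states, client_sizes = [], []
    for client_data in clients:
        # Each client starts from the current global adapter, trains locally,
        # and sends back only its adapter weights (never the raw code data).
        set_peft_model_state_dict(model, global_adapter)
        train_locally(model, client_data)
        client_states.append(deepcopy(get_peft_model_state_dict(model)))
        client_sizes.append(len(client_data))
    global_adapter = fedavg(client_states, client_sizes)
```

Because only LoRA adapter tensors are exchanged and averaged, the communication and aggregation cost stays small relative to the 6.7B-parameter base model, which is what makes federated fine-tuning feasible on a single 40 GB A100.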