EASE 2024
Tue 18 - Fri 21 June 2024 Salerno, Italy
Thu 20 Jun 2024 16:40 - 16:55 at Room Capri - Data and Sustainability Chair(s): Gemma Catolino

Software Engineering (SE) researchers are extensively applying Large Language Models (LLMs) to address challenges in software engineering tasks such as code clone detection, code summarization, program comprehension among others. Despite promising results, LLMs have to be fine-tuned and customized with specific datasets for optimal performance. However, the proprietary nature of SE data, and the lack of LLMs trained on non-open source data is an open problem. While there exists work on applying Federated Learning (FL) for SE, integration of FL with LLMs for software engineering is unexplored. Hence, in this paper, we propose a FedLLM for the task of code summarization. We setup a federated learning architecture and fine-tune LLM (Llama2 with 6.7 billion parameters) using Parameter Efficient Fine-Tuning(PEFT) for code summarization. This is achieved with a 40GB RAM GPU in an A100 architecture. Results show that FL-trained LLM is as effective as a centrally-trained one. We envision that leveraging non-open source data using FedLLM for software engineering could be an interesting research direction.

Thu 20 Jun

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

16:00 - 16:40
Data and SustainabilityShort Papers, Vision and Emerging Results / Research Papers at Room Capri
Chair(s): Gemma Catolino University of Salerno
16:00
16m
Talk
Towards Comprehending Energy Consumption of Database Management Systems - A Tool and Empirical Study
Research Papers
Hemasri Sai Lella Indian Institute of Technology, Tirupati, Rajrupa Chattaraj Indian Institute of Technology Tirupati, India, Sridhar Chimalakonda Indian Institute of Technology, Tirupati, Manasa Kurra
DOI
16:15
15m
Talk
Data Quality Assessment in the Wild: Findings from GitHub
Research Papers
Ipek Ustunboyacioglu JADS/Tilburg University, Indika Kumara Tilburg University, Dario Di Nucci University of Salerno, Willem-Jan van den Heuvel JADS/Tilburg University, Damian Andrew Tamburri TU/e
16:30
10m
Talk
Sustainability in Blockchain Development: A BERT-Based Analysis of Ethereum Developer Discussions
Short Papers, Vision and Emerging Results
Matteo Vaccargiu University of Cagliari, Sabrina Aufiero University College London, Silvia Bartolucci University College London, Rumyana Neykova Brunel University London, Roberto Tonelli University of Cagliari, Giuseppe Destefanis Brunel University London
DOI Pre-print
16:40
15m
Talk
Code Summarization without Direct Access to Code - Towards Exploring Federated LLMs for Software Engineering
Research Papers
Jahnavi Kumar Indian Institute of Technology Tirupati, India, Sridhar Chimalakonda Indian Institute of Technology, Tirupati