Analyzing Prompt Influence on Automated Method Generation: An Empirical Study with Copilot (ICPC 2024 - Research Track)

Who

Ionut Daniel Fagadau, Leonardo Mariani, Daniela Micucci, Oliviero Riganelli

Track

ICPC 2024 Research Track

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Mon 15 Apr 2024 11:20 - 11:30 at Sophia de Mello Breyner Andresen - AI-Assisted Program Comprehension Chair(s): Collin McMillan

Abstract

Generative AI is changing the way developers interact with software systems, providing services that are able to produce and deliver new content, crafted to satisfy the actual needs of developers. For instance, developers can ask for new code directly from within their IDEs by writing natural language prompts, and integrated services based on generative AI, such as Copilot, immediately respond to a prompt by providing a ready-to-use code snippet. Indeed, formulating the prompt appropriately, incorporating all the useful information, can be an important factor towards obtaining the right piece of code. The task of designing good prompts is known as prompt engineering.

In this paper, we systematically investigate the influence of seven prompt features, about the style and the content of the prompt, on the level of correctness of the resulting code. We specifically consider the task of using Copilot to obtain the body of 200 Java methods with 124,800 prompts obtained by systematically combining the seven considered prompt features. Results show how some elements, such as the presence of examples and the summary of the semantics of the method in the prompt, can significantly influence the quality of the result.

Link to Preprint

https://arxiv.org/pdf/2402.08430.pdf

Ionut Daniel Fagadau

University of Milano - Bicocca

Leonardo Mariani

University of Milano-Bicocca

Italy

Daniela Micucci

University of Milano-Bicocca

Italy

Oliviero Riganelli

University of Milano - Bicocca

Italy

Session Program

Mon 15 Apr
Displayed time zone: Lisbon

11:00 - 12:30	AI-Assisted Program Comprehension
Research Track / Replications and Negative Results (RENE) / Early Research Achievements (ERA) / Tool Demonstration
at Sophia de Mello Breyner Andresen
Chair(s): Collin McMillan (University of Notre Dame)

11:00	10m	Talk	Towards Summarizing Code Snippets Using Pre-Trained Transformers
ICPC Full paper, Research Track
Antonio Mastropaolo (Università della Svizzera italiana), Matteo Ciniselli (Università della Svizzera Italiana), Luca Pascarella (ETH Zurich), Rosalia Tufano (Università della Svizzera Italiana), Emad Aghajani (Software Institute, USI Università della Svizzera italiana), Gabriele Bavota (Software Institute @ Università della Svizzera Italiana)

11:10	10m	Talk	Generating Java Methods: An Empirical Assessment of Four AI-Based Code Assistants
ICPC Full paper, Research Track
Vincenzo Corso (University of Milano - Bicocca), Leonardo Mariani (University of Milano-Bicocca), Daniela Micucci (University of Milano-Bicocca, Italy), Oliviero Riganelli (University of Milano - Bicocca)

11:20	10m	Talk	Analyzing Prompt Influence on Automated Method Generation: An Empirical Study with Copilot
ICPC Full paper, Research Track
Ionut Daniel Fagadau (University of Milano - Bicocca), Leonardo Mariani (University of Milano-Bicocca), Daniela Micucci (University of Milano-Bicocca, Italy), Oliviero Riganelli (University of Milano - Bicocca)

11:30	10m	Talk	Interpretable Online Log Analysis Using Large Language Models with Prompt Strategies
ICPC Full paper, Research Track
Yilun Liu (Huawei co. LTD), Shimin Tao (University of Science and Technology of China; Huawei co. LTD), Weibin Meng (Huawei co. LTD), Jingyu Wang, Wenbing Ma (Huawei co. LTD), Yuhang Chen (University of Science and Technology of China), Yanqing Zhao (Huawei co. LTD), Hao Yang (Huawei co. LTD), Yanfei Jiang (Huawei co. LTD)

11:40	10m	Talk	Do Machines and Humans Focus on Similar Code? Exploring Explainability of Large Language Models in Code Summarization
ICPC RENE Paper, Replications and Negative Results (RENE)
Jiliang Li (Vanderbilt University), Yifan Zhang (Vanderbilt University), Zachary Karas (Vanderbilt University), Collin McMillan (University of Notre Dame), Kevin Leach (Vanderbilt University), Yu Huang (Vanderbilt University)

11:50	10m	Talk	Knowledge-Aware Code Generation with Large Language Models
ICPC Full paper, Research Track
Tao Huang (Shandong Normal University), Zhihong Sun (Shandong Normal University), Zhi Jin (Peking University), Ge Li (Peking University), Chen Lyu (Shandong Normal University)

12:00	8m	Talk	Enhancing Source Code Representations for Deep Learning with Static Analysis
ICPC ERA Paper, Early Research Achievements (ERA)
Xueting Guan (University of Melbourne), Christoph Treude (Singapore Management University)

12:08	8m	Talk	AthenaLLM: Supporting Experiments with Large Language Models in Software Development
ICPC Tools, Tool Demonstration
Benedito Fernando Albuquerque de Oliveira (Federal University of Pernambuco), Fernando Castor (University of Twente and Federal University of Pernambuco)

12:16	14m	Talk	AI-Assisted Program Comprehension: Panel with Speakers
ICPC Discussion