DDPT: Diffusion Driven Prompt Tuning for Large Language Model Code Generation (CAIN 2025 - Research and Experience Papers)

Who

Jinyang Li, Sangwon Hyun, Muhammad Ali Babar

Track

CAIN 2025 Research and Experience Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 28 Apr 2025 16:00 - 16:15 at 208 - Generative Model Engineering Chair(s): Manel Abdellatif

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation. However, the quality of the generated code is heavily dependent on the structure and composition of the prompts used. Crafting high-quality prompts is a challenging task that requires significant knowledge and skills of prompt engineering. To advance the automation support for the prompt engineering for LLM-based code generation, we propose a novel solution Diffusion-Driven Prompt Tuning (DDPT) that learns how to generate optimal prompt embedding from Gaussian Noise to automate the prompt engineering for code generation. We evaluate the feasibility of diffusion-based optimization and abstract the optimal prompt embedding as a directional vector toward the optimal embedding. We use the code generation loss given by the LLMs to help the diffusion model to capture the distribution of optimal prompt embedding during training. The trained diffusion model can build a path from the noise distribution to the optimal distribution at the sampling phrase. The evaluation result enable us to assert that that DDPT helps improve the prompt optimization for code generation and diffusion-driven language modeling techniques.

Jinyang Li

The University of Adelaide

Australia

Sangwon Hyun

CREST, University of Adelaide

Muhammad Ali Babar

School of Computer Science, The University of Adelaide