Enhancing Software Runtime with Reinforcement Learning-Driven Mutation Operator Selection in Genetic Improvement (GI 2025)

Who

Damien Bose, Carol Hanna, Justyna Petke

Track

GI 2025 Genetic Improvement

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 27 Apr 2025 11:30 - 12:00 at 202 - Morning Session 2 Chair(s): William Langdon, Vesna Nowack

Abstract

Genetic Improvement employs heuristic search algorithms to explore the search space of program variants by modifying code using mutation operators. This research focuses on operators that delete, insert and replace source code statements. Traditionally, in GI, an operator is chosen uniformly at random at each search iteration. This work leverages Reinforcement Learning to intelligently guide the selection of these operators specifically to improve program runtime. We propose to integrate RL into the operator selection process. Four Multi-Armed bandit RL algorithms (Epsilon Greedy, UCB, Probability Matching, and Policy Gradient) were integrated within a GI framework, and their efficacy and efficiency were benchmarked against the traditional GI operator selection approach. These RL-guided operator selection strategies have demonstrated empirical superiority over the traditional GI methods of randomly selecting a search operator, with UCB emerging as the top-performing RL algorithm. On average, the UCB-guided Hill Climbing search algorithm produced variants that compiled and passed all tests 44% of the time, while only 22% of the variants produced by the traditional uniform random selection strategies compiled and passed all tests. Index Terms—Reinforcement learning, Genetic Improvement.

Damien Bose

University College London

United Kingdom

Carol Hanna

University College London

United Kingdom

Justyna Petke

University College London

United Kingdom

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 27 Apr
Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30	Morning Session 2GI at 202 Chair(s): William Langdon University College London, Vesna Nowack Imperial College London

11:00 30m Talk		Large Language Model based Code Completion is an Effective Genetic Improvement Mutation GI Jingyuan Wang University College London, Carol Hanna University College London, Justyna Petke University College London
11:30 30m Talk		Enhancing Software Runtime with Reinforcement Learning-Driven Mutation Operator Selection in Genetic Improvement GI Damien Bose University College London, Carol Hanna University College London, Justyna Petke University College London
12:00 30m Talk		Empirical Comparison of Runtime Improvement Approaches: Genetic Improvement, Parameter Tuning, and Their Combination GI Thanatad Songpetchmongkol University College London, Aymeric Blot University of Rennes, IRISA / INRIA, Justyna Petke University College London