Enhancing Software Runtime with Reinforcement Learning-Driven Mutation Operator Selection in Genetic Improvement
Genetic Improvement employs heuristic search algorithms to explore the search space of program variants by modifying code using mutation operators. This research focuses on operators that delete, insert and replace source code statements. Traditionally, in GI, an operator is chosen uniformly at random at each search iteration. This work leverages Reinforcement Learning to intelligently guide the selection of these operators specifically to improve program runtime. We propose to integrate RL into the operator selection process. Four Multi-Armed bandit RL algorithms (Epsilon Greedy, UCB, Probability Matching, and Policy Gradient) were integrated within a GI framework, and their efficacy and efficiency were benchmarked against the traditional GI operator selection approach. These RL-guided operator selection strategies have demonstrated empirical superiority over the traditional GI methods of randomly selecting a search operator, with UCB emerging as the top-performing RL algorithm. On average, the UCB-guided Hill Climbing search algorithm produced variants that compiled and passed all tests 44% of the time, while only 22% of the variants produced by the traditional uniform random selection strategies compiled and passed all tests. Index Terms—Reinforcement learning, Genetic Improvement.
Sun 27 AprDisplayed time zone: Eastern Time (US & Canada) change
11:00 - 12:30 | Morning Session 2GI at 202 Chair(s): William Langdon University College London, Vesna Nowack Imperial College London | ||
11:00 30mTalk | Large Language Model based Code Completion is an Effective Genetic Improvement Mutation GI Jingyuan Wang University College London, Carol Hanna University College London, Justyna Petke University College London | ||
11:30 30mTalk | Enhancing Software Runtime with Reinforcement Learning-Driven Mutation Operator Selection in Genetic Improvement GI Damien Bose University College London, Carol Hanna University College London, Justyna Petke University College London | ||
12:00 30mTalk | Empirical Comparison of Runtime Improvement Approaches: Genetic Improvement, Parameter Tuning, and Their Combination GI Thanatad Songpetchmongkol University College London, Aymeric Blot University of Rennes, IRISA / INRIA, Justyna Petke University College London |