Goal-oriented Knowledge Reuse via Curriculum Evolution for Reinforcement Learning-based Adaptation
Reinforcement learning is a powerful methodology that enables self-adaptive systems to relearn and update their adaptation policy when dealing with unforeseen changes. To update the policy more efficiently, several knowledge reuse approaches have been proposed to speed up relearning. However, the current studies treat and reuse the knowledge integrally, which may result in increased relearning costs if the reused knowledge is inappropriate in the changed situation. Generally, some localized pieces of the knowledge are still appropriate for reuse if they are not related to the changes, while some pieces may become inappropriate for reuse if they are affected by the changes. This paper proposes a goal-oriented curriculum evolution method to realize finer-grained knowledge reuse, combining goal-oriented modeling and curriculum learning. The method is twofold: (1) at design time, we apply goal-oriented modeling to design a curriculum in which an RL problem is decomposed into sub-problems, so that knowledge can be decomposed into several pieces of localized knowledge for sub-problems, and (2) at runtime, we evolve the curriculum to reflect changes (i.e., update the sub-problems related to the changes), so that the affected pieces of knowledge can be locally updated to make them appropriate for reuse in the changed situation. The evaluation based on a cleaning robot shows that the relearning time was shortened, demonstrating the effectiveness of our method.
Thu 8 DecDisplayed time zone: Osaka, Sapporo, Tokyo change
15:00 - 16:30 | Machine Learning 2Technical Track at Room3 Chair(s): Morakot Choetkiertikul Mahidol University, Thailand | ||
15:00 20mPaper | Retrieve-Guided Commit Message Generation with Semantic Similarity And Disparity Technical Track Zhihan Li School of Computer Science and Engineering, Central South University, Yi Cheng School of Computer Science and Engineering, Central South University, Haiyang Yang School of Computer Science and Engineering, Central South University, Li Kuang School of Computer Science and Engineering, Central South University, Lingyan Zhang School of Computer Science and Engineering, Central South University | ||
15:20 20mPaper | Systematic Analysis of Defect Specific Code Abstraction for Neural Program Repair Technical Track Kicheol Kim Sungkyunkwan University, Misoo Kim Sungkyunkwan University, Eunseok Lee Sungkyunkwan University | ||
15:40 20mPaper | NEGAR: Network Embedding Guided Architecture Recovery for Software Systems Technical Track Jiayi Chen State Key Lab for Novel Software Technology, Nanjing University, Zhixing Wang State Key Lab for Novel Software Technology, Nanjing University, yuchen jiang , Tian Zhang Nanjing University, Jun Pang University of Luxembourg, Minxue Pan Nanjing University, Nitsan Amit Hebrew University | ||
16:00 20mPaper | Goal-oriented Knowledge Reuse via Curriculum Evolution for Reinforcement Learning-based Adaptation Technical Track Jialong Li Waseda University, Japan, Mingyue Zhang Peking University, China, Zhenyu Mao Waseda University, Haiyan Zhao Peking University, Zhi Jin Peking University, Shinichi Honiden Waseda University / National Institute of Informatics, Japan, Kenji Tei Waseda University |