R-Log: Incentivizing Log Analysis Capability in LLMs via Reasoning-based Reinforcement Learning
The growing complexity of log data in modern software systems has prompted the use of Large Language Models (LLMs) for automated log analysis. Current approaches typically rely on direct supervised fine-tuning (SFT) on log-label pairs. However, this exacerbates the domain discrepancy between general-purpose LLMs and specialized log data, causing overfitting. Furthermore, SFT’s imbalanced loss computation often allows lengthy contexts to overwhelm critical, concise details in model answers, leading to hallucinations. To address these limitations, we propose R-Log, a novel reasoning-based paradigm that mirrors the structured, step-by-step analytical process of human engineers. This approach enhances generalizability by learning the underlying rules behind conclusions. We further employ Reinforcement Learning (RL) to optimize the model within a simulated O&M environment, thereby reducing hallucinations by directly rewarding correct outcomes. R-Log is first cold-started on a curated dataset of 2k+ reasoning trajectories, guided by 13 strategies from manual O&M practices, to establish an initial reasoning capability. This ability is then refined via RL using a joint reward function. Empirical evaluations on real-world logs show that R-Log outperforms existing methods across five log analysis tasks, particularly in unseen scenarios (by 228.05%). We also designed R-Log-fast with 5x speedup while keeping 93% of the efficacy. R-Log is deployed in Huawei with interpretability as the core feature.
Wed 15 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
11:00 - 12:30 | AI for Software Engineering 3SE In Practice (SEIP) at Europa II Chair(s): Eric Bodden Heinz Nixdorf Institute at Paderborn University & Fraunhofer IEM | ||
11:00 15mTalk | Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices SE In Practice (SEIP) Lingzhe Zhang Peking University, China, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Yunpeng Zhai Alibaba Group, Leyi Pan Tsinghua University, Chiming Duan Peking University, Minghua He Peking University, Mengxi Jia Institute of Artificial Intelligence, China Telecom, Ying Li School of Software and Microelectronics, Peking University, Beijing, China | ||
11:15 15mTalk | R-Log: Incentivizing Log Analysis Capability in LLMs via Reasoning-based Reinforcement Learning SE In Practice (SEIP) Yilun Liu Huawei co. LTD, Chen Ziang Huawei co. LTD; Nankai University, Song Xu Huawei co. LTD, Minggui He Huawei co. LTD, Shimin Tao University of Science and Technology of China; Huawei co. LTD, Weibin Meng Huawei co. LTD, Yuming Xie Huawei co. LTD, Tao Han Huawei co. LTD, Chunguang Zhao Huawei co. LTD, Jingzhou Du Huawei co. LTD, Daimeng Wei Huawei co. LTD, Shenglin Zhang Nankai University, Yongqian Sun Nankai University Media Attached | ||
11:30 15mTalk | LLM-Based Automated Diagnosis Of Integration Test Failures At Google SE In Practice (SEIP) Pre-print | ||
11:45 15mTalk | Automated Bug Frame Retrieval from Gameplay Videos Using Vision-Language Models SE In Practice (SEIP) Wentao Lu University of Alberta, Alexander Senchenko Electronic Arts, Abram Hindle University of Alberta, Cor-Paul Bezemer University of Alberta | ||
12:00 15mTalk | Finding the Needle in the Crash Stack: Industrial-Scale Crash Root Cause Localization with AutoCrashFL SE In Practice (SEIP) Sungmin Kang NUS, Sumi Yun SAP Labs Korea, Jingun Hong SAP Labs Korea, Shin Yoo KAIST, Gabin An Korea University Pre-print | ||
12:15 15mTalk | PerFrame: Monitoring GUI Loading Performance in Mobile Apps via Semantic Distinguish SE In Practice (SEIP) Jianing Liu Fudan University, Shiyu Guo , Yongxiang Hu Fudan University, Yu Zhang Meituan, Hailiang Jin Meituan Inc., Juxing Yuan Meituan Inc., Yangfan Zhou Fudan University, Xin Wang Fudan University Media Attached | ||