LogSage: An LLM-Based Framework for CI/CD Failure Detection and Remediation with Industrial Validation
This program is tentative and subject to change.
Continuous Integration and Deployment (CI/CD) pipelines are critical to modern software engineering, yet diagnosing and resolving their failures remains complex and labor-intensive. We present LogSage, the first end-to-end LLM-powered framework for root cause analysis (RCA) and automated remediation of CI/CD failures. LogSage employs a token-efficient log preprocessing pipeline to filter noise and extract critical errors, then performs structured diagnostic prompting for accurate RCA. For solution generation, it leverages retrieval-augmented generation (RAG) to reuse historical fixes and invokes automation fixes via LLM tool-calling.
On a newly curated benchmark of 367 GitHub CI/CD failures, LogSage achieves over 98% precision, near-perfect recall, and an F1 improvement of more than 38% points in the RCA stage, compared with recent LLM-based baselines. In a year-long industrial deployment at ByteDance, it processed over 1.07M executions, with end-to-end precision exceeding 80%. These results demonstrate that LogSage provides a scalable and practical solution for automating CI/CD failure management in real-world DevOps workflows.
This program is tentative and subject to change.
Mon 17 NovDisplayed time zone: Seoul change
16:00 - 16:50 | |||
16:00 10mTalk | LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems Industry Showcase Zhihan Jiang The Chinese University of Hong Kong, Jinyang Liu ByteDance, Yichen LI ByteDance, Haiyu Huang CUHK, Xiao He Bytedance, Tieying Zhang ByteDance, Jianjun Chen Bytedance, Yi Li Nanyang Technological University, Rui Shi Bytedance, Michael Lyu The Chinese University of Hong Kong | ||
16:10 10mTalk | Walk the Talk: Is Your Log-based Software Reliability Maintenance System Really Reliable? NIER Track Minghua He Peking University, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Chiming Duan Peking University, Pei Xiao Peking University, Lingzhe Zhang Peking University, China, Kangjin Wang Alibaba Group, Yifan Wu Peking University, Ying Li School of Software and Microelectronics, Peking University, Beijing, China, Gang Huang Peking University | ||
16:20 10mTalk | Automated Proactive Logging Quality Improvement for Large-Scale Codebases Industry Showcase Yichen LI ByteDance, Jinyang Liu ByteDance, Junsong Pu School of Software Engineering, Sun Yat-sen University, Zhihan Jiang The Chinese University of Hong Kong, Zhuangbin Chen Sun Yat-sen University, Xiao He Bytedance, Tieying Zhang ByteDance, Jianjun Chen Bytedance, Yi Li Nanyang Technological University, Rui Shi Bytedance, Michael Lyu The Chinese University of Hong Kong | ||
16:30 10mTalk | LogSage: An LLM-Based Framework for CI/CD Failure Detection and Remediation with Industrial Validation Industry Showcase Juntao Luo ByteDance, Weiyuan Xu East China Normal University, ByteDance, Tao Huang ByteDance, Kaixin Sui ByteDance, Jie Geng ByteDance, Qijun Ma ByteDance, Isami Akasaka ByteDance, Xiaoxue Shi ByteDance, Jing Tang ByteDance, Peng Cai East China Normal University) | ||
16:40 10mTalk | From Technical Excellence to Practical Adoption: Lessons Learned Building an ML-Enhanced Trace Analysis Tool Industry Showcase Kaveh Shahedi Polytechnique Montréal, Matthew Khouzam Ericsson AB, Heng Li Polytechnique Montréal, Maxime Lamothe Polytechnique Montreal, Foutse Khomh Polytechnique Montréal | ||