ASE 2025
Sun 16 - Thu 20 November 2025 Seoul, South Korea

This program is tentative and subject to change.

Mon 17 Nov 2025 16:00 - 16:10 at Grand Hall 6 - Log & Dependency 2

Effective alert diagnosis is essential for ensuring the reliability of large-scale online service systems. However, on-call engineers are often burdened with manually inspecting massive volumes of logs to identify root causes. While various automated tools have been proposed, they struggle in practice due to alert-agnostic log scoping and the inability to organize complex data effectively for reasoning. To overcome these limitations, we introduce LogPilot, an intent-aware and scalable log-based framework powered by Large Language Models (LLMs) for automated alert diagnosis. LogPilot introduces an intent-aware approach, interpreting the logic in alert definitions (e.g., PromQL) to precisely identify causally related logs and requests. To achieve scalability, it reconstructs each request’s execution into a spatiotemporal log chain, clusters similar chains to identify recurring execution patterns, and provides representative samples to the LLMs for diagnosis. This clustering-based approach ensures the input is both rich in diagnostic detail and compact enough to fit within the LLM’s context window. Evaluated on real-world alerts from Volcano Engine Cloud, LogPilot improves the usefulness of root cause summarization by 50.34% and exact localization accuracy by 54.79% over state-of-the-art methods. With a diagnosis time under one minute and a cost of only $0.074 per alert, LogPilot has been successfully deployed in production, offering an automated and practical solution for service alert diagnosis.

This program is tentative and subject to change.

Mon 17 Nov

Displayed time zone: Seoul change

16:00 - 16:50
16:00
10m
Talk
LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems
Industry Showcase
Zhihan Jiang The Chinese University of Hong Kong, Jinyang Liu ByteDance, Yichen LI ByteDance, Haiyu Huang CUHK, Xiao He Bytedance, Tieying Zhang ByteDance, Jianjun Chen Bytedance, Yi Li Nanyang Technological University, Rui Shi Bytedance, Michael Lyu The Chinese University of Hong Kong
16:10
10m
Talk
Walk the Talk: Is Your Log-based Software Reliability Maintenance System Really Reliable?
NIER Track
Minghua He Peking University, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Chiming Duan Peking University, Pei Xiao Peking University, Lingzhe Zhang Peking University, China, Kangjin Wang Alibaba Group, Yifan Wu Peking University, Ying Li School of Software and Microelectronics, Peking University, Beijing, China, Gang Huang Peking University
16:20
10m
Talk
Automated Proactive Logging Quality Improvement for Large-Scale Codebases
Industry Showcase
Yichen LI ByteDance, Jinyang Liu ByteDance, Junsong Pu School of Software Engineering, Sun Yat-sen University, Zhihan Jiang The Chinese University of Hong Kong, Zhuangbin Chen Sun Yat-sen University, Xiao He Bytedance, Tieying Zhang ByteDance, Jianjun Chen Bytedance, Yi Li Nanyang Technological University, Rui Shi Bytedance, Michael Lyu The Chinese University of Hong Kong
16:30
10m
Talk
LogSage: An LLM-Based Framework for CI/CD Failure Detection and Remediation with Industrial Validation
Industry Showcase
Juntao Luo ByteDance, Weiyuan Xu East China Normal University, ByteDance, Tao Huang ByteDance, Kaixin Sui ByteDance, Jie Geng ByteDance, Qijun Ma ByteDance, Isami Akasaka ByteDance, Xiaoxue Shi ByteDance, Jing Tang ByteDance, Peng Cai East China Normal University)
16:40
10m
Talk
From Technical Excellence to Practical Adoption: Lessons Learned Building an ML-Enhanced Trace Analysis Tool
Industry Showcase
Kaveh Shahedi Polytechnique Montréal, Matthew Khouzam Ericsson AB, Heng Li Polytechnique Montréal, Maxime Lamothe Polytechnique Montreal, Foutse Khomh Polytechnique Montréal