Developing a Taxonomy for Advanced Log Parsing Techniques
Logs play a crucial role in software engineering, supporting tasks such as debugging, system comprehension, failure prediction, and anomaly detection. However, the inherently unstructured nature of logs presents significant challenges for extracting actionable insights. Despite the development of numerous log parsing techniques—including machine learning, pattern recognition, and heuristic approaches—consistent accuracy remains elusive, particularly when dealing with complex, diverse log formats. In this study, we address these challenges by conducting an in-depth analysis of the characteristics that lead to parsing errors. Using 16 log datasets and 8 distinct log parsers, we apply open coding to develop a comprehensive taxonomy of Log Event Characteristics (LECs) that frequently cause parsing inaccuracies. We evaluate how these characteristics impact different parsers and examine how the distribution of LECs contributes to the complexity of a dataset. The resulting taxonomy not only provides a foundation for developing more effective log parsing tools but also offers valuable insights for creating machine-friendly and human-readable logs, ultimately improving system diagnostics and reliability.
Mon 28 AprDisplayed time zone: Eastern Time (US & Canada) change
16:00 - 17:30 | Log Parsing, Bug Localisation, Review ComprehensionResearch Track / Early Research Achievements (ERA) at 205 Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Coen De Roover Vrije Universiteit Brussel, Gema Rodríguez-Pérez Department of Computer Science, Mathematics, Physics and Statistics, University of British Columbia, Okanagan Campus | ||
16:00 10mTalk | Developing a Taxonomy for Advanced Log Parsing Techniques Research Track Issam Sedki Concordia University, Wahab Hamou-Lhadj Concordia University, Montreal, Canada, Otmane Ait-Mohamed Concordia University, Naser Ezzati Jivan | ||
16:10 10mTalk | GELog:A GPT-Enhanced Log Representation Method for Anomaly Detection Research Track Wenwu Xu Institute of Information Engineering, Chinese Academy of Sciences and School of Cyberspace Security, University of Chinese Academy of Sciences, Peng Wang Institute of Information Engineering,Chinese Academy of Sciences, Haichao Shi Institute of Information Engineering,Chinese Academy of Sciences, Guoqiao Zhou Institute of Information Engineering,Chinese Academy of Sciences, Junliang Yao Institute of Information Engineering,Chinese Academy of Sciences, Xiao-Yu Zhang Institute of Information Engineering, Chinese Academy of Science | ||
16:20 10mTalk | Log Parsing using LLMs with Self-Generated In-Context Learning and Self-Correction Research Track Yifan Wu Peking University, Siyu Yu The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), Ying Li School of Software and Microelectronics, Peking University, Beijing, China Pre-print | ||
16:30 10mTalk | LLM-BL: Large Language Models are Zero-Shot Rankers for Bug Localization Research Track Zhengliang Li Nanjing University, Zhiwei Jiang Nanjing University, Qiguo Huang NanJing Audit University, Qing Gu Nanjing University | ||
16:40 10mTalk | Improved IR-based Bug Localization with Intelligent Relevance Feedback Research Track Pre-print | ||
16:50 6mTalk | Towards Enhancing IR-based Bug Localization Leveraging Texts and Multimedia from Bug Reports Early Research Achievements (ERA) Shamima Yeasmin University of Saskatchewan, Chanchal K. Roy University of Saskatchewan, Kevin Schneider University of Saskatchewan, Masud Rahman Dalhousie University, Kartik Mittal University of Saskatchewan, Ryder Hardy University of Saskatchewan Pre-print | ||
16:56 10mTalk | Building Bridges, Not Walls: Fairness-aware and Accurate Recommendation of Code Reviewers via LLM-based Agents Collaboration Research Track Luqiao Wang Xidian University, Qingshan Li Xidian University, Di Cui Xidian University, Mingkang Wang Xidian University, Yutong Zhao University of Central Missouri, Yongye Xu Xidian University, Huiying Zhuang Xidian University, Yangtao Zhou Xidian University, Lu Wang Xidian University | ||
17:06 10mTalk | Code Review Comprehension: Reviewing Strategies Seen Through Code Comprehension Theories Research Track Pavlina Wurzel Goncalves University of Zurich, Pooja Rani University of Zurich, Margaret-Anne Storey University of Victoria, Diomidis Spinellis Athens University of Economics and Business & Delft University of Technology, Alberto Bacchelli University of Zurich Pre-print | ||
17:16 10mTalk | KotSuite: Unit Test Generation for Kotlin Programs in Android Applications Research Track Feng Yang Wuhan University, Qi Xin Wuhan University, Zhilei Ren Dalian University of Technology, Jifeng Xuan Wuhan University | ||
17:26 4mLive Q&A | Session's Discussion: "Log Parsing, Bug Localisation, Review Comprehension" Research Track |