ICSE 2023
Sun 14 - Sat 20 May 2023 Melbourne, Australia
Wed 17 May 2023 17:00 - 17:15 at Meeting Room 101 - Software logging Chair(s): Hongyu Zhang

Logs, being run-time information automatically generated by software, record system events and activities with their timestamps. Before obtaining more insights into the run-time status of the software, a fundamental step of log analysis, called log parsing, is employed to extract structured templates and parameters from the semi-structured raw log messages. However, current log parsers are all syntax-based and regard each message as a character string, ignoring the semantic information included in parameters and templates.
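
To make the template/parameter split concrete, here is a minimal sketch of a syntax-based parser in Python. It is not taken from the paper: the masking rules in `PARAM_PATTERNS`, the `parse_log` helper, and the sample log line are all illustrative assumptions.

```python
import re

# Hypothetical masking rules (an assumption for illustration): each regex
# describes a token shape that is treated as a parameter.
PARAM_PATTERNS = [
    re.compile(r"^\d+\.\d+\.\d+\.\d+(:\d+)?$"),  # IPv4 address, optional port
    re.compile(r"^blk_-?\d+$"),                   # block identifier
    re.compile(r"^\d+$"),                         # plain integer
]


def parse_log(message):
    """Split a raw log message into a (template, parameters) pair."""
    template_tokens = []
    parameters = []
    for token in message.split():
        if any(pattern.match(token) for pattern in PARAM_PATTERNS):
            # Parameter: replace it with a wildcard in the template.
            template_tokens.append("<*>")
            parameters.append(token)
        else:
            # Constant text: keep it as part of the template.
            template_tokens.append(token)
    return " ".join(template_tokens), parameters


# Made-up log line, used only to exercise the sketch.
template, params = parse_log(
    "Received block blk_3587 of size 67108864 from 10.251.42.84"
)
print(template)
print(params)
```

The output pairs a wildcard template with a flat list of strings. Nothing in it records that the second parameter is a size or that the third is a source address, which is the kind of semantic information the abstract says syntax-based parsers ignore.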

Thus, we propose the semantic-based parser SemParser to unlock the critical bottleneck of mining semantics from log messages. It contains two steps, an end-to-end semantic miner and a joint parser. Specifically, the first step aims to identify explicit semantics inside a single log, and the second step is responsible for jointly inferring implicit semantics and computing structural outputs based on the contextual knowledge base. To analyze the effectiveness of our semantic parser, we first demonstrate that it can derive rich semantics from log messages collected from six widely-applied systems with an average F1 score of 0.985. Then, we conduct two representative downstream tasks, showing that current downstream models improve their performance with appropriately extracted semantics by 1.2%-11.7% and 8.65% on two anomaly detection datasets and a failure identification dataset, respectively. We believe these findings provide insights into semantically understanding log messages for the log analysis community.

Wed 17 May

Displayed time zone: Hobart

15:45 - 17:15
Software logging (Technical Track) at Meeting Room 101
Chair(s): Hongyu Zhang The University of Newcastle
15:45
15m
Talk
PILAR: Studying and Mitigating the Influence of Configurations on Log Parsing
Technical Track
Hetong Dai Concordia University, Yiming Tang Concordia University, Heng Li Polytechnique Montréal, Weiyi Shang University of Waterloo
16:00
15m
Talk
Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction
Technical Track
Zhenhao Li Concordia University, Chuan Luo Beihang University, Tse-Hsun (Peter) Chen Concordia University, Weiyi Shang University of Waterloo, Shilin He Microsoft Research, Qingwei Lin Microsoft Research, Dongmei Zhang Microsoft Research
16:15
15m
Talk
On the Temporal Relations between Logging and Code
Technical Track
Zishuo Ding Concordia University, Yiming Tang Concordia University, Yang Li Beijing University of Posts and Telecommunications, Heng Li Polytechnique Montréal, Weiyi Shang University of Waterloo
Pre-print
16:30
15m
Talk
How Do Developers' Profiles and Experiences Influence their Logging Practices? An Empirical Study of Industrial Practitioners
Technical Track
Guoping Rong Nanjing University, Shenghui Gu Nanjing University, Haifeng Shen Australian Catholic University, He Zhang Nanjing University, Hongyu Kuang Nanjing University
16:45
15m
Talk
When to Say What: Learning to Find Condition-Message Inconsistencies
Technical Track
Islem BOUZENIA University of Stuttgart, Michael Pradel University of Stuttgart
Pre-print
17:00
15m
Talk
A Semantic-aware Parsing Approach for Log Analytics
Technical Track
Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Cheryl Lee The Chinese University of Hong Kong, Michael Lyu The Chinese University of Hong Kong
Pre-print