AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection (ASE 2023 - Research Papers)

Who

Yintong Huo, Yichen LI, Yuxin Su, Pinjia He, Zifan Xie, Michael Lyu

Track

ASE 2023 Research Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.


When

Tue 12 Sep 2023 16:06 - 16:18 at Room E - Testing Tools and Techniques Chair(s): Tim Menzies

Abstract

The rapid progress of modern computing systems has led to a growing interest in informative run-time logs. Various log-based anomaly detection techniques have been proposed to ensure software reliability. However, their implementation in the industry has been limited due to the lack of high-quality public log resources as training datasets. While some log datasets are available for anomaly detection, they suffer from limitations in (1) comprehensiveness of log events; (2) scalability over diverse systems; and (3) flexibility of log utility. To address these limitations, we propose AutoLog, the first automated log generation methodology for anomaly detection. AutoLog uses program analysis to generate run-time log sequences without actually running the system. AutoLog starts with probing comprehensive logging statements associated with the call graphs of an application. Then, it constructs execution graphs for each method after pruning the call graphs to find log-related execution paths in a scalable manner. Finally, AutoLog propagates the anomaly label to each acquired execution path based on human knowledge. It generates flexible log sequences by walking along the log execution paths with controllable parameters. Experiments on 50 popular Java projects show that AutoLog acquires significantly more (9x-58x) log events than existing log datasets from the same system, and generates log messages much faster (15x) with a single machine than existing passive data collection approaches. We hope AutoLog can facilitate the benchmarking and adoption of automated log analysis techniques.
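The pipeline described in the abstract (probe logging statements, prune the call graph to log-related methods, enumerate log execution paths, propagate anomaly labels, then walk the paths with controllable parameters) can be illustrated with a toy sketch. This is not AutoLog's implementation: the `METHODS` program model, the event strings, the rule that a path is anomalous when it contains a human-flagged event, and the `n` / `anomaly_ratio` / `seed` parameters are all assumptions made for illustration. The real tool works on Java projects through program analysis.

```python
import random

# Hypothetical toy program. Each method has one or more alternative bodies
# (branches); a step is ("log", event_template) or ("call", callee_name).
METHODS = {
    "main": [[("log", "server starting"), ("call", "load_config"), ("call", "handle")]],
    "load_config": [[("call", "parse")]],
    "parse": [[]],  # neither logs nor reaches a logging method
    "handle": [
        [("log", "request received"), ("log", "request served")],
        [("log", "request received"), ("log", "connection timed out")],
    ],
}

# Assumed stand-in for "human knowledge": events flagged as anomalous by hand.
ANOMALOUS_EVENTS = {"connection timed out"}


def log_related(methods):
    """Fixpoint: keep methods that log directly or call a log-related method."""
    related = set()
    changed = True
    while changed:
        changed = False
        for name, branches in methods.items():
            if name in related:
                continue
            if any(kind == "log" or target in related
                   for body in branches for kind, target in body):
                related.add(name)
                changed = True
    return related


def execution_paths(methods, related, entry):
    """Enumerate log event sequences from `entry`, skipping pruned callees.

    Assumes an acyclic call graph; recursion would need a depth bound.
    """
    paths = []
    for body in methods[entry]:
        partial = [[]]
        for kind, target in body:
            if kind == "log":
                partial = [p + [target] for p in partial]
            elif target in related:
                sub = execution_paths(methods, related, target)
                partial = [p + s for p in partial for s in sub]
        paths.extend(partial)
    return paths


def label_paths(paths, anomalous_events):
    """Mark a path anomalous if it contains any hand-flagged event."""
    return [(p, any(e in anomalous_events for e in p)) for p in paths]


def generate(labeled, n, anomaly_ratio, seed=0):
    """Sample n labeled sequences; assumes both classes are non-empty."""
    rng = random.Random(seed)
    normal = [p for p, bad in labeled if not bad]
    anomalous = [p for p, bad in labeled if bad]
    n_bad = round(n * anomaly_ratio)
    out = [(rng.choice(anomalous), True) for _ in range(n_bad)]
    out += [(rng.choice(normal), False) for _ in range(n - n_bad)]
    rng.shuffle(out)
    return out


related = log_related(METHODS)
paths = execution_paths(METHODS, related, "main")
labeled = label_paths(paths, ANOMALOUS_EVENTS)
dataset = generate(labeled, n=10, anomaly_ratio=0.3, seed=1)
```

In this toy model, `load_config` and `parse` are pruned because no log statement is reachable from them, so path enumeration only descends into `handle`. Nothing is executed: the sequences come from walking the static structure, which is the property the abstract emphasizes.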

Link to Preprint

https://arxiv.org/pdf/2308.09324.pdf

Yintong Huo

The Chinese University of Hong Kong

Yichen LI

The Chinese University of Hong Kong

Yuxin Su

Sun Yat-sen University

Pinjia He

Chinese University of Hong Kong, Shenzhen

China

Zifan Xie

Huazhong University of Science and Technology

China

Michael Lyu

The Chinese University of Hong Kong


Session Program

Tue 12 Sep
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna

15:30 - 17:00	Testing Tools and Techniques (NIER Track / Research Papers / Tool Demonstrations) at Room E
Chair(s): Tim Menzies (North Carolina State University)

15:30 12m Talk	Modeling Programmer Attention as Scanpath Prediction (NIER Track)
	Aakash Bansal (University of Notre Dame), Chia-Yi Su (University of Notre Dame), Zachary Karas (Vanderbilt University), Yifan Zhang (Vanderbilt University), Yu Huang (Vanderbilt University), Toby Jia-Jun Li (University of Notre Dame), Collin McMillan (University of Notre Dame)

15:42 12m Talk	On Automated Assistants for Software Development: The Role of LLMs (NIER Track)
	Mira Leung (University of British Columbia), Gail Murphy (University of British Columbia)

15:54 12m Talk	SmartBugs 2.0: An Execution Framework for Weakness Detection in Ethereum Smart Contracts (Tool Demonstrations)
	Monika di Angelo (TU Wien), Thomas Durieux (TU Delft), João F. Ferreira (INESC-ID and IST, University of Lisbon), Gernot Salzer (TU Wien)

16:06 12m Talk	AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection (Research Papers)
	Yintong Huo (The Chinese University of Hong Kong), Yichen LI (The Chinese University of Hong Kong), Yuxin Su (Sun Yat-sen University), Pinjia He (Chinese University of Hong Kong, Shenzhen), Zifan Xie (Huazhong University of Science and Technology), Michael Lyu (The Chinese University of Hong Kong)

16:18 12m Talk	Aster: Automatic Speech Recognition System Accessibility Testing for Stutterers (Research Papers)
	Yi Liu (Nanyang Technological University), Yuekang Li (University of New South Wales), Gelei Deng (Nanyang Technological University), Felix Juefei-Xu (Meta AI), Yao Du (University of California, Irvine), Cen Zhang (Nanyang Technological University), Chengwei Liu (Nanyang Technological University), Yeting Li (Institute of Information Engineering at Chinese Academy of Sciences; University of Chinese Academy of Sciences), Lei Ma (University of Alberta), Yang Liu (Nanyang Technological University)

16:30 12m Talk	Software Entity Recognition with Noise-Robust Learning (Research Papers, recorded talk)
	Tai Nguyen (University of Pennsylvania), Yifeng Di (Purdue University), Joohan Lee (University of Southern California), Muhao Chen (University of Southern California), Tianyi Zhang (Purdue University)