Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models (CloudIntelligence 2021)

Who

Jasmin Bogatinovski, Harald Ott, Alexander Acker, Sasho Nedelkoski , Odej Kao

Track

CloudIntelligence 2021

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sat 29 May 2021 14:15 - 14:30 at CloudIntelligence Room - Technical Paper Session #2 Chair(s): Qingwei Lin

Abstract

Anomalies or failures in large computer systems, such as the cloud, have an impact on a large number of users that communicate, compute, and store information. Therefore, timely and accurate anomaly detection is necessary for reliability, security, safe operation, and mitigation of losses in these increasingly important systems. Recently, the evolution of industry opens up several problems that need to be tackled including (1) addressing the software evolution due software upgrades, and (2) solving the cold-start problem, where data from the system of interest is not available. In this paper, we propose a framework for anomaly detection in log data, as a major troubleshooting source of system information. To that end, we utilize pre-trained general-purpose language models to preserve the semantics of log messages and map them into log vector embeddings. The key idea is that these representations for the logs are robust and less invariant to changes in the logs, and therefore, result in a better generalization of the anomaly detection models. We perform several experiments on a cloud dataset evaluating different language models for obtaining numerical log representations such as BERT, GPT-2, and XL. The robustness is evaluated by gradually altering log messages, to simulate a change in semantics. Our results show that the proposed approach achieves high performance and robustness, which opens up possibilities for future research in this direction.

Jasmin Bogatinovski

Harald Ott

TU Berlin

Alexander Acker

Sasho Nedelkoski

TU Berlin

Odej Kao

Technische Universität Berlin

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sat 29 May
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

14:15 - 15:00	Technical Paper Session #2CloudIntelligence 2021 at CloudIntelligence Room Chair(s): Qingwei Lin Microsoft Research, Beijing, China

14:15 15m Paper		Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models CloudIntelligence 2021 Jasmin Bogatinovski , Harald Ott TU Berlin, Alexander Acker , Sasho Nedelkoski TU Berlin, Odej Kao Technische Universität Berlin
14:30 15m Paper		Rapid Trend Prediction for Large-Scale Cloud Database KPIs by Clustering CloudIntelligence 2021 Xiaoling Wang Northwestern Polytechnical University, Ning Li School of Computer Science, Northwestern Polytechnical University, Lijun Zhang Northwestern Polytechnical University, Xiaofang Zhang Northwestern Polytechnical University, Qiong Zhao Bank of Communications
14:45 15m Paper		Learning Dependencies in Distributed Cloud Applications to Identify and Localize Anomalies CloudIntelligence 2021 Dominik Scheinert Technische Universität Berlin, Alexander Acker , Lauritz Thamsen TU Berlin, Morgan Geldenhuys Technische Universität Berlin, Odej Kao Technische Universität Berlin

Information for Participants

Sat 29 May 2021 14:15 - 15:00 at CloudIntelligence Room - Technical Paper Session #2 Chair(s): Qingwei Lin

Info for room CloudIntelligence Room:

Go directly to this room on Clowdr