An Exploratory Study on How Non-Determinism in Large Language Models Affects Log Parsing
Most software systems used in production generate system logs that provide a rich source of information about the status and execution behavior of the system. These logs are commonly used to ensure the reliability and maintainability of software systems. The first step toward automated log analysis is generally log parsing, which aims to transform unstructured log messages into structured log templates and extract the corresponding parameters. Recently, Large Language Models (LLMs) such as ChatGPT have shown promising results on a wide range of software engineering tasks, including log parsing. However, the extent to which non-determinism influences log parsing using LLMs remains unclear. In particular, it is important to investigate whether LLMs behave consistently when faced with the same log message multiple times. In this study, we investigate the impact of non-determinism in state-of-the-art LLMs while performing log parsing. Specifically, we select six LLMs, including both paid proprietary and free-to-use models, and evaluate their non-determinism on 16 system logs obtained from a selection of mature open-source projects. The results of our study reveal varying degrees of non-determinism among models. Moreover, they show that there is no guarantee of deterministic results, even with a temperature of zero.
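The study checks whether repeated queries with the same log message yield identical templates. A minimal sketch of one plausible way to quantify this is shown below; the function name, the `<*>` parameter placeholder, and the example templates are illustrative assumptions, not the paper's actual metric or data:

```python
from collections import Counter

def template_consistency(outputs):
    """Fraction of repeated parses that match the most frequent template.

    1.0 means the model answered identically on every run for this log
    message; lower values indicate non-determinism across repeated queries.
    """
    if not outputs:
        raise ValueError("need at least one parsed template")
    most_common_count = Counter(outputs).most_common(1)[0][1]
    return most_common_count / len(outputs)

# Example: five repeated parses of the same log message, where one run
# produced a diverging template (values are made up for illustration).
runs = [
    "Connection from <*> closed",
    "Connection from <*> closed",
    "Connection from <*> closed",
    "Connection from <*>:<*> closed",
    "Connection from <*> closed",
]
print(template_consistency(runs))  # 0.8
```

In a real experiment, each entry in `runs` would come from a separate LLM API call with the same prompt and a fixed temperature, so the metric directly reflects the model's (non-)determinism.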
Mon 15 Apr (displayed time zone: Lisbon)
11:00 - 12:30 | Late Morning Session (InteNSE) at Daciano da Costa
Chair(s): Reyhaneh Jabbarvand (University of Illinois at Urbana-Champaign), Saeid Tizpaz-Niari (University of Texas at El Paso)

11:00 (70m) | Keynote: Assured LLM-Based Software Engineering
Mark Harman (Meta Platforms, Inc. and UCL)

12:10 (20m) | Paper: An Exploratory Study on How Non-Determinism in Large Language Models Affects Log Parsing
Merve Astekin (Simula Research Laboratory), Max Hort (Simula Research Laboratory), Leon Moonen (Simula Research Laboratory and BI Norwegian Business School)