Improving Graph Learning-Based Fault Localization with Tailored Semi-Supervised Learning (FSE 2025 - Research Papers)

Mon 23 - Fri 27 June 2025 Trondheim, Norway

Who

Chun Li, Hui Li, Zhong Li, Minxue Pan, Xuandong Li

Track

FSE 2025 Research Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 24 Jun 2025 16:30 - 16:50 at Aurora B - Failure and Fault Chair(s): Lars Grunske

Abstract

Due to advancements in graph neural networks, graph learning-based fault localization (GBFL) methods have achieved promising results. However, as these methods are supervised learning paradigms and deep learning is typically data-hungry, they can only be trained on fully labeled large-scale datasets. This is impractical because labeling failed tests is similar to manual fault localization, which is time-consuming and labor-intensive, leading to only a small portion of failed tests that can be labeled within limited budgets. These data labeling limitations would lead to the sub-optimal effectiveness of supervised GBFL techniques. Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model’s performance and address data labeling limitations. However, as these methods are not specifically designed for fault localization, directly utilizing them might lead to sub-optimal effectiveness. In response, we propose a novel semi-supervised GBFL framework, Legato. Legato first leverages the attention mechanism to identify and augment likely fault-unrelated sub-graphs in unlabeled graphs and then quantifies the suspiciousness distribution of unlabeled graphs to estimate pseudo-labels. Through training the model on augmented unlabeled graphs and pseudo-labels, Legato can utilize the unlabeled data to improve the effectiveness of fault localization and address the restrictions in data labeling. Through extensive evaluations against 3 baselines SSL methods, Legato demonstrates superior performance by outperforming all the methods in comparison.

DOI

https://doi.org/10.1145/3715788

Chun Li

Nanjing University

China

Hui Li

Samsung Electronics (China) R&D Centre

Zhong Li

China

Minxue Pan

Nanjing University

China

Xuandong Li

Nanjing University

China

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 24 Jun
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

16:00 - 17:40	Failure and FaultDemonstrations / Research Papers / Ideas, Visions and Reflections / Journal First at Aurora B Chair(s): Lars Grunske Humboldt-Universität zu Berlin

16:00 10m Talk		AgentFM: Role-Aware Failure Management for Distributed Databases with LLM-Driven Multi-Agents Ideas, Visions and Reflections Lingzhe Zhang Peking University, China, Yunpeng Zhai Alibaba Group, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Xiaosong Huang Peking University, Chiming Duan Peking University, Ying Li School of Software and Microelectronics, Peking University, Beijing, China
16:10 20m Talk		ReproCopilot: LLM-Driven Failure Reproduction with Dynamic Refinement Research Papers Tanakorn Leesatapornwongsa Microsoft Research, Fazle Faisal Microsoft Research, Suman Nath Microsoft Research DOI
16:30 20m Talk		Improving Graph Learning-Based Fault Localization with Tailored Semi-Supervised Learning Research Papers Chun Li Nanjing University, Hui Li Samsung Electronics (China) R&D Centre, Zhong Li , Minxue Pan Nanjing University, Xuandong Li Nanjing University DOI
16:50 20m Talk		Towards Understanding Docker Build Faults in Practice: Symptoms, Root Causes, and Fix Patterns Research Papers Yiwen Wu National University of Defense Technology, Yang Zhang National University of Defense Technology, China, Tao Wang National University of Defense Technology, Bo Ding National University of Defense Technology, Huaimin Wang DOI
17:10 20m Talk		One Sentence Can Kill the Bug: Auto-replay Mobile App Crashes from One-sentence Overviews Journal First Yuchao Huang , Junjie Wang Institute of Software at Chinese Academy of Sciences, Zhe Liu Institute of Software, Chinese Academy of Sciences, Mingyang Li Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Song Wang York University, Chunyang Chen TU Munich, Yuanzhe Hu Institute of Software, Chinese Academy of Sciences, Qing Wang Institute of Software at Chinese Academy of Sciences
17:30 10m Talk		Steering the Future: A Catalog of Failures in Deep Learning-Enabled Robotic Navigation Systems Demonstrations Meriel von Stein University of Virginia, Yili Bai University of Virginia, Trey Woodlief University of Virginia, United States, Sebastian Elbaum University of Virginia

Information for Participants

Tue 24 Jun 2025 16:00 - 17:40 at Aurora B - Failure and Fault Chair(s): Lars Grunske

Info for room Aurora B:

Aurora B is the second room in the Aurora wing.

When facing the main Cosmos Hall, access to the Aurora wing is on the right, close to the side entrance of the hotel.