Deep Learning (DL) techniques have been widely deployed in many application domains. The growing size and complexity of DL models demand distributed training. Because DL training is complex, software implementing distributed DL training is error-prone. It is therefore crucial to test distributed deep learning software to improve its reliability and quality. To address this issue, we propose D3, a differential testing technique that leverages a distributed equivalence rule we create to test distributed deep learning software. The rationale is that the same model trained with the same model input under different distributed settings should produce equivalent prediction output within certain thresholds; divergent output indicates potential bugs in the distributed deep learning software. D3 automatically generates a diverse set of distributed settings, DL models, and model inputs to test distributed deep learning software. Our evaluation on two of the most popular DL libraries, i.e., PyTorch and TensorFlow, shows that D3 detects 21 bugs, including 12 previously unknown bugs.
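
To make the distributed equivalence rule concrete, the following is a minimal, illustrative sketch in PyTorch (not the D3 implementation): a toy model takes one training step on a full batch in a single-process setting, then takes the same step in a simulated two-worker data-parallel setting that averages per-shard gradients, and the resulting predictions are compared within a tolerance. The model, data, learning rate, and tolerance are all illustrative assumptions.

```python
# Illustrative sketch of the distributed equivalence oracle, assuming a toy
# linear model, a mean-reduction loss, and a simulated 2-worker data-parallel
# setting. None of these choices come from the D3 paper itself.
import copy
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(8, 4)            # toy model input
y = torch.randn(8, 1)            # toy targets

base = nn.Linear(4, 1)           # shared initial weights for both settings
single = copy.deepcopy(base)     # setting 1: single-device training
parallel = copy.deepcopy(base)   # setting 2: simulated 2-worker data parallelism
loss_fn = nn.MSELoss()           # mean reduction, so shard gradients average exactly
lr = 0.1

# Setting 1: one full-batch SGD step.
loss_fn(single(x), y).backward()
with torch.no_grad():
    for p in single.parameters():
        p -= lr * p.grad

# Setting 2: split the batch across two "workers", average their gradients
# (as data-parallel training does), then apply the same SGD step.
grads = [torch.zeros_like(p) for p in parallel.parameters()]
for xs, ys in ((x[:4], y[:4]), (x[4:], y[4:])):
    parallel.zero_grad()
    loss_fn(parallel(xs), ys).backward()
    for g, p in zip(grads, parallel.parameters()):
        g += p.grad / 2
with torch.no_grad():
    for g, p in zip(grads, parallel.parameters()):
        p -= lr * g

# Differential oracle: predictions under the two settings should agree within
# a small threshold; a larger discrepancy would flag a potential bug in the
# distributed training stack.
with torch.no_grad():
    assert torch.allclose(single(x), parallel(x), atol=1e-5), \
        "distributed equivalence violated"
```

In the actual technique, the two sides of the comparison come from genuinely different distributed settings (e.g., different numbers of workers or parallelization strategies) rather than a single-process simulation, and the threshold accounts for expected floating-point divergence.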