BugsInDLLs : A Database of Reproducible Bugs in Deep Learning Libraries to Enable Systematic Evaluation of Testing Techniques (ISSTA 2025 - Tool Demonstrations)

Wed 25 - Sat 28 June 2025 Trondheim, Norway

co-located with FSE 2025

Who

M M Abid Naziri, Aman Kumar Singh, Benjamin Wu, Feiran Qin, Saikat Dutta, Marcelo d'Amorim

Track

ISSTA 2025 Tool Demonstrations

Abstract

AI-enabled applications are prolific today. Deep Learning (DL) libraries, such as PyTorch and Tensorflow, provide the building blocks for the AI components of these applications. As any piece of software, these libraries can be buggy. An impressive number of bugfinding techniques to address this problem have been proposed, but the lack of a curated set of reproducible bugs in DL libraries hinders credible evaluation of these techniques. We present BugsInDLLs, a database of curated reproducible bugs to fill that gap. Unique challenges exist in this context, such as installing drivers of specific CUDA versions to reproduce certain GPU-related bugs. Our dataset currently consists of 112 environments to reproduce bugs across three popular DL libraries, namely, JAX, Tensorflow, and PyTorch.

M M Abid Naziri

North Carolina State University

United States

Aman Kumar Singh

Amrita Vishwa Vidyapeetham

India

Benjamin Wu

Feiran Qin

Saikat Dutta

Cornell University

United States

Marcelo d'Amorim

North Carolina State University

United States

BugsInDLLs : A Database of Reproducible Bugs in Deep Learning Libraries to Enable Systematic Evaluation of Testing Techniques