DiTOX: Fault Detection and Localization in the ONNX Optimizer
With over 760 stars on GitHub and being part of the official ONNX repository, the ONNX Optimizer is the default tool for applying graph-based optimizations to ONNX models. Despite its widespread use, its ability to maintain model accuracy during optimization has not been thoroughly investigated. In this work, we present DiTOX, a utility designed to automatically and comprehensively evaluate the correctness of the ONNX Optimizer. DiTOX adopts a straightforward yet powerful differential testing, \nick{fault localization,} and evaluation methodology, which can be readily adapted for use with other compiler optimizers. Specifically, DiTOX takes a collection of ONNX models, applies optimizations, and executes both the original and optimized versions across a user-defined input set, automatically capturing any issues encountered during optimization. When discrepancies in accuracy arise, DiTOX iteratively isolates the responsible optimization pass by repeating the process at a finer granularity. We applied DiTOX to 130 well-known models from the official ONNX Model Hub, spanning diverse tasks including classification, object detection, semantic segmentation, text summarization, question answering, and sentiment analysis. Our evaluation revealed that $9.2$% of the model instances either caused the optimizer to crash or led to the generation of invalid models using default optimization strategies. Additionally, $30$% of classification models and $16.6$% of object detection and segmentation models exhibited differing outputs across original and optimized versions, whereas models focused on text-related tasks were generally robust to optimization. In total, DiTOX uncovered 15 issues, 14 of which were previously unknown—related to optimizer crashes and accuracy regressions, impacting 9 of the 47 available optimization passes as well as the optimizer more broadly. All findings were reported to the ONNX Optimizer development team. DiTOX demonstrates a simple but effective framework for validating AI model optimizers and with minimal work, is applicable beyond the ONNX ecosystem.
Sat 31 JanDisplayed time zone: Hobart change
13:45 - 15:30 | |||
13:45 26mTalk | DiTOX: Fault Detection and Localization in the ONNX Optimizer Main Conference | ||
14:11 26mTalk | SSMR: Statically Detecting Speculation Safe Memory Regions to Mitigate Transient Execution Attacks Main Conference Ange-Thierry Ishimwe University of Colorado Boulder, Sam Mcdiarmid-sterling University of Colorado Boulder, Zack McKevitt University of Colorado Boulder, Tamara Silbergleit Lehman University of Colorado Boulder | ||
14:37 26mTalk | CHEHAB: Automatic Compiler Code Optimization for Fully Homomorphic Encryption Main Conference Riyadh Baghdadi New York University Abu Dhabi, Abdessamed Seddiki New York University Abu Dhabi and Ecole Superieure d'Informatique, Arab Mohammed New York University Abu Dhabi and Ecole Superieure d'Informatique, Zakaria Hebbal Ecole nationale Supérieure d'Informatique, Aimad Chabounia Ecole Superieure d'Informatique; New York University Abu Dhabi, Eduardo Chielle New York University Abu Dhabi, Michail Maniatakos New York University Abu Dhabi, MENACER Djamel Eddine Ecole Superieure d'Informatique, Karima Benatchba Ecole Nationale Supérieure d'Informatique, Challal Yacine University of Doha for Science and Technology | ||
15:03 26mTalk | Parallel and Customizable Equality Saturation Main Conference Jonathan Van der Cruysse McGill University, Abd-El-Aziz Zayed McGill University, Mai Jacob Peng McGill University, Christophe Dubach McGill University | ||