SETA: Statistical Fault Attribution for Compound AI SystemsFull Paper
Modern AI systems increasingly comprise multiple interconnected neural networks to tackle complex inference tasks. Testing such systems for robustness and safety entails significant challenges. Current state-of-the-art robustness testing techniques, whether black-box or white-box, have been proposed and implemented for single-network models and do not scale well to multi-network pipelines. We propose a modular robustness testing framework that applies a given set of perturbations to test data. Our testing framework supports (1) a component-wise system analysis to isolate errors and (2) reasoning about error propagation across the neural network modules. The testing framework is architecture and modality agnostic and can be applied across domains. We apply the framework to a real-world autonomous rail inspection system composed of multiple deep networks and successfully demonstrate how our approach enables fine-grained robustness analysis beyond conventional end-to-end metrics.
| SETA: Statistical Fault Attribution for Compound AI Systems (CAIN_SETA.pdf) | 14.73MiB |
Mon 13 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
14:00 - 15:30 | Quality Attributes and AssuranceResearch Track / CAIN Program at Oceania X Chair(s): Eric Knauss Chalmers | University of Gothenburg | ||
14:00 8mShort-paper | Quality Model for Machine Learning ComponentsShort Paper Research Track Grace Lewis Carnegie Mellon Software Engineering Institute, Rachel A Brower-Sinning Carnegie Mellon Software Engineering Institute, Robert Edman Carnegie Mellon Software Engineering Institute, Ipek Ozkaya Carnegie Mellon University, Sebastian Echeverria Carnegie Mellon Software Engineering Institute, Alex Derr Carnegie Mellon Software Engineering Institute, Collin Beaudoin Fairfield University, Katherine R. Maffey Carnegie Mellon University Pre-print | ||
14:08 12mFull-paper | Optimising for Energy Efficiency and Performance in Machine LearningFull Paper Research Track Emile Dos Santos Ferreira University of Cambridge, Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge Pre-print | ||
14:20 8mShort-paper | The Energy Impact of Domain Model Design in Classical PlanningShort Paper Research Track Ilche Georgievski University of Stuttgart, Serhat Tekin University of Stuttgart, Marco Aiello University of Stuttgart Pre-print | ||
14:28 12mFull-paper | LLMs as Design Partners for AI-Based System Patterns: An Empirical EvaluationFull Paper Research Track Felipe Rodrigues de Oliveira State University of Ceara, Brazil, Felipe Vasconcelos De Souza State University of Ceara, Brazil, Ana Luiza Bessa De Paula Barros State University of Ceara, Brazil, Paulo Maia State University of Ceará | ||
14:40 12mFull-paper | Statistical Confidence in Functional Correctness: An Approach for AI Product Functional Correctness EvaluationFull Paper Research Track Wallace Albertini Pontifical Catholic University of Rio de Janeiro, Marina Condé Araújo Department of Informatics - Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Júlia Condé Araújo Department of Informatics - Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Antonio Pedro Santos Alves Pontifical Catholic University of Rio de Janeiro, Marcos Kalinowski Pontifical Catholic University of Rio de Janeiro (PUC-Rio) | ||
14:52 12mFull-paper | SETA: Statistical Fault Attribution for Compound AI SystemsFull Paper Research Track Sayak Chowdhury IIITB - International Institute of Information Technology Bangalore, Meenakshi D'Souza IIITB - International Institute of Information Technology Bangalore Pre-print File Attached | ||
15:04 26mLive Q&A | Joint Q&A (Quality Attributes and Assurance) CAIN Program | ||