Boosting Robustness Verification of Semantic Feature Neighborhoods
Deep neural networks have been shown to be vulnerable to adversarial attacks that perturb inputs based on semantic features. Existing robustness analyzers can reason about semantic feature neighborhoods to increase the networks’ reliability. However, despite the significant progress in these techniques, they still struggle to scale to deep networks and large neighborhoods. In this work, we introduce VeeP, an active learning approach that splits the verification process into a series of smaller verification steps, each is submitted to an existing robustness analyzer. The key idea is to build on prior steps to predict the next optimal step. The optimal step is predicted by estimating the certification velocity and sensitivity via parametric regression. We evaluate VeeP on MNIST, Fashion-MNIST, CIFAR-10 and ImageNet and show that it can analyze neighborhoods of various features: brightness, contrast, hue, saturation, and lightness. We show that, on average, given a 90 minute timeout, VeeP verifies 96% of the maximally certifiable neighborhoods within 29 minutes, while existing splitting approaches verify, on average, 73% of the maximally certifiable neighborhoods within 58 minutes.
Mon 5 DecDisplayed time zone: Auckland, Wellington change
13:30 - 15:00 | Numerical Static AnalysesSAS at AMRF Auditorium Chair(s): Isabella Mastroeni University of Verona, Italy | ||
13:30 30mTalk | CLEVEREST: Accelerating CEGAR-based Neural Network Verification via Adversarial AttacksVirtual SAS Zhe Zhao ShanghaiTech University, Yedi Zhang ShanghaiTech University, Guangke Chen ShanghaiTech University, Fu Song ShanghaiTech University, Taolue Chen Birkbeck University of London, Jiaxiang Liu Shenzhen University | ||
14:00 30mTalk | Boosting Robustness Verification of Semantic Feature Neighborhoods SAS | ||
14:30 30mTalk | Lifting Numeric Relational Domains to Algebraic Data Types SAS Santiago Bautista Univ Rennes, Inria, CNRS, IRISA, Thomas P. Jensen INRIA Rennes, Benoît Montagu Inria Pre-print File Attached |