Wed 17 Nov 2021 10:06 - 10:08 at Kangaroo - Tool Demo (2) Chair(s): Mattia Fazzini
Deep learning (DL) training is nondeterministic and such nondeterminism was shown to cause significant variance of model accuracy (up to 10.8%). Such variance may affect the validity of the comparison of newly proposed DL techniques with baselines. To ensure such validity, DL researchers and practitioners must replicate their experiments multiple times with identical settings to quantify the variance of the proposed approaches and baselines. Replicating and measuring DL variances reliably and efficiently is challenging and understudied. We propose a ready-to-deploy framework DEVIATE that (1)measures DL training variance of a DL model with minimal manual efforts, and (2) provides statistical tests of both accuracy and variance. Specifically, DEVIATEautomaticallyanalyzes the DL training code and extracts monitored important metrics (such as accuracy and loss). In addition, DEVIATE performs popular statistical tests and provides users with a report of statistical p-values and effect sizes along with various confidence levels when comparing to selected baselines. We demonstrate the effectiveness of DEVIATE by performing case studies with adversarial training. Specifically, for an adversarial training process that uses the Fast Gradient Signed Method to generate adversarial examples as the training data, DEVIATEmeasures a max difference of accuracy among 8 identical training runs with fixed random seeds to be up to 5.1%.
Wed 17 NovDisplayed time zone: Hobart change
09:00 - 10:00 | Learning INIER track / Research Papers / Tool Demonstrations at Kangaroo Chair(s): Denys Poshyvanyk William and Mary | ||
09:00 20mTalk | DeepMetis: Augmenting a Deep Learning Test Set to Increase its Mutation Score Research Papers Vincenzo Riccio USI Lugano, Nargiz Humbatova Università della Svizzera Italiana (USI), Gunel Jahangirova USI Lugano, Paolo Tonella USI Lugano | ||
09:20 20mTalk | Efficient state synchronisation in model-based testing through reinforcement learning Research Papers Uraz Cengiz Türker University of Leicester, UK, Robert Hierons University of Sheffield, Mohammad Reza Mousavi King's College London, Ivan Tyukin University of Leicester | ||
09:40 10mTalk | What do pre-trained code models know about code? NIER track | ||
09:50 5mTalk | DEVIATE: A Deep Learning Variance Testing Framework Tool Demonstrations Hung Viet Pham University of Waterloo, Mijung Kim Purdue University, Lin Tan Purdue University, Yaoliang Yu University of Waterloo, Nachiappan Nagappan Microsoft Research |
10:00 - 11:00 | |||
10:00 2mTalk | Shaker: a Tool for Detecting More Flaky Tests Faster Tool Demonstrations Marcello Cordeiro Federal University of Pernambuco, Denini Silva Federal University of Pernambuco, Leopoldo Teixeira Federal University of Pernambuco, Breno Miranda Federal University of Pernambuco, Marcelo d'Amorim Federal University of Pernambuco Link to publication | ||
10:04 2mTalk | GenTree: Inferring Configuration Interactions using Decision Trees Tool Demonstrations | ||
10:06 2mTalk | DEVIATE: A Deep Learning Variance Testing Framework Tool Demonstrations Hung Viet Pham University of Waterloo, Mijung Kim Purdue University, Lin Tan Purdue University, Yaoliang Yu University of Waterloo, Nachiappan Nagappan Microsoft Research | ||
10:08 2mTalk | FLACK: Localizing Faults in Alloy Models Tool Demonstrations Guolong Zheng University of Nebraska Lincoln, ThanhVu Nguyen George Mason University, Simón Gutiérrez Brida University of Rio Cuarto and CONICET, Argentina, Germán Regis Universidad Nacional de Río Cuarto, Marcelo F. Frias Dept. of Software Engineering Instituto Tecnológico de Buenos Aires, Nazareno Aguirre University of Rio Cuarto and CONICET, Argentina, Hamid Bagheri University of Nebraska-Lincoln | ||
10:10 2mTalk | Scalable Fuzzing of Program Binaries with E9AFL Tool Demonstrations Xiang Gao National University of Singapore, Gregory J. Duck National University of Singapore, Abhik Roychoudhury National University of Singapore |