Deep Specification Mining
Formal specifications are essential but usually unavailable in software systems. Furthermore, writing these specifications is costly and requires skills from developers. Recently, many automated techniques have been proposed to mine specifications in various formats including finite-state automaton (FSA). However, more works in specification mining are needed to further improve the accuracy of the inferred specifications.
In this work, we propose Deep Specification Miner (DSM), a new approach that performs deep learning for mining FSA-based specifications. Our proposed approach uses test case generation to generate a rich set of execution traces for training a Recurrent Neural Network Based Language Model (RNNLM). From these execution traces, we construct a Prefx Tree Acceptor (PTA) and use the learned RNNLM to extract many features. These features are subsequently utilized by clustering algorithms to merge similar automata states in PTA for constructing a number of FSAs. Then, our approach performs a model selection heuristic to estimate F-measure of FSAs and returns the one with highest estimated F-measure. We execute DSM to mine specifications of 11 target library classes. Our empirical analysis shows that DSM achieves an average Precision, Recall, and F-measure of 82.76%, 72.3%, and 71.97%, respectively. Compared to the best baseline, our approach is more effective by 28.22% in terms of average F-measure
Mon 16 JulDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
16:00 - 17:30 | Machine LearningISSTA Technical Papers at Zurich II Chair(s): Alex Orso Georgia Institute of Technology | ||
16:00 20mTalk | Compiler Fuzzing through Deep Learning ISSTA Technical Papers Chris Cummins University of Edinburgh, Pavlos Petoumenos University of Edinburgh, Alastair Murray Codeplay Software, Hugh Leather University of Edinburgh | ||
16:20 20mTalk | Deep Specification Mining ISSTA Technical Papers Tien-Duy B. Le School of Information Systems, Singapore Management University, David Lo Singapore Management University | ||
16:40 20mTalk | Identifying Implementation Bugs in Machine Learning based Image Classifiers using Metamorphic Testing ISSTA Technical Papers Anurag Dwarakanath Accenture Labs, Manish Ahuja Accenture Labs, Samarth Sikand Accenture Labs, Raghotham M Rao Accenture Labs, R.P. Jagadeesh Chandra Bose Accenture Labs, Neville Dubash Accenture Labs, Sanjay Podder | ||
17:00 20mTalk | An Empirical Study on TensorFlow Program Bugs ISSTA Technical Papers Yuhao Zhang Peking University, Yifan Chen Peking University, Shing-Chi Cheung Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Yingfei Xiong Peking University, Lu Zhang Peking University Pre-print | ||
17:20 10m | Q&A in groups ISSTA Technical Papers |