Compiler Fuzzing through Deep Learning (ISSTA 2018 - ISSTA Technical Papers)

Write a Blog >>

Sun 15 - Sat 21 July 2018 Amsterdam, Netherlands

co-located with ECOOP '18 and others

Who

Chris Cummins, Pavlos Petoumenos, Alastair Murray, Hugh Leather

Track

ISSTA 2018 ISSTA Technical Papers

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 16 Jul 2018 16:00 - 16:20 at Zurich II - Machine Learning Chair(s): Alex Orso

Abstract

Random program generation — fuzzing — is an effective technique for discovering bugs in compilers but successful fuzzers require extensive development effort for every language supported by the compiler, and often leave parts of the language space untested.

We introduce DeepSmith, a novel machine learning approach to accelerating compiler validation through the inference of generative models for compiler inputs. Our approach \emph{infers} a learned model of the structure of real world code based on a large corpus of open source code. Then, it uses the model to automatically generate tens of thousands of realistic programs. Finally, we apply established differential testing methodologies on them to expose bugs in compilers.

We apply our approach to the OpenCL programming language, automatically exposing bugs in OpenCL compilers with little effort on our side. In 1,000 hours of automated testing of commercial and open source compilers, we discover bugs in all of them, submitting 67 bug reports.

Our test cases are on average two orders of magnitude smaller than the state-of-the-art, require 3.03x less time to generate and evaluate, and expose bugs which the state-of-the-art cannot. Our random program generator, comprising only 500 lines of code, took 12 hours to train for OpenCL versus the state-of-the-art taking 9 man months to port from a generator for C and 50,000 lines of code.

Chris Cummins

University of Edinburgh

United Kingdom

Pavlos Petoumenos

University of Edinburgh

United Kingdom

Alastair Murray

Codeplay Software

Hugh Leather

University of Edinburgh

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 16 Jul
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

16:00 - 17:30	Machine LearningISSTA Technical Papers at Zurich II Chair(s): Alex Orso Georgia Institute of Technology

16:00 20m Talk		Compiler Fuzzing through Deep Learning ISSTA Technical Papers Chris Cummins University of Edinburgh, Pavlos Petoumenos University of Edinburgh, Alastair Murray Codeplay Software, Hugh Leather University of Edinburgh
16:20 20m Talk		Deep Specification Mining ISSTA Technical Papers Tien-Duy B. Le School of Information Systems, Singapore Management University, David Lo Singapore Management University
16:40 20m Talk		Identifying Implementation Bugs in Machine Learning based Image Classifiers using Metamorphic Testing ISSTA Technical Papers Anurag Dwarakanath Accenture Labs, Manish Ahuja Accenture Labs, Samarth Sikand Accenture Labs, Raghotham M Rao Accenture Labs, R.P. Jagadeesh Chandra Bose Accenture Labs, Neville Dubash Accenture Labs, Sanjay Podder
17:00 20m Talk		An Empirical Study on TensorFlow Program Bugs ISSTA Technical Papers Yuhao Zhang Peking University, Yifan Chen Peking University, Shing-Chi Cheung Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Yingfei Xiong Peking University, Lu Zhang Peking University Pre-print
17:20 10m		Q&A in groups ISSTA Technical Papers