Diversity-Driven Automated Formal Verification (ICSE 2022 - Technical Track)

Write a Blog >>

Sun 8 - Fri 27 May 2022

Who

Emily First, Yuriy Brun

Track

ICSE 2022 Technical Track

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 10 May 2022 13:20 - 13:25 at ICSE room 5 - Validation and Verification 7 Chair(s): Guido Salvaneschi
Wed 11 May 2022 20:25 - 20:30 at ICSE room 3 - Validation and Verification 5 Chair(s): Saba Alimadadi
Wed 25 May 2022 09:35 - 09:40 at Room 306+307 - Papers 4: Verification and Analysis Chair(s): Gregory Gay

Abstract

Formally verified correctness is one of the most desirable properties of software systems. But despite great progress made via interactive theorem provers, such as Coq, writing proof scripts for verification remains one of the most effort-intensive (and often prohibitively difficult) software development activities. Recent work has created tools that automatically synthesize proofs or proof scripts. For example, CoqHammer can prove 26.6% of theorems completely automatically by reasoning using precomputed facts, while TacTok and ASTactic, which use machine learning to model proof scripts and then perform biased search through the proof-script space, can prove 12.9% and 12.3% of the theorems, respectively. Further, these three tools are highly complementary; together, they can prove 30.4% of the theorems fully automatically. Our key insight is that control over the learning process can produce a diverse set of models, and that, due to the unique nature of proof synthesis (the existence of the theorem prover, an oracle that infallibly judges a proof’s correctness), this diversity can significantly improve these tools’ proving power. Accordingly, we develop Diva, which uses a diverse set of models with TacTok’s and ASTactic’s search mechanism to prove 21.7% of the theorems. That is, Diva proves 68% more theorems than TacTok and 77% more than ASTactic. Complementary to CoqHammer, Diva proves 781 theorems (27% added value) that Coq-Hammer does not, and 364 theorems no existing tool has proved automatically. Together with CoqHammer, Diva proves 33.8% of the theorems, the largest fraction to date. We explore nine dimensions for learning diverse models, and identify which dimensions lead to the most useful diversity. Further, we develop an optimization to speed up Diva’s execution by 40X. Our study introduces a completely new idea for using diversity in machine learning to improve the power of state-of-the-art proof-script synthesis techniques, and empirically demonstrates that the improvement is significant on a dataset of 68K theorems from 122 open-source software projects.

Link to Preprint

https://people.cs.umass.edu/~brun/pubs/pubs/First22icse.pdf

DOI

https://doi.org/10.1145/3510003.3510138

Emily First

University of Massachusetts Amherst

Yuriy Brun

University of Massachusetts

United States

Diversity-Driven Automated Formal Verification

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 10 May
Displayed time zone: Eastern Time (US & Canada) change

13:00 - 14:00	Validation and Verification 7Journal-First Papers / Technical Track / SEIP - Software Engineering in Practice at ICSE room 5 Chair(s): Guido Salvaneschi University of St. Gallen

5m Talk		Control and Discovery of Environment Behaviour Journal-First Papers Maureen Keegan Intercom, Nicolás D’Ippolito Dept. of Computer Science FCEyN, University of Buenos Aires, Víctor Braberman ICC (UBA-CONICET), Nir Piterman University of Gothenberg, Sebastian Uchitel Universidad de Buenos Aires / Imperial College Link to publication DOI Pre-print Media Attached
5m Talk		Unreliable Test Infrastructures in Automotive Testing Setups SEIP - Software Engineering in Practice Claudius Jordan Technical University of Munich, Philipp Foth Technical University of Munich, Alexander Pretschner TU Munich, Matthias Fruth TraceTronic GmbH Pre-print Media Attached
5m Talk		Nessie: Automatically Testing JavaScript APIs with Asynchronous Callbacks Technical Track Ellen Arteca Northeastern University, Sebastian Harner University of Stuttgart, Michael Pradel University of Stuttgart, Frank Tip Northeastern University Pre-print Media Attached
5m Talk		Towards Boosting Patch Execution On-the-Fly Technical Track Samuel Benton The University of Texas at Dallas, Yuntong Xie Tsinghua University, Lan Lu SUSTech, Mengshi Zhang Meta, Xia Li Kennesaw State University, Lingming Zhang University of Illinois at Urbana-Champaign Media Attached File Attached
5m Talk		Diversity-Driven Automated Formal VerificationDistinguished Paper Award Technical Track Emily First University of Massachusetts Amherst, Yuriy Brun University of Massachusetts DOI Pre-print Media Attached
5m Talk		Nalin: Learning from Runtime Behavior to Find Name-Value Inconsistencies Technical Track Jibesh Patra University of Stuttgart, Michael Pradel University of Stuttgart Pre-print Media Attached

Wed 11 May
Displayed time zone: Eastern Time (US & Canada) change

20:00 - 21:00	Validation and Verification 5Technical Track / SEIP - Software Engineering in Practice / Journal-First Papers at ICSE room 3 Chair(s): Saba Alimadadi Simon Fraser University

5m Talk		Control and Discovery of Environment Behaviour Journal-First Papers Maureen Keegan Intercom, Nicolás D’Ippolito Dept. of Computer Science FCEyN, University of Buenos Aires, Víctor Braberman ICC (UBA-CONICET), Nir Piterman University of Gothenberg, Sebastian Uchitel Universidad de Buenos Aires / Imperial College Link to publication DOI Pre-print Media Attached
5m Talk		How Does Code Reviewing Feedback Evolve? A Longitudinal Study at Dell EMC SEIP - Software Engineering in Practice Ruiyin Wen McGill University, Maxime Lamothe Polytechnique Montréal, Shane McIntosh University of Waterloo Pre-print Media Attached
5m Talk		Nessie: Automatically Testing JavaScript APIs with Asynchronous Callbacks Technical Track Ellen Arteca Northeastern University, Sebastian Harner University of Stuttgart, Michael Pradel University of Stuttgart, Frank Tip Northeastern University Pre-print Media Attached
5m Talk		ExAIS: Executable AI Semantics Technical Track Richard Schumi Singapore Management University, Jun Sun Singapore Management University Pre-print Media Attached
5m Talk		Towards Boosting Patch Execution On-the-Fly Technical Track Samuel Benton The University of Texas at Dallas, Yuntong Xie Tsinghua University, Lan Lu SUSTech, Mengshi Zhang Meta, Xia Li Kennesaw State University, Lingming Zhang University of Illinois at Urbana-Champaign Media Attached File Attached
5m Talk		Diversity-Driven Automated Formal VerificationDistinguished Paper Award Technical Track Emily First University of Massachusetts Amherst, Yuriy Brun University of Massachusetts DOI Pre-print Media Attached

Wed 25 May
Displayed time zone: Eastern Time (US & Canada) change

09:30 - 10:30	Papers 4: Verification and AnalysisTechnical Track / Journal-First Papers at Room 306+307 Chair(s): Gregory Gay Chalmers and the University of Gothenburg

09:30 5m Talk		Static Stack-Preserving Intra-Procedural Slicing of WebAssembly BinariesBest Artifact Award Technical Track Quentin Stiévenart Vrije Universiteit Brussel, David Binkley Loyola University Maryland, Coen De Roover Vrije Universiteit Brussel DOI Pre-print Media Attached
09:35 5m Talk		Diversity-Driven Automated Formal VerificationDistinguished Paper Award Technical Track Emily First University of Massachusetts Amherst, Yuriy Brun University of Massachusetts DOI Pre-print Media Attached
09:40 5m Talk		Control and Discovery of Environment Behaviour Journal-First Papers Maureen Keegan Intercom, Nicolás D’Ippolito Dept. of Computer Science FCEyN, University of Buenos Aires, Víctor Braberman ICC (UBA-CONICET), Nir Piterman University of Gothenberg, Sebastian Uchitel Universidad de Buenos Aires / Imperial College Link to publication DOI Pre-print Media Attached
09:45 5m Talk		Learning Lenient Parsing & Typing via Indirect Supervision Journal-First Papers Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis, Vincent J. Hellendoorn Carnegie Mellon University Link to publication DOI Pre-print Media Attached
09:50 5m Talk		Striking a Balance: Pruning False-Positives from Static Call GraphsNominated for Distinguished Paper Technical Track Akshay Utture University of California, Los Angeles (UCLA), Shuyang Liu University of California, Los Angeles, Christian Gram Kalhauge Technical University of Denmark, Jens Palsberg University of California at Los Angeles DOI Pre-print Media Attached
09:55 5m Talk		SugarC: Scalable Desugaring of Real-World Preprocessor Usage into Pure C Technical Track Zachary Patterson University of Texas at Dallas, Zenong Zhang The University of Texas at Dallas, Brent Pappas University of Central Florida, Shiyi Wei University of Texas at Dallas, Paul Gazzillo University of Central Florida Pre-print Media Attached

Information for Participants

Tue 10 May 2022 13:00 - 14:00 at ICSE room 5 - Validation and Verification 7 Chair(s): Guido Salvaneschi

Info for room ICSE room 5-odd hours:

Click here to go to the room on Midspace

Wed 11 May 2022 20:00 - 21:00 at ICSE room 3 - Validation and Verification 5 Chair(s): Saba Alimadadi

Info for room ICSE room 3-even hours:

Click here to go to the room on Midspace