Inputs from Hell: Learning Input Distributions for Grammar-Based Test Generation
When a program has been tested on some sample inputs, which additional inputs should one test next? To further test the program, one needs to construct inputs that cover new input features, in a manner that differs from the initial samples.
This paper presents a novel test generation approach that employs context-free grammars to learn the production probabilities of input elements from sample inputs. Using the grammar as an input parser, we show how to learn the input distribution of the samples, allowing us to create “common inputs” that are similar to the samples. By inverting the learned probabilities, we can create “uncommon inputs” that are dissimilar to the samples.
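To illustrate the idea, here is a minimal Python sketch, not the authors' implementation: a toy grammar with hypothetical production counts (as they might be tallied while parsing samples), a generator that samples expansions according to the learned probabilities, and one plausible inversion scheme (renormalized 1 − p) standing in for the paper's probability inversion.

```python
import random

# Hypothetical toy grammar: each nonterminal maps to its alternative expansions.
GRAMMAR = {
    "<digit>": ["0", "1", "2", "3", "4", "5", "6", "7", "8", "9"],
    "<number>": ["<digit>", "<digit><number>"],
}

# Assumed production counts, as if tallied while parsing sample inputs.
COUNTS = {
    "<number>": {"<digit>": 90, "<digit><number>": 10},
    "<digit>": {d: 10 for d in GRAMMAR["<digit>"]},
}

def probabilities(counts):
    """Normalize learned production counts into probabilities."""
    total = sum(counts.values())
    return {exp: c / total for exp, c in counts.items()}

def invert(probs):
    """One plausible inversion: weight each alternative by 1 - p, renormalized,
    so that rarely seen productions become the most likely ones."""
    inverted = {exp: 1.0 - p for exp, p in probs.items()}
    total = sum(inverted.values())
    return {exp: w / total for exp, w in inverted.items()}

def produce(symbol, dists, depth=0, max_depth=20):
    """Expand `symbol` by sampling productions from the given distributions."""
    if symbol not in GRAMMAR:
        return symbol  # terminal symbol
    probs = dists[symbol]
    if depth >= max_depth:
        expansion = min(probs, key=len)  # cut off runaway recursion
    else:
        expansion = random.choices(list(probs), weights=list(probs.values()))[0]
    # Recursively expand every nonterminal in the chosen alternative.
    out, i = "", 0
    while i < len(expansion):
        if expansion[i] == "<":
            j = expansion.index(">", i) + 1
            out += produce(expansion[i:j], dists, depth + 1, max_depth)
            i = j
        else:
            out += expansion[i]
            i += 1
    return out

common = {sym: probabilities(c) for sym, c in COUNTS.items()}
uncommon = {sym: invert(p) for sym, p in common.items()}
print(produce("<number>", common))    # short numbers, similar to the samples
print(produce("<number>", uncommon))  # long numbers, dissimilar to the samples
```

With the illustrative counts above, “common” generation favors the non-recursive production and yields short numbers like those in the samples, while the inverted distribution favors the recursive production and yields long, atypical numbers.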
Our evaluation of these approaches on three input formats shows that the “common inputs” reproduced 96% of the methods induced by the samples, while the “uncommon inputs” covered methods different from those induced by the samples for almost all subjects (95%).