We are on the cusp of a major opportunity: software tools that take advantage of Big Code. Specifically, Big Code will enable novel tools in areas such as security enhancers, bug finders, and code synthesizers. What do researchers need from Big Code to make progress on their tools? Our answer is an infrastructure that consists of 100,000 executable Java programs together with a set of working tools and an environment for building new tools. This Normalized Java Resource (NJR) will lower the barrier to implementation of new tools, speed up research, and ultimately help advance research frontiers.
Researchers get significant advantages from using NJR. They can write scripts that base their new tool on NJR’s already-working tools, and they can search NJR for programs with desired characteristics. They will receive the search result as a container that they can run either locally or on a cloud service. Additionally, they benefit from NJR’s normalized representation of each Java program, which enables scalable running of tools on the entire collection. Finally, they will find that NJR’s collection of programs is diverse because of our efforts to run clone detection and near-duplicate removal. In this paper we describe our vision for NJR and our current prototype.
Fri 20 JulDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
11:00 - 12:30 | |||
11:00 10mDay opening | Opening SOAP | ||
11:10 50mTalk | NJR: A Normalized Java Resource SOAP Link to publication DOI Pre-print | ||
12:00 15mTalk | Systematic Evaluation of the Unsoundness of Call Graph Construction Algorithms for Java SOAP Michael Reif TU Darmstadt, Germany, Florian Kübler TU Darmstadt, Germany, Michael Eichberg TU Darmstadt, Germany, Mira Mezini TU Darmstadt Link to publication DOI Pre-print Media Attached | ||
12:15 15mTalk | Iceberg: A Dynamic Analysis of Java Critical Sections Investigating Runtime Performance Variability SOAP Link to publication DOI Pre-print |