Write a Blog >>
ICSE 2022
Sun 8 - Fri 27 May 2022
Mon 9 May 2022 21:00 - 21:05 at ICSE room 5-odd hours - Program Analysis 3 Chair(s): Travis Breaux
Wed 11 May 2022 13:00 - 13:05 at ICSE room 4-odd hours - Synthesis and Reverse Engineering Chair(s): Reed Milewicz
Wed 25 May 2022 11:30 - 11:35 at Room 304+305 - Papers 7: Evolution and Maintenance Chair(s): Thomas LaToza
Wed 25 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 1

Much software, whether beneficent or malevolent, is distributed only as binaries, sans source code. Absent source code, understanding binaries’ behavior can be quite challenging, especially when compiled under higher levels of compiler optimization. These optimizations can transform comprehensible, “natural” source constructions into something entirely unrecognizable. Reverse engineering binaries, especially those suspected of being malevolent or guilty of intellectual property theft, are important and time-consuming tasks. There is a great deal of interest in tools to “decompile” binaries back into more natural source code to aid reverse engineering. Decompilation involves several desirable steps, including recreating source-language constructions, variable names, and perhaps even comments. One central step in creating binaries is optimizing function calls, using steps such as inlining. Recovering these (possibly inlined) function calls from optimized binaries is an essential task that most state-of-the-art decompiler tools try to do but do not perform very well. In this paper, we evaluate a supervised learning approach to the problem of recovering optimized function calls. We leverage open-source software and develop an automated labeling scheme to generate a reasonably large dataset of binaries labeled with actual function usages. We augment this large but limited labeled dataset with a pre-training step, which learns the decompiled code statistics from a much larger unlabeled dataset. Thus augmented, our learned labeling model can be combined with an existing decompilation tool, Ghidra, to achieve substantially improved performance in function call recovery, especially at higher levels of optimization.

Mon 9 May

Displayed time zone: Eastern Time (US & Canada) change

21:00 - 22:00
21:00
5m
Talk
Learning to Find Usages of Library Functions in Optimized Binaries
Journal-First Papers
Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis, Anand Ashok Sawant University of California, Davis
Link to publication DOI Pre-print Media Attached
21:05
5m
Talk
InspectJS: Leveraging Code Similarity and User-Feedback for Effective Taint Specification Inference for JavaScript
SEIP - Software Engineering in Practice
Saikat Dutta University of Illinois at Urbana-Champaign, Diego Garbervetsky University of Buenos Aires and CONICET, Argentina, Shuvendu Lahiri Microsoft Research, Max Schaefer GitHub, Inc.
DOI Pre-print Media Attached
21:10
5m
Talk
Static Inference Meets Deep Learning: A Hybrid Type Inference Approach for PythonNominated for Distinguished Paper
Technical Track
Yun Peng The Chinese University of Hong Kong, Cuiyun Gao Harbin Institute of Technology, Zongjie Li The Hong Kong University of Science and Technology, Bowei Gao Harbin Institute of Technology, Shenzhen, David Lo Singapore Management University, Qirun Zhang Georgia Institute of Technology, USA, Michael Lyu The Chinese University of Hong Kong
DOI Pre-print Media Attached
21:15
5m
Talk
DeepDiagnosis: Automatically Diagnosing Faults and Recommending Actionable Fixes in Deep Learning Programs
Technical Track
Mohammad Wardat Dept. of Computer Science, Iowa State University, Breno Dantas Cruz Dept. of Computer Science, Iowa State University, Wei Le Iowa State University, Hridesh Rajan Iowa State University
Pre-print Media Attached
21:20
5m
Talk
Striking a Balance: Pruning False-Positives from Static Call GraphsNominated for Distinguished Paper
Technical Track
Akshay Utture University of California, Los Angeles (UCLA), Shuyang Liu University of California, Los Angeles, Christian Gram Kalhauge Technical University of Denmark, Jens Palsberg University of California at Los Angeles
DOI Pre-print Media Attached

Wed 11 May

Displayed time zone: Eastern Time (US & Canada) change

13:00 - 14:00
Synthesis and Reverse EngineeringTechnical Track / Journal-First Papers at ICSE room 4-odd hours
Chair(s): Reed Milewicz Sandia National Laboratories
13:00
5m
Talk
Learning to Find Usages of Library Functions in Optimized Binaries
Journal-First Papers
Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis, Anand Ashok Sawant University of California, Davis
Link to publication DOI Pre-print Media Attached
13:05
5m
Talk
Dynamic Update for Synthesized GR(1) Controllers
Technical Track
Gal Amram Tel Aviv University, Shahar Maoz Tel Aviv University, Israel, Itai Segall Nokia Bell-Labs, Matan Yossef Tel Aviv University
Pre-print Media Attached
13:10
5m
Talk
Push-Button Synthesis of Watch Companions for Android Apps
Technical Track
Cong Li Nanjing University, Yanyan Jiang Nanjing University, Chang Xu Nanjing University
Link to publication DOI Pre-print Media Attached
13:15
5m
Talk
Jigsaw: Large Language Models meet Program Synthesis
Technical Track
Naman Jain Microsoft Research, Skanda Vaidyanath Stanford, Arun Iyer Microsoft Research, India, Nagarajan Natarajan Microsoft Research, India, Suresh Parthasarathy Microsoft Research, India, Sriram Rajamani Microsoft Research, Rahul Sharma Microsoft Research
Pre-print Media Attached
13:20
5m
Talk
SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions
Technical Track
Ripon Saha , Akira Ura Fujitsu Ltd., Sonal Mahajan Uber Technologies Inc., Chenguang Zhu University of Texas at Austin, Linyi Li University of Illinois at Urbana-Champaign, Yang Hu The University of Texas at Austin, Hiroaki Yoshida AMD, Sarfraz Khurshid The University of Texas at Austin, Mukul Prasad Fujitsu Research of America
Pre-print Media Attached
13:25
5m
Talk
Static Stack-Preserving Intra-Procedural Slicing of WebAssembly BinariesBest Artifact Award
Technical Track
Quentin Stiévenart Vrije Universiteit Brussel, David Binkley Loyola University Maryland, Coen De Roover Vrije Universiteit Brussel
DOI Pre-print Media Attached

Wed 25 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
Papers 7: Evolution and MaintenanceJournal-First Papers / Technical Track / SEIP - Software Engineering in Practice at Room 304+305
Chair(s): Thomas LaToza George Mason University
11:00
5m
Talk
A Software Impact Analysis Tool based on Change History Learning and its Evaluation
SEIP - Software Engineering in Practice
Haruya Iwasaki Shibaura Institute of Technologies, Tsuyoshi Nakajima Shibaura Institute of Technology, Ryota Tsukamoto Mitsubishi Electric Corporation, Kazuko Takahashi Mitsubishi Electric Corporation, Shuichi Tokumoto Mitsubishi Electric Corporation
DOI Media Attached
11:05
5m
Talk
Using Pre-Trained Models to Boost Code Review Automation
Technical Track
Rosalia Tufano Università della Svizzera Italiana, Simone Masiero Software Institute @ Università della Svizzera Italiana, Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Denys Poshyvanyk William and Mary, Gabriele Bavota Software Institute, USI Università della Svizzera italiana
Pre-print Media Attached
11:10
5m
Talk
Self-Admitted Technical Debt Practices: A Comparison Between Industry and Open-Source
Journal-First Papers
Fiorella Zampetti University of Sannio, Italy, Gianmarco Fucci University of Sannio, Alexander Serebrenik Eindhoven University of Technology, Massimiliano Di Penta University of Sannio, Italy
Link to publication DOI Pre-print Media Attached
11:15
5m
Talk
Journal First Submission of the Article: What do class comments tell us? An investigation of comment evolution and practices in Pharo Smalltalk
Journal-First Papers
Pooja Rani University of bern, Sebastiano Panichella Zurich University of Applied Sciences, Manuel Leuenberger Software Composition Group, University of Bern, Switzerland, Mohammad Ghafari School of Computer Science, University of Auckland, Oscar Nierstrasz University of Bern, Switzerland
Link to publication DOI Authorizer link Media Attached
11:20
5m
Talk
An Ensemble Approach for Annotating Source Code Identifiers with Part-of-speech Tags
Journal-First Papers
Christian D. Newman Rochester Institute of Technology, Michael J. Decker Bowling Green State University, Reem S. Alsuhaibani Kent State University, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Satyajit Mohapatra Rochester Institute of Technology, Tejal Vishnoi Rochester Institute of Technology, Marcos Zampieri Rochester Institute of Technology, Timothy Sheldon BNY Mellon, Emily Hill Drew University
Link to publication DOI Pre-print Media Attached
11:25
5m
Talk
Retrieving Data Constraint Implementations Using Fine-Grained Code Patterns
Technical Track
Juan Manuel Florez The University of Texas at Dallas, Jonathan Perry The University of Texas at Dallas, Shiyi Wei University of Texas at Dallas, Andrian Marcus University of Texas at Dallas
Pre-print Media Attached
11:30
5m
Talk
Learning to Find Usages of Library Functions in Optimized Binaries
Journal-First Papers
Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis, Anand Ashok Sawant University of California, Davis
Link to publication DOI Pre-print Media Attached
11:35
5m
Talk
Nalin: Learning from Runtime Behavior to Find Name-Value Inconsistencies
Technical Track
Jibesh Patra University of Stuttgart, Michael Pradel University of Stuttgart
Pre-print Media Attached
13:30 - 15:00
13:30
90m
Talk
In-IDE Code Generation from Natural Language: Promise and Challenges
Journal-First Papers
Frank Xu Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Graham Neubig Carnegie Mellon University
13:30
90m
Talk
Strategies for Reuse and Sharing among Data Scientists in Software Teams
SEIP - Software Engineering in Practice
Will Epperson Carnegie Mellon University, April Wang University of Michigan, Robert DeLine Microsoft Research, Steven M. Drucker Microsoft Research
Pre-print Media Attached
13:30
90m
Talk
Debugging with Stack Overflow: Web Search Behavior in Novice and Expert Programmers
SEET - Software Engineering Education and Training
Annie Li University of Michigan, Madeline Endres University of Michigan, Westley Weimer University of Michigan
DOI Pre-print Media Attached
13:30
90m
Talk
Static Stack-Preserving Intra-Procedural Slicing of WebAssembly BinariesBest Artifact Award
Technical Track
Quentin Stiévenart Vrije Universiteit Brussel, David Binkley Loyola University Maryland, Coen De Roover Vrije Universiteit Brussel
DOI Pre-print Media Attached
13:30
90m
Talk
Linear-time Temporal Logic guided Greybox Fuzzing
Technical Track
Ruijie Meng National University of Singapore, Singapore, Zhen Dong Fudan University, China, Jialin Li National University of Singapore, Singapore, Ivan Beschastnikh University of British Columbia, Abhik Roychoudhury National University of Singapore
DOI Pre-print Media Attached
13:30
90m
Talk
Individual differences limit predicting well-being and productivity using software repositories: a longitudinal industrial study
Journal-First Papers
Miikka Kuutila University of Oulu, Mika Mäntylä University of Oulu, Maëlick Claes University of Oulu, Marko Elovainio University of Helsinki, Bram Adams Queen's University, Kingston, Ontario
Link to publication Media Attached
13:30
90m
Talk
The Agile Success Model: A Mixed-methods Study of a Large-scale Agile Transformation
Journal-First Papers
Daniel Russo Department of Computer Science, Aalborg University
Link to publication DOI Pre-print
13:30
90m
Talk
PReach: A Heuristic for Probabilistic Reachability to Identify Hard to Reach Statements
Technical Track
Seemanta Saha University of California Santa Barbara, Mara Downing University of California, Santa Barbara, Tegan Brennan , Tevfik Bultan University of California, Santa Barbara
Pre-print Media Attached
13:30
90m
Talk
Active Learning of Discriminative Subgraph Patterns for API Misuse Detection
Journal-First Papers
Hong Jin Kang Singapore Management University, David Lo Singapore Management University
Pre-print Media Attached File Attached
13:30
90m
Talk
Toward Among-Device AI from On-Device AI with Stream Pipelines
SEIP - Software Engineering in Practice
MyungJoo Ham Samsung Electronics, Sangjung Woo Samsung Electronics, Jaeyun Jung Samsung Electronics, Wook Song Samsung Electronics, Gichan Jang Samsung Electronics, Yongjoo Ahn Samsung Electronics, Hyoungjoo Ahn Samsung Electronics
Pre-print Media Attached
13:30
90m
Talk
Integrating Hackathons into an Online Cybersecurity Course
SEET - Software Engineering Education and Training
Abasi-amefon Obot Affia University of Tartu, Estonia, Alexander Nolte University of Tartu, Raimundas Matulevičius University of Tartu, Estonia
DOI Pre-print Media Attached
13:30
90m
Talk
Verifying Dynamic Trait Objects in Rust
SEIP - Software Engineering in Practice
Alexa VanHattum Cornell University, Daniel Schwartz-Narbonne Amazon, n.n., Nathan Chong Amazon, Adrian Sampson Cornell University
Pre-print Media Attached
13:30
90m
Talk
Automatically Identifying Shared Root Causes of Test Breakages in SAP HANA
SEIP - Software Engineering in Practice
Gabin An KAIST, Juyeon Yoon Korea Advanced Institute of Science and Technology, Jeongju Sohn University of Luxembourg, Jingun Hong SAP Labs, Dongwon Hwang SAP Labs, Shin Yoo KAIST
Pre-print Media Attached
13:30
90m
Talk
Guiding Peer-feedback in Learning Software Design using UML
SEET - Software Engineering Education and Training
Satrio Adi Rukmono Institut Teknologi Bandung, Michel Chaudron Eindhoven University of Technology, The Netherlands
Pre-print Media Attached
13:30
90m
Talk
Fairness-aware Configuration of Machine Learning Libraries
Technical Track
Saeid Tizpaz-Niari University of Texas at El Paso, Ashish Kumar , Gang Tan Pennsylvania State University, Ashutosh Trivedi University of Colorado Boulder
DOI Pre-print Media Attached
13:30
90m
Talk
Using Pre-Trained Models to Boost Code Review Automation
Technical Track
Rosalia Tufano Università della Svizzera Italiana, Simone Masiero Software Institute @ Università della Svizzera Italiana, Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Denys Poshyvanyk William and Mary, Gabriele Bavota Software Institute, USI Università della Svizzera italiana
Pre-print Media Attached
13:30
90m
Talk
Automatic Anti-Pattern Detection in Microservice Architectures based on Distributed Tracing
SEIP - Software Engineering in Practice
Tim Hubener ING Bank N.V., Yaping Luo ING; Eindhoven University of Technology, Pieter Vallen ING, Jonck van der Kogel ING Bank N.V., Tom Liefheid ING Bank N.V., Michel Chaudron Eindhoven University of Technology, The Netherlands
Media Attached
13:30
90m
Talk
Retrieving Data Constraint Implementations Using Fine-Grained Code Patterns
Technical Track
Juan Manuel Florez The University of Texas at Dallas, Jonathan Perry The University of Texas at Dallas, Shiyi Wei University of Texas at Dallas, Andrian Marcus University of Texas at Dallas
Pre-print Media Attached
13:30
90m
Talk
Verification of Consistency between Process Models, Object Life Cycles, and Context-dependent Semantic Specifications
Journal-First Papers
Ralph Hoch Institute of Computer Technology, TU Wien, Christoph Luckeneder Vienna University of Technology, Roman Popp TU Wien, Vienna, Austria, Hermann Kaindl Institute of Computer Technology, TU Wien
Link to publication DOI Pre-print Media Attached
13:30
90m
Talk
If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components
Technical Track
Boyue Caroline Hu University of Toronto, Lina Marsso University of Toronto, Krzysztof Czarnecki University of Waterloo, Canada, Rick Salay University of Toronto, Huakun Shen University of Toronto, Marsha Chechik University of Toronto
DOI Pre-print Media Attached
13:30
90m
Talk
Preparing Software Engineers to Develop Robot Systems
SEET - Software Engineering Education and Training
Carl Hildebrandt University of Virginia, Meriel von Stein University of Virginia, Trey Woodlief University of Virginia, Sebastian Elbaum University of Virginia
DOI Pre-print Media Attached
13:30
90m
Poster
EUGAIN. The European Network For Gender Balance in Informatics
Technical Track
Valentina Lenarduzzi University of Oulu, Barbora Buhnova Masaryk University, Letizia Jaccheri Norwegian University of Science and Technology
13:30
90m
Talk
Detecting False Alarms from Automatic Static Analysis Tools: How Far are We?Nominated for Distinguished Paper
Technical Track
Hong Jin Kang Singapore Management University, Khai Loong Aw Singapore Management University, David Lo Singapore Management University
DOI Pre-print Media Attached File Attached
13:30
90m
Talk
An Ensemble Approach for Annotating Source Code Identifiers with Part-of-speech Tags
Journal-First Papers
Christian D. Newman Rochester Institute of Technology, Michael J. Decker Bowling Green State University, Reem S. Alsuhaibani Kent State University, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Satyajit Mohapatra Rochester Institute of Technology, Tejal Vishnoi Rochester Institute of Technology, Marcos Zampieri Rochester Institute of Technology, Timothy Sheldon BNY Mellon, Emily Hill Drew University
Link to publication DOI Pre-print Media Attached
13:30
90m
Talk
Counterfactual Explanations for Models of Code
SEIP - Software Engineering in Practice
Jürgen Cito TU Wien and Meta, Işıl Dillig University of Texas at Austin, Vijayaraghavan Murali Meta Platforms, Inc., Satish Chandra Facebook
Pre-print Media Attached
13:30
90m
Talk
Nalin: Learning from Runtime Behavior to Find Name-Value Inconsistencies
Technical Track
Jibesh Patra University of Stuttgart, Michael Pradel University of Stuttgart
Pre-print Media Attached
13:30
90m
Talk
Learning to Find Usages of Library Functions in Optimized Binaries
Journal-First Papers
Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis, Anand Ashok Sawant University of California, Davis
Link to publication DOI Pre-print Media Attached
13:30
90m
Talk
DeepStability: A Study of Unstable Numerical Methods and Their Solutions in Deep Learning
Technical Track
Eliska Kloberdanz Iowa State University, Kyle Kloberdanz Cape Privacy, Wei Le Iowa State University
Pre-print Media Attached
13:30
90m
Talk
Fuzzing Class Specifications
Technical Track
Facundo Molina University of Rio Cuarto and CONICET, Argentina, Marcelo d'Amorim Federal University of Pernambuco, Nazareno Aguirre University of Rio Cuarto and CONICET, Argentina
Pre-print Media Attached
13:30
90m
Talk
Journal First Submission of the Article: What do class comments tell us? An investigation of comment evolution and practices in Pharo Smalltalk
Journal-First Papers
Pooja Rani University of bern, Sebastiano Panichella Zurich University of Applied Sciences, Manuel Leuenberger Software Composition Group, University of Bern, Switzerland, Mohammad Ghafari School of Computer Science, University of Auckland, Oscar Nierstrasz University of Bern, Switzerland
Link to publication DOI Authorizer link Media Attached

Information for Participants
Mon 9 May 2022 21:00 - 22:00 at ICSE room 5-odd hours - Program Analysis 3 Chair(s): Travis Breaux
Info for room ICSE room 5-odd hours:

Click here to go to the room on Midspace

Wed 11 May 2022 13:00 - 14:00 at ICSE room 4-odd hours - Synthesis and Reverse Engineering Chair(s): Reed Milewicz
Info for room ICSE room 4-odd hours:

Click here to go to the room on Midspace