Write a Blog >>
ICSE 2022
Sun 8 - Fri 27 May 2022
Tue 10 May 2022 11:20 - 11:25 at ICSE room 5-odd hours - Tools and Environments 4 Chair(s): Guido Salvaneschi
Tue 10 May 2022 20:15 - 20:20 at ICSE room 3-even hours - Tools and Environments 2 Chair(s): Anthony I. (Tony) Wasserman
Fri 27 May 2022 09:20 - 09:25 at Room 306+307 - Papers 18: Recommender Systems, tools and environments Chair(s): Christian Bird
Fri 27 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 3

Over the years, researchers capitalized on the repetitiveness of software changes to automate many software evolution tasks. Despite the extraordinary rise in popularity of Python-based ML systems, they do not benefit from these advances. Without knowing what are the repetitive changes that ML developers make, researchers, tool, and library designers miss opportunities for automation, and ML developers fail to learn and use common practices.

To fill the knowledge gap and advance the science and tooling in ML software evolution, we conducted the first and most fine-grained study on code change patterns in a diverse corpus of 1000 top-rated ML systems comprising 58 million SLOC. To conduct this study we reuse, adapt, and improve upon the state-of-the-art repetitive change mining techniques. Our novel tool, R-CPatMiner, mines over 4M commits and constructs 350K fine-grained change graphs and detects 28K change patterns. Using thematic analysis, we identified 22 pattern groups and we reveal 4 major trends of how ML developers change their code. We sent a survey to 650 ML developers to further shed light on these patterns and their applications. We present actionable, empirically-justified implications for four audiences: (i) researchers, (ii) tool builders and IDE designers, (iii) ML library vendors, and (iv) developers and educators.

Tue 10 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:00
11:00
5m
Talk
Towards Property-Based Tests in Natural Language
NIER - New Ideas and Emerging Results
Colin Gordon Drexel University
Pre-print Media Attached
11:05
5m
Talk
Using a Semantic Knowledge Base to Improve the Managementof Security Reports in Industrial DevOps Projects
SEIP - Software Engineering in Practice
Markus Voggenreiter Siemens Technology / LMU Munich, Ulrich Schöpp fortiss GmbH
Pre-print Media Attached
11:10
5m
Talk
What's bothering developers in code review?
SEIP - Software Engineering in Practice
Emma Söderberg Lund University, Luke Church University of Cambridge | Lund University | Lark Systems, Jürgen Börstler Blekinge Institute of Technology, Diederick Niehorster Lund University, Christofer Rydenfält Lund University
Pre-print Media Attached
11:15
5m
Talk
"Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint
SEIP - Software Engineering in Practice
Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands
Pre-print Media Attached
11:20
5m
Talk
Discovering Repetitive Code Changes in Python ML Systems
Technical Track
Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA
DOI Pre-print Media Attached
11:25
5m
Talk
OJXPerf: Featherlight Object Replica Detection for Java Programs
Technical Track
Bolun Li North Carolina State University, Hao Xu College of William and Mary, Qidong Zhao North Carolina State University, Pengfei Su University of California, Merced, Milind Chabbi Scalable Machines Research, Shuyin Jiao North Carolina State University, Xu Liu North Carolina State University, Oak Ridge National Laboratory, USA
DOI Pre-print Media Attached
20:00 - 21:00
20:00
5m
Talk
Are You Still Working on This? An Empirical Study on Pull Request Abandonment
Journal-First Papers
Zhixing Li College of Computer, National University of Defense Technology, Changsha, China, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Tao Wang National University of Defense Technology, Gang Yin National University of Defense Technology, Shanshan Li National University of Defense Technology, Huaimin Wang National University of Defense Technology
Link to publication DOI Pre-print File Attached
20:05
5m
Talk
Towards Property-Based Tests in Natural Language
NIER - New Ideas and Emerging Results
Colin Gordon Drexel University
Pre-print Media Attached
20:10
5m
Talk
A Unified Code Review Automation for Large-scale Industry with Diverse Development Environments
SEIP - Software Engineering in Practice
Hyungjin Kim Samsung Research, Samsung Electronics, Yonghwi Kwon Samsung Research, Samsung Electronics, Hyukin Kwon Samsung Research, Samsung Electronics, Yeonhee Ryou Samsung Research, Samsung Electronics, Sangwoo Joh Samsung Research, Samsung Electronics, Taeksu Kim Samsung Research, Samsung Electronics, Chul-Joo Kim Samsung Research, Samsung Electronics
DOI Pre-print Media Attached
20:15
5m
Talk
Discovering Repetitive Code Changes in Python ML Systems
Technical Track
Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA
DOI Pre-print Media Attached
20:20
5m
Talk
OJXPerf: Featherlight Object Replica Detection for Java Programs
Technical Track
Bolun Li North Carolina State University, Hao Xu College of William and Mary, Qidong Zhao North Carolina State University, Pengfei Su University of California, Merced, Milind Chabbi Scalable Machines Research, Shuyin Jiao North Carolina State University, Xu Liu North Carolina State University, Oak Ridge National Laboratory, USA
DOI Pre-print Media Attached

Fri 27 May

Displayed time zone: Eastern Time (US & Canada) change

09:00 - 10:30
Papers 18: Recommender Systems, tools and environmentsTechnical Track / Journal-First Papers / NIER - New Ideas and Emerging Results / SEIP - Software Engineering in Practice at Room 306+307
Chair(s): Christian Bird Microsoft Research
09:00
5m
Talk
Predicting the Objective and Priority of Issue Reports in Software Repositories
Journal-First Papers
Maliheh Izadi Sharif University of Technology, Kiana Akbari Sharif University of technology, Abbas Heydarnoori Sharif University of Technology
Link to publication DOI Pre-print Media Attached
09:05
5m
Talk
Using Deep Learning to Generate Complete Log Statements
Technical Track
Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana
Pre-print Media Attached
09:10
5m
Talk
Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning
NIER - New Ideas and Emerging Results
Martin Weyssow DIRO, Université de Montréal, Houari Sahraoui Université de Montréal, Bang Liu DIRO & Mila, Université de Montréal
Pre-print Media Attached
09:15
5m
Talk
"Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint
SEIP - Software Engineering in Practice
Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands
Pre-print Media Attached
09:20
5m
Talk
Discovering Repetitive Code Changes in Python ML Systems
Technical Track
Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA
DOI Pre-print Media Attached
09:25
5m
Talk
FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment
Technical Track
Maxime Cordy University of Luxembourg, Luxembourg, Renaud Rwemalika University of Luxembourg, Adriano Franci University of Luxembourg, Mike Papadakis University of Luxembourg, Luxembourg, Mark Harman University College London
Pre-print Media Attached
09:30
5m
Talk
Semantic Image Fuzzing of AI Perception Systems
Technical Track
Trey Woodlief University of Virginia, Sebastian Elbaum University of Virginia, Kevin Sullivan University of Virginia
DOI Pre-print Media Attached
09:35
5m
Talk
Understanding and improving artifact sharing in software engineering research
Journal-First Papers
Christopher Steven Timperley Carnegie Mellon University, Lauren Herckis Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA
Link to publication DOI Pre-print Media Attached
09:40
5m
Talk
ARCLIN: Automated API Mention Resolution for Unformatted Texts
Technical Track
Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Hongming Zhang The Hong Kong University of Science and Technology, Michael Lyu The Chinese University of Hong Kong
DOI Pre-print Media Attached
13:30 - 15:00
13:30
90m
Talk
Investigating User Perceptions of Conversational Agents for Software-related Exploratory Web Search
NIER - New Ideas and Emerging Results
Matthew Frazier University of Delaware, Shaayal Kumar University of Delaware, Kostadin Damevski Virginia Commonwealth University, Lori Pollock University of Delaware
DOI Pre-print Media Attached
13:30
90m
Talk
Bots for Pull Requests: The Good, the Bad, and the Promising
Technical Track
Mairieli Wessel Delft University of Technology, Ahmad Abdellatif Concordia University, Igor Wiese Federal University of Technology - Paraná (UTFPR), Tayana Conte Universidade Federal do Amazonas, Emad Shihab Concordia University, Marco Gerosa Northern Arizona University, USA, Igor Steinmacher Northern Arizona University
Pre-print
13:30
90m
Talk
Post2Vec: Learning Distributed Representations of Stack Overflow Posts
Journal-First Papers
Bowen Xu Singapore Management University, Thong Hoang Singapore Management University, Singapore, Abhishek Sharma Veracode, Inc., Chengran Yang Singapore Management University, Xin Xia Huawei Software Engineering Application Technology Lab, David Lo Singapore Management University
Link to publication DOI Pre-print
13:30
90m
Talk
Detecting Interpersonal Conflict in Issues and Code Review: Cross Pollinating Open- and Closed-Source Approaches
SEIS - Software Engineering in Society
Huilian Sophie Qiu Carnegie Mellon University, USA, Bogdan Vasilescu Carnegie Mellon University, USA, Christian Kästner Carnegie Mellon University, Carolyn Egelman Google, Ciera Jaspan , Emerson Murphy-Hill Google
Pre-print Media Attached
13:30
90m
Poster
Poster: Comprehensive Comparisons of Embedding Approaches for Cryptographic API Completion
Posters
Ya Xiao Virginia Tech, Salman Ahmed Virginia Polytechnic Institute and State University, Xinyang Ge Microsoft Research, Bimal Viswanath Virginia Tech, Na Meng Virginia Tech, Daphne Yao Virginia Tech
13:30
90m
Talk
Semantic Image Fuzzing of AI Perception Systems
Technical Track
Trey Woodlief University of Virginia, Sebastian Elbaum University of Virginia, Kevin Sullivan University of Virginia
DOI Pre-print Media Attached
13:30
90m
To Disengage or Not to Disengage: A Look at Contributor Disengagement in Open Source Software
SRC - ACM Student Research Competition
Philip Gray New College of Florida
13:30
90m
Talk
Hashing It Out: A Survey of Programmers’ Cannabis Usage, Perception, and Motivation
Technical Track
Madeline Endres University of Michigan, Kevin Boehnke University of Michigan, Westley Weimer University of Michigan
DOI Pre-print Media Attached
13:30
90m
Talk
Bus Factor In Practice
SEIP - Software Engineering in Practice
Elgun Jabrayilzade Bilkent University, Mikhail Evtikhiev JetBrains Research, Eray Tüzün Bilkent University, Vladimir Kovalenko JetBrains Research
Pre-print Media Attached
13:30
90m
Talk
Garbage Collection Makes Rust Easier to Use: A Randomized Controlled Trial of the Bronze Garbage CollectorNominated for Distinguished Paper
Technical Track
Michael Coblenz University of Maryland at College Park, Michelle Mazurek University of Maryland, Michael Hicks University of Maryland at College Park
DOI Pre-print Media Attached
13:30
90m
Talk
Learning and Programming Challenges of Rust: A Mixed-Methods Study
Technical Track
Shuofei Zhu The Pennsylvania State University, Ziyi Zhang University of Wisconsin–Madison, Boqin Qin China Telecom Cloud Computing Corporation, Aiping Xiong The Pennsylvania State University, Linhai Song Pennsylvania State University, USA
DOI Pre-print Media Attached
13:30
90m
Talk
Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning
NIER - New Ideas and Emerging Results
Martin Weyssow DIRO, Université de Montréal, Houari Sahraoui Université de Montréal, Bang Liu DIRO & Mila, Université de Montréal
Pre-print Media Attached
13:30
90m
Talk
Defect Reduction Planning (using TimeLIME)
Journal-First Papers
Kewen Peng North Carolina State University, Tim Menzies North Carolina State University
Authorizer link Pre-print Media Attached
13:30
90m
Demonstration
Gamekins: Gamifying Software Testing in Jenkins
DEMO - Demonstrations
Philipp Straubinger University of Passau, Gordon Fraser University of Passau
DOI Pre-print Media Attached
13:30
90m
Talk
How Do I Refactor This? An Empirical Study on Refactoring Trends and Topics in Stack Overflow
Journal-First Papers
Anthony Peruma Rochester Institute of Technology, Steven Simmons Rochester Institute of Technology, Eman Abdullah AlOmar Stevens Institute of Technology, Christian D. Newman Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec
Link to publication DOI Pre-print Media Attached
13:30
90m
Talk
Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection
Journal-First Papers
Nadia Daoudi SnT, University of Luxembourg, Kevin Allix University of Luxembourg, Tegawendé F. Bissyandé SnT, University of Luxembourg, Jacques Klein University of Luxembourg
Link to publication Pre-print Media Attached
13:30
90m
Mu2: Using Mutation Analysis to Guide Mutation-Based Fuzzing
SRC - ACM Student Research Competition
Isabella Laybourn Carnegie Mellon Silicon Valley
13:30
90m
Talk
Emotions and Perceived Productivity of Software Developers at the Workplace
Journal-First Papers
Daniela Girardi University of Bari, Filippo Lanubile University of Bari, Nicole Novielli University of Bari, Alexander Serebrenik Eindhoven University of Technology
Link to publication DOI Pre-print Media Attached
13:30
90m
Poster
CRustS: A Transpiler from Unsafe C to Safer Rust
Posters
Michael Ling Huawei Technologies Canada, Yijun Yu The Open University, UK, Haitao Wu Huawei Technologies Canada, Yuan Wang Huawei Sweden Research Center, James R. Cordy Queen's University, Ahmed E. Hassan Queen's University
13:30
90m
Talk
Multilingual training for Software Engineering
Technical Track
Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis
DOI Pre-print Media Attached
13:30
90m
Talk
An Empirical Investigation on the Challenges Faced by Women in the Software Industry: A Case StudySEIS-track Award
SEIS - Software Engineering in Society
Bianca Trinkenreich Northern of Arizona Univeristy, Ricardo Britto Ericsson / Blekinge Institute of Technology, Marco Gerosa Northern Arizona University, USA, Igor Steinmacher Northern Arizona University
Pre-print Media Attached
13:30
90m
Talk
Using Deep Learning to Generate Complete Log Statements
Technical Track
Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana
Pre-print Media Attached
13:30
90m
Talk
Collaboration Challenges in Building ML-Enabled Systems: Communication, Documentation, Engineering, and ProcessDistinguished Paper Award
Technical Track
Nadia Nahar Carnegie Mellon University, Shurui Zhou University of Toronto, Grace Lewis Carnegie Mellon Software Engineering Institute, Christian Kästner Carnegie Mellon University
Pre-print Media Attached
13:30
90m
Talk
Discovering Repetitive Code Changes in Python ML Systems
Technical Track
Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA
DOI Pre-print Media Attached
13:30
90m
Talk
Towards Mining OSS Skills from GitHub Activity
NIER - New Ideas and Emerging Results
Jenny T. Liang University of Washington, Thomas Zimmermann Microsoft Research, Denae Ford Microsoft Research
DOI Pre-print Media Attached
13:30
90m
Talk
EREBA: Black-box Energy Testing of Adaptive Neural Networks
Technical Track
Mirazul Haque UT Dallas, Yaswanth Yadlapalli University of Texas at Dallas, Wei Yang University of Texas at Dallas, Cong Liu University of Texas at Dallas, USA
Pre-print Media Attached
13:30
90m
Talk
"Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint
SEIP - Software Engineering in Practice
Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands
Pre-print Media Attached
13:30
90m
Poster
Improving Responsiveness of Android Activity Navigation via Genetic Improvement
Posters
James Callan UCL, Justyna Petke University College London

Information for Participants
Tue 10 May 2022 11:00 - 12:00 at ICSE room 5-odd hours - Tools and Environments 4 Chair(s): Guido Salvaneschi
Info for room ICSE room 5-odd hours:

Click here to go to the room on Midspace

Tue 10 May 2022 20:00 - 21:00 at ICSE room 3-even hours - Tools and Environments 2 Chair(s): Anthony I. (Tony) Wasserman
Info for room ICSE room 3-even hours:

Click here to go to the room on Midspace