Tue 10 May 2022 20:15 - 20:20 at ICSE room 3-even hours - Tools and Environments 2 Chair(s): Anthony I. (Tony) Wasserman
Fri 27 May 2022 09:20 - 09:25 at Room 306+307 - Papers 18: Recommender Systems, tools and environments Chair(s): Christian Bird
Fri 27 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 3
Over the years, researchers capitalized on the repetitiveness of software changes to automate many software evolution tasks. Despite the extraordinary rise in popularity of Python-based ML systems, they do not benefit from these advances. Without knowing what are the repetitive changes that ML developers make, researchers, tool, and library designers miss opportunities for automation, and ML developers fail to learn and use common practices.
To fill the knowledge gap and advance the science and tooling in ML software evolution, we conducted the first and most fine-grained study on code change patterns in a diverse corpus of 1000 top-rated ML systems comprising 58 million SLOC. To conduct this study we reuse, adapt, and improve upon the state-of-the-art repetitive change mining techniques. Our novel tool, R-CPatMiner, mines over 4M commits and constructs 350K fine-grained change graphs and detects 28K change patterns. Using thematic analysis, we identified 22 pattern groups and we reveal 4 major trends of how ML developers change their code. We sent a survey to 650 ML developers to further shed light on these patterns and their applications. We present actionable, empirically-justified implications for four audiences: (i) researchers, (ii) tool builders and IDE designers, (iii) ML library vendors, and (iv) developers and educators.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
11:00 - 12:00 | Tools and Environments 4NIER - New Ideas and Emerging Results / Technical Track / SEIP - Software Engineering in Practice at ICSE room 5-odd hours Chair(s): Guido Salvaneschi University of St. Gallen | ||
11:00 5mTalk | Towards Property-Based Tests in Natural Language NIER - New Ideas and Emerging Results Colin Gordon Drexel University Pre-print Media Attached | ||
11:05 5mTalk | Using a Semantic Knowledge Base to Improve the Managementof Security Reports in Industrial DevOps Projects SEIP - Software Engineering in Practice Pre-print Media Attached | ||
11:10 5mTalk | What's bothering developers in code review? SEIP - Software Engineering in Practice Emma Söderberg Lund University, Luke Church University of Cambridge | Lund University | Lark Systems, Jürgen Börstler Blekinge Institute of Technology, Diederick Niehorster Lund University, Christofer Rydenfält Lund University Pre-print Media Attached | ||
11:15 5mTalk | "Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint SEIP - Software Engineering in Practice Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands Pre-print Media Attached | ||
11:20 5mTalk | Discovering Repetitive Code Changes in Python ML Systems Technical Track Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA DOI Pre-print Media Attached | ||
11:25 5mTalk | OJXPerf: Featherlight Object Replica Detection for Java Programs Technical Track Bolun Li North Carolina State University, Hao Xu College of William and Mary, Qidong Zhao North Carolina State University, Pengfei Su University of California, Merced, Milind Chabbi Scalable Machines Research, Shuyin Jiao North Carolina State University, Xu Liu North Carolina State University, Oak Ridge National Laboratory, USA DOI Pre-print Media Attached |
20:00 - 21:00 | Tools and Environments 2Journal-First Papers / Technical Track / SEIP - Software Engineering in Practice / NIER - New Ideas and Emerging Results at ICSE room 3-even hours Chair(s): Anthony I. (Tony) Wasserman Carnegie Mellon Silicon Valley | ||
20:00 5mTalk | Are You Still Working on This? An Empirical Study on Pull Request Abandonment Journal-First Papers Zhixing Li College of Computer, National University of Defense Technology, Changsha, China, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Tao Wang National University of Defense Technology, Gang Yin National University of Defense Technology, Shanshan Li National University of Defense Technology, Huaimin Wang National University of Defense Technology Link to publication DOI Pre-print File Attached | ||
20:05 5mTalk | Towards Property-Based Tests in Natural Language NIER - New Ideas and Emerging Results Colin Gordon Drexel University Pre-print Media Attached | ||
20:10 5mTalk | A Unified Code Review Automation for Large-scale Industry with Diverse Development Environments SEIP - Software Engineering in Practice Hyungjin Kim Samsung Research, Samsung Electronics, Yonghwi Kwon Samsung Research, Samsung Electronics, Hyukin Kwon Samsung Research, Samsung Electronics, Yeonhee Ryou Samsung Research, Samsung Electronics, Sangwoo Joh Samsung Research, Samsung Electronics, Taeksu Kim Samsung Research, Samsung Electronics, Chul-Joo Kim Samsung Research, Samsung Electronics DOI Pre-print Media Attached | ||
20:15 5mTalk | Discovering Repetitive Code Changes in Python ML Systems Technical Track Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA DOI Pre-print Media Attached | ||
20:20 5mTalk | OJXPerf: Featherlight Object Replica Detection for Java Programs Technical Track Bolun Li North Carolina State University, Hao Xu College of William and Mary, Qidong Zhao North Carolina State University, Pengfei Su University of California, Merced, Milind Chabbi Scalable Machines Research, Shuyin Jiao North Carolina State University, Xu Liu North Carolina State University, Oak Ridge National Laboratory, USA DOI Pre-print Media Attached |
Fri 27 MayDisplayed time zone: Eastern Time (US & Canada) change
09:00 - 10:30 | Papers 18: Recommender Systems, tools and environmentsTechnical Track / Journal-First Papers / NIER - New Ideas and Emerging Results / SEIP - Software Engineering in Practice at Room 306+307 Chair(s): Christian Bird Microsoft Research | ||
09:00 5mTalk | Predicting the Objective and Priority of Issue Reports in Software Repositories Journal-First Papers Maliheh Izadi Sharif University of Technology, Kiana Akbari Sharif University of technology, Abbas Heydarnoori Sharif University of Technology Link to publication DOI Pre-print Media Attached | ||
09:05 5mTalk | Using Deep Learning to Generate Complete Log Statements Technical Track Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print Media Attached | ||
09:10 5mTalk | Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning NIER - New Ideas and Emerging Results Martin Weyssow DIRO, Université de Montréal, Houari Sahraoui Université de Montréal, Bang Liu DIRO & Mila, Université de Montréal Pre-print Media Attached | ||
09:15 5mTalk | "Project smells" — Experiences in Analysing the Software Quality of ML Projects with mllint SEIP - Software Engineering in Practice Bart van Oort Delft University of Technology, Luís Cruz Deflt University of Technology, Babak Loni ING Bank N.V., Arie van Deursen Delft University of Technology, Netherlands Pre-print Media Attached | ||
09:20 5mTalk | Discovering Repetitive Code Changes in Python ML Systems Technical Track Malinda Dilhara University of Colorado Boulder, USA, Ameya Ketkar Oregon State University, USA, Nikhith Sannidhi University of Colorado Boulder, Danny Dig University of Colorado Boulder, USA DOI Pre-print Media Attached | ||
09:25 5mTalk | FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment Technical Track Maxime Cordy University of Luxembourg, Luxembourg, Renaud Rwemalika University of Luxembourg, Adriano Franci University of Luxembourg, Mike Papadakis University of Luxembourg, Luxembourg, Mark Harman University College London Pre-print Media Attached | ||
09:30 5mTalk | Semantic Image Fuzzing of AI Perception Systems Technical Track Trey Woodlief University of Virginia, Sebastian Elbaum University of Virginia, Kevin Sullivan University of Virginia DOI Pre-print Media Attached | ||
09:35 5mTalk | Understanding and improving artifact sharing in software engineering research Journal-First Papers Christopher Steven Timperley Carnegie Mellon University, Lauren Herckis Carnegie Mellon University, Claire Le Goues Carnegie Mellon University, Michael Hilton Carnegie Mellon University, USA Link to publication DOI Pre-print Media Attached | ||
09:40 5mTalk | ARCLIN: Automated API Mention Resolution for Unformatted Texts Technical Track Yintong Huo The Chinese University of Hong Kong, Yuxin Su Sun Yat-sen University, Hongming Zhang The Hong Kong University of Science and Technology, Michael Lyu The Chinese University of Hong Kong DOI Pre-print Media Attached |