ICSE 2022
Sun 8 - Fri 27 May 2022
Wed 11 May 2022 05:10 - 05:15 at ICSE room 1-odd hours - Machine Learning with and for SE 2 Chair(s): Gemma Catolino
Thu 12 May 2022 13:20 - 13:25 at ICSE room 4-odd hours - Machine Learning with and for SE 12 Chair(s): Wei Yang
Wed 25 May 2022 11:00 - 11:05 at Room 301+302 - Papers 6: Machine Learning with and for SE 1 Chair(s): Baishakhi Ray

Machine translation plays an essential role in people’s daily international communication. However, machine translation systems are far from perfect. To tackle this problem, researchers have proposed several approaches to testing machine translation. A promising trend among these approaches is to use word replacement, where only one word in the original sentence is replaced with another word to form a sentence pair. However, precise control of the impact of word replacement remains an outstanding issue in these approaches.
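The word-replacement idea described above can be illustrated with a small sketch. Everything in it is an assumption made for illustration: `toy_translate` is a hypothetical stand-in for a real translation system (with a deliberately planted bug), the lexicon is invented, and the token-overlap check with a 0.7 threshold is only one simple way to compare the two translations, not the method used by any particular tool.

```python
from difflib import SequenceMatcher

# Hypothetical word-by-word lexicon standing in for a real translator.
LEXICON = {
    "the": "le", "dog": "chien", "cat": "chat",
    "bird": "oiseau", "likes": "aime", "fish": "poisson",
}


def toy_translate(sentence):
    """Hypothetical translator with a planted bug: it drops the last
    word whenever the input contains 'bird'."""
    words = sentence.split()
    output = [LEXICON.get(w, w) for w in words]
    if "bird" in words:
        output = output[:-1]
    return " ".join(output)


def replace_one_word(sentence, index, new_word):
    """Form the second sentence of the pair by replacing exactly one word."""
    words = sentence.split()
    words[index] = new_word
    return " ".join(words)


def translation_consistency(first, second):
    """Token-level overlap between two translations, in [0, 1]."""
    return SequenceMatcher(None, first.split(), second.split()).ratio()


def is_inconsistent(sentence, index, new_word, threshold=0.7):
    """Flag the pair when the two translations differ more than a
    one-word replacement should explain (threshold is an assumption)."""
    mutated = replace_one_word(sentence, index, new_word)
    score = translation_consistency(toy_translate(sentence),
                                    toy_translate(mutated))
    return score < threshold


if __name__ == "__main__":
    for candidate in ("cat", "bird"):
        flagged = is_inconsistent("the dog likes fish", 1, candidate)
        print(candidate, "suspicious" if flagged else "consistent")
```

Replacing "dog" with "cat" changes only one token of the toy output, so the pair passes; replacing it with "bird" triggers the planted bug and the pair is flagged.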

To address this issue, we propose CAT, a novel word-replacement-based approach, whose basic idea is to identify word replacement with controlled impact (referred to as isotopic replacement). To achieve this purpose, we use a neural-based language model to encode the sentence context, and design a neural-network-based algorithm to evaluate context-aware semantic similarity between two words. Furthermore, similar to TransRepair, a state-of-the-art word-replacement-based approach, CAT also provides automatic fixing of revealed bugs without model retraining.
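To make the notion of context-aware similarity concrete, here is a toy sketch. It is not CAT's neural model: the three-dimensional hand-made vectors, the element-wise gating by the mean context vector, and the 0.9 threshold are all assumptions chosen only to show how the same word pair can score differently depending on the surrounding sentence.

```python
import math

# Hand-made toy vectors (dimensions: finance, river, other). Assumed data.
VECTORS = {
    "bank":    [0.7, 0.7, 0.0],
    "lender":  [1.0, 0.1, 0.0],
    "shore":   [0.1, 1.0, 0.0],
    "deposit": [0.8, 0.2, 0.0],
    "money":   [0.9, 0.1, 0.0],
    "water":   [0.2, 0.8, 0.0],
    "river":   [0.1, 0.9, 0.0],
}


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)


def context_vector(words, skip_index):
    """Mean vector of the known context words, excluding the target word.
    Falls back to all ones (no gating) when no context word is known."""
    known = [VECTORS[w] for i, w in enumerate(words)
             if i != skip_index and w in VECTORS]
    if not known:
        return [1.0, 1.0, 1.0]
    return [sum(col) / len(known) for col in zip(*known)]


def similarity_in_context(words, index, candidate):
    """Gate both word vectors by the context, then compare them."""
    ctx = context_vector(words, index)
    original = [a * c for a, c in zip(VECTORS[words[index]], ctx)]
    replaced = [a * c for a, c in zip(VECTORS[candidate], ctx)]
    return cosine(original, replaced)


def pick_replacements(words, index, candidates, threshold=0.9):
    """Keep candidates whose in-context similarity reaches the threshold."""
    return [c for c in candidates
            if similarity_in_context(words, index, c) >= threshold]


if __name__ == "__main__":
    finance = "deposit money at the bank".split()
    nature = "water by the river bank".split()
    print(pick_replacements(finance, 4, ["lender", "shore"]))
    print(pick_replacements(nature, 4, ["lender", "shore"]))
```

In this toy setup "bank" is close to "lender" in the finance sentence and close to "shore" in the river sentence, which is the kind of distinction a context-free similarity measure cannot make.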

Our evaluation on Google Translate and Transformer indicates that CAT achieves significant improvements over TransRepair. In particular, 1) CAT detects seven more types of bugs than TransRepair; 2) CAT detects 129% more translation bugs than TransRepair; 3) CAT repairs twice more bugs than TransRepair, many of which may bring serious consequences if left unfixed; and 4) CAT has better efficiency than TransRepair in input generation (0.01s vs. 0.41s) and comparable efficiency with TransRepair in bug repair (1.92s vs. 1.34s).

Wed 11 May

Displayed time zone: Eastern Time (US & Canada)

05:00 - 06:00
Machine Learning with and for SE 2
Technical Track / Journal-First Papers / SEIP - Software Engineering in Practice at ICSE room 1-odd hours
Chair(s): Gemma Catolino Tilburg University & Jheronimus Academy of Data Science
05:00
5m
Talk
Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection
Journal-First Papers
Nadia Daoudi SnT, University of Luxembourg, Kevin Allix University of Luxembourg, Tegawendé F. Bissyandé SnT, University of Luxembourg, Jacques Klein University of Luxembourg
05:05
5m
Talk
Mining Root Cause Knowledge from Cloud Service Incident Investigations for AIOps
SEIP - Software Engineering in Practice
Amrita Saha Salesforce Research Asia, Steven C.H. Hoi Salesforce Research Asia
05:10
5m
Talk
Improving Machine Translation Systems via Isotopic Replacement
Technical Track
Zeyu Sun Peking University, Jie M. Zhang King's College London, Yingfei Xiong Peking University, Mark Harman University College London, Mike Papadakis University of Luxembourg, Luxembourg, Lu Zhang Peking University
05:15
5m
Talk
Detecting False Alarms from Automatic Static Analysis Tools: How Far are We? (Nominated for Distinguished Paper)
Technical Track
Hong Jin Kang Singapore Management University, Khai Loong Aw Singapore Management University, David Lo Singapore Management University
05:20
5m
Talk
DeepAnalyze: Learning to Localize Crashes at Scale
Technical Track
Manish Shetty Microsoft Research, India, Chetan Bansal Microsoft Research, Suman Nath Microsoft Corporation, Sean Bowles Microsoft, Henry Wang Microsoft, Ozgur Arman Microsoft, Siamak Ahari Microsoft

Thu 12 May

13:00 - 14:00
Machine Learning with and for SE 12
Journal-First Papers / Technical Track / NIER - New Ideas and Emerging Results at ICSE room 4-odd hours
Chair(s): Wei Yang University of Texas at Dallas
13:00
5m
Talk
Modeling Functional Similarity in Source Code with Graph-Based Siamese Networks
Journal-First Papers
Nikita Mehrotra Indraprastha Institute of Information Technology, Navdha Agarwal Indraprastha Institute of Information Technology, Delhi, Piyush Gupta Indraprastha Institute of Information Technology, Delhi, Saket Anand Indraprastha Institute of Information Technology, Delhi, David Lo Singapore Management University, Rahul Purandare IIIT-Delhi
13:05
5m
Talk
Revisiting Process versus Product Metrics: a Large Scale Analysis
Journal-First Papers
Suvodeep Majumder North Carolina State University, Pranav Mody North Carolina State University, Tim Menzies North Carolina State University
13:10
5m
Talk
Learning to Recognize Actionable Static Code Warnings (is Intrinsically Easy)
Journal-First Papers
Xueqi Yang NCSU, Jianfeng Chen North Carolina State University, Rahul Yedida North Carolina State University, Zhe Yu, Tim Menzies North Carolina State University
13:15
5m
Talk
Improving the Learnability of Machine Learning APIs by Semi-Automated API Wrapping
NIER - New Ideas and Emerging Results
Lars Reimann University of Bonn, Günter Kniesel-Wünsche University of Bonn
13:20
5m
Talk
Improving Machine Translation Systems via Isotopic Replacement
Technical Track
Zeyu Sun Peking University, Jie M. Zhang King's College London, Yingfei Xiong Peking University, Mark Harman University College London, Mike Papadakis University of Luxembourg, Luxembourg, Lu Zhang Peking University
13:25
5m
Talk
Collaboration Challenges in Building ML-Enabled Systems: Communication, Documentation, Engineering, and Process (Distinguished Paper Award)
Technical Track
Nadia Nahar Carnegie Mellon University, Shurui Zhou University of Toronto, Grace Lewis Carnegie Mellon Software Engineering Institute, Christian Kästner Carnegie Mellon University

Wed 25 May

11:00 - 12:30
Papers 6: Machine Learning with and for SE 1
Technical Track / Journal-First Papers / SEIP - Software Engineering in Practice at Room 301+302
Chair(s): Baishakhi Ray Columbia University
11:00
5m
Talk
Improving Machine Translation Systems via Isotopic Replacement
Technical Track
Zeyu Sun Peking University, Jie M. Zhang King's College London, Yingfei Xiong Peking University, Mark Harman University College London, Mike Papadakis University of Luxembourg, Luxembourg, Lu Zhang Peking University
11:05
5m
Talk
Detecting False Alarms from Automatic Static Analysis Tools: How Far are We? (Nominated for Distinguished Paper)
Technical Track
Hong Jin Kang Singapore Management University, Khai Loong Aw Singapore Management University, David Lo Singapore Management University
11:10
5m
Talk
Active Learning of Discriminative Subgraph Patterns for API Misuse Detection
Journal-First Papers
Hong Jin Kang Singapore Management University, David Lo Singapore Management University
11:15
5m
Talk
In-IDE Code Generation from Natural Language: Promise and Challenges
Journal-First Papers
Frank Xu Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Graham Neubig Carnegie Mellon University
11:20
5m
Talk
Strategies for Reuse and Sharing among Data Scientists in Software Teams
SEIP - Software Engineering in Practice
Will Epperson Carnegie Mellon University, April Wang University of Michigan, Robert DeLine Microsoft Research, Steven M. Drucker Microsoft Research
11:25
5m
Talk
Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules
Technical Track
Rangeet Pan Iowa State University, USA, Hridesh Rajan Iowa State University
11:30
5m
Talk
Fairness-aware Configuration of Machine Learning Libraries
Technical Track
Saeid Tizpaz-Niari University of Texas at El Paso, Ashish Kumar, Gang Tan Pennsylvania State University, Ashutosh Trivedi University of Colorado Boulder
11:35
5m
Talk
Automated Handling of Anaphoric Ambiguity in Requirements: A Multi-solution Study
Technical Track
Saad Ezzini University of Luxembourg, Sallam Abualhaija University of Luxembourg, Chetan Arora Deakin University, Mehrdad Sabetzadeh University of Ottawa

Information for Participants
Wed 11 May 2022 05:00 - 06:00 at ICSE room 1-odd hours - Machine Learning with and for SE 2 Chair(s): Gemma Catolino
Thu 12 May 2022 13:00 - 14:00 at ICSE room 4-odd hours - Machine Learning with and for SE 12 Chair(s): Wei Yang