Wed 11 May 2022 12:20 - 12:25 at ICSE room 1-even hours - Machine Learning with and for SE 11 Chair(s): Ipek Ozkaya
Fri 27 May 2022 11:15 - 11:20 at Room 301+302 - Papers 19: Machine Learning with and for SE 2 Chair(s): Dalal Alrajeh
Fri 27 May 2022 13:30 - 15:00 at Ballroom Gallery - Posters 3
Well-trained machine-learning models, which leverage large amounts of open-source software data, have now become an interesting approach to automating many software engineering tasks. Several SE tasks have all been subject to this approach, with performance gradually improving over the past several years with better models and training methods. More, and more diverse, clean, labeled data is better for training; but constructing good-quality datasets is time-consuming and challenging. Ways of augmenting the volume and diversity of clean, labeled data generally have wide applicability. For some languages (e.g., Ruby) labeled data is less abundant; in others (e.g., JavaScript) the available data maybe more focused on some application domains, and thus less diverse. As a way around such data bottlenecks, we present evidence suggesting that human-written code in different languages (which performs the same function), is rather similar, and particularly preserving of identifier naming patterns; we further present evidence suggesting that identifiers are a very important element of training data for software engineering tasks. We leverage this rather fortuitous phenomenon to find evidence that available multilingual training data (across different languages) can be used to amplify performance for each language. We do this for 3 different tasks: code summarization, code retrieval, and function naming. We note that this data-augmenting approach is broadly compatible with different tasks, languages, and machine-learning models.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
| 21:00 - 22:00 | Machine Learning with and for SE 6Technical Track at ICSE room 3-odd hours  Chair(s): Ali Ouni ETS Montreal, University of Quebec | ||
| 21:005m Talk | DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs Technical Track Jialun Cao Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Meiziniu LI Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Xiao Chen Huazhong University of Science and Technology, Ming Wen Huazhong University of Science and Technology, Yongqiang Tian The Hong Kong University of Science and Technology; University of Waterloo, Bo Wu MIT-IBM Watson AI Lab in Cambridge, Shing-Chi Cheung Hong Kong University of Science and TechnologyDOI Pre-print Media Attached | ||
| 21:055m Talk | Fast Changeset-based Bug Localization with BERT Technical Track Agnieszka Ciborowska  Virginia Commonwealth University, Kostadin Damevski Virginia Commonwealth UniversityPre-print Media Attached | ||
| 21:105m Talk | Multilingual training for Software Engineering Technical Track Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, DavisDOI Pre-print Media Attached | ||
| 21:155m Talk | NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification Technical Track haibin zheng Zhejiang University of Technology, Zhiqing Chen Zhejiang University of Technology, Tianyu Du Zhejiang University, Xuhong Zhang Zhejiang University, Yao Cheng Huawei International, Shouling Ji Zhejiang University, Jingyi Wang Zhejiang University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Jinyin Chen College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, ChinaDOI Pre-print Media Attached | ||
| 21:205m Talk | Type4Py: Practical Deep Similarity Learning-Based Type Inference for Python Technical Track Amir Mir Delft University of Technology, Evaldas Latoskinas Delft University of Technology, Sebastian Proksch Delft University of Technology, Netherlands, Georgios Gousios Endor Labs & Delft University of TechnologyDOI Pre-print Media Attached | ||
| 21:255m Talk | Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules Technical TrackPre-print Media Attached | ||
Wed 11 MayDisplayed time zone: Eastern Time (US & Canada) change
| 12:00 - 13:00 | Machine Learning with and for SE 11Journal-First Papers / Technical Track at ICSE room 1-even hours  Chair(s): Ipek Ozkaya Carnegie Mellon Software Engineering Institute | ||
| 12:005m Talk | Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection Journal-First Papers Nadia Daoudi SnT, University of Luxembourg, Kevin Allix University of Luxembourg, Tegawendé F. Bissyandé SnT, University of Luxembourg, Jacques Klein University of LuxembourgLink to publication Pre-print Media Attached | ||
| 12:055m Talk | DeepAnalyze: Learning to Localize Crashes at Scale Technical Track Manish Shetty Microsoft Research, India, Chetan Bansal Microsoft Research, Suman Nath Microsoft Corporation, Sean Bowles Microsoft, Henry Wang Microsoft, Ozgur Arman Microsoft, Siamak Ahari MicrosoftPre-print Media Attached | ||
| 12:105m Talk | EREBA: Black-box Energy Testing of Adaptive Neural Networks Technical Track Mirazul Haque UT Dallas, Yaswanth Yadlapalli University of Texas at Dallas, Wei Yang University of Texas at Dallas, Cong Liu University of Texas at Dallas, USAPre-print Media Attached | ||
| 12:155m Talk | Fast Changeset-based Bug Localization with BERT Technical Track Agnieszka Ciborowska  Virginia Commonwealth University, Kostadin Damevski Virginia Commonwealth UniversityPre-print Media Attached | ||
| 12:205m Talk | Multilingual training for Software Engineering Technical Track Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, DavisDOI Pre-print Media Attached | ||
| 12:255m Talk | Using Pre-Trained Models to Boost Code Review Automation Technical Track Rosalia Tufano Università della Svizzera Italiana, Simone Masiero Software Institute @ Università della Svizzera Italiana, Antonio Mastropaolo Università della Svizzera italiana, Luca Pascarella Università della Svizzera italiana (USI), Denys Poshyvanyk William and Mary, Gabriele Bavota Software Institute, USI Università della Svizzera italianaPre-print Media Attached | ||

