Dependency Tracking for Risk Mitigation in Machine Learning (ML) Systems
Thu 12 May 2022 04:10 - 04:15 at ICSE room 1-even hours - Machine Learning with and for SE 3 Chair(s): Antinisca Di Marco
In a Machine Learning (ML) system, characteristics of the ML components create new challenges for software system design and development activities. Data-dependent behavior causes risks in ML systems. Dealing with such risks in the development phase requires non-trivial costs due to un-controllable data generation processes in the test phase. In addition, ML systems often need continuous monitoring and validation in run-time. In this paper, we demonstrate the risk of ML systems because of the unknown data generation processes and model uncertainty and propose an integrated dependency tracking system that balances the cost and risks in the development stage and operation stage. Our solution uses blockchain to track the co-evolution of the models and the corresponding datasets. As blockchain provides a transparent and immutable data store, the provenance of data and models stored on the blockchain provides a trustworthy trace for dependencies between datasets and models at the development phase, and predictions at the operation phase. A graph database is further used to store and visualize these dependencies for risk mitigation. We evaluate the technical feasibility of our solution using a real-world scenario, including machine-learning models for distinguishing beef produced from different farms in Australia.
Wed 11 MayDisplayed time zone: Eastern Time (US & Canada) change
| 20:00 - 21:00 | Machine Learning with and for SE 7SEIP - Software Engineering in Practice / Technical Track / Journal-First Papers at ICSE room 1-even hours  Chair(s): Lei Ma University of Alberta | ||
| 20:005m Talk | Journal First: On the Value of Oversampling for Deep Learning in Software Defect Prediction Journal-First PapersMedia Attached | ||
| 20:055m Talk | In-IDE Code Generation from Natural Language: Promise and Challenges Journal-First Papers Frank Xu Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Graham Neubig Carnegie Mellon University | ||
| 20:105m Talk | Dependency Tracking for Risk Mitigation in Machine Learning (ML) Systems SEIP - Software Engineering in Practice Xiwei (Sherry) Xu CSIRO Data61, Chen Wang CSIRO DATA61, Zhen Wang CSIRO Data61, Qinghua Lu CSIRO’s Data61, Liming Zhu CSIRO’s Data61; UNSWMedia Attached | ||
| 20:155m Talk | Strategies for Reuse and Sharing among Data Scientists in Software Teams SEIP - Software Engineering in Practice Will Epperson Carnegie Mellon University, April Wang University of Michigan, Robert DeLine Microsoft Research, Steven M. Drucker Microsoft ResearchPre-print Media Attached | ||
| 20:205m Talk | A Universal Data Augmentation Approach for Fault Localization Technical Track Huan Xie Chongqing University, Yan Lei School of Big Data & Software Engineering, Chongqing University, Meng Yan Chongqing University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Xin Xia Huawei Software Engineering Application Technology Lab, Xiaoguang Mao National University of Defense TechnologyDOI Pre-print Media Attached | ||
| 20:255m Talk | Explanation-Guided Fairness Testing through Genetic Algorithm Technical Track Ming Fan Xi'an Jiaotong University, Wenying Wei Xi'an Jiaotong University, Wuxia Jin Xi'an Jiaotong University, Zijiang Yang Western Michigan University, Ting Liu Xi'an Jiaotong UniversityDOI Pre-print | ||
Thu 12 MayDisplayed time zone: Eastern Time (US & Canada) change
| 04:00 - 05:00 | Machine Learning with and for SE 3Technical Track / Journal-First Papers / SEIP - Software Engineering in Practice at ICSE room 1-even hours  Chair(s): Antinisca Di Marco University of L'Aquila | ||
| 04:005m Talk | In-IDE Code Generation from Natural Language: Promise and Challenges Journal-First Papers Frank Xu Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Graham Neubig Carnegie Mellon University | ||
| 04:055m Talk | Active Learning of Discriminative Subgraph Patterns for API Misuse Detection Journal-First PapersPre-print Media Attached File Attached | ||
| 04:105m Talk | Dependency Tracking for Risk Mitigation in Machine Learning (ML) Systems SEIP - Software Engineering in Practice Xiwei (Sherry) Xu CSIRO Data61, Chen Wang CSIRO DATA61, Zhen Wang CSIRO Data61, Qinghua Lu CSIRO’s Data61, Liming Zhu CSIRO’s Data61; UNSWMedia Attached | ||
| 04:155m Talk | DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs Technical Track Jialun Cao Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Meiziniu LI Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Xiao Chen Huazhong University of Science and Technology, Ming Wen Huazhong University of Science and Technology, Yongqiang Tian The Hong Kong University of Science and Technology; University of Waterloo, Bo Wu MIT-IBM Watson AI Lab in Cambridge, Shing-Chi Cheung Hong Kong University of Science and TechnologyDOI Pre-print Media Attached | ||
| 04:205m Talk | What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code Technical Track Yao Wan Huazhong University of Science and Technology, Wei Zhao Huazhong University of Science and Technology, Hongyu Zhang University of Newcastle, Yulei Sui University of Technology Sydney, Guandong Xu University of Technology, Sydney, Hai Jin Huazhong University of Science and TechnologyPre-print Media Attached | ||
| 04:255m Talk | A Universal Data Augmentation Approach for Fault Localization Technical Track Huan Xie Chongqing University, Yan Lei School of Big Data & Software Engineering, Chongqing University, Meng Yan Chongqing University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Xin Xia Huawei Software Engineering Application Technology Lab, Xiaoguang Mao National University of Defense TechnologyDOI Pre-print Media Attached | ||
| 04:305m Talk | DeepState: Selecting Test Suites to Enhance the Robustness of Recurrent Neural Networks Technical Track Zixi Liu Nanjing University, Yang Feng Nanjing University, Yining Yin Nanjing University, China, Zhenyu Chen Nanjing UniversityDOI Pre-print Media Attached | ||


