Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules
Tue 10 May 2022 21:25 - 21:30 at ICSE room 3-odd hours - Machine Learning with and for SE 6 Chair(s): Ali Ouni
Wed 25 May 2022 11:25 - 11:30 at Room 301+302 - Papers 6: Machine Learning with and for SE 1 Chair(s): Baishakhi Ray
Training from scratch is the most common way to build a Convolutional Neural Network (CNN) based model. What if we can build new CNN models by reusing parts from previously build CNN models? What if we can improve a CNN model by replacing (possibly faulty) parts with other parts? In both cases, instead of training, can we identify the part responsible for each output class (module) in the model(s) and reuse or replace only the desired output classes to build a model? Prior work has proposed decomposing dense-based networks into modules (one for each output class) to enable reusability and replaceability in various scenarios. However, this work is limited to the dense layers and based on the one-to-one relationship between the nodes in consecutive layers. Due to the shared architecture in the CNN model, prior work cannot be adapted directly. In this paper, we propose to decompose a CNN model used for image classification problems into modules for each output class. These modules can further be reused or replaced to build a new model. We have evaluated our approach with CIFAR-10, CIFAR-100, and ImageNet tiny datasets with three variations of ResNet models and found that enabling decomposition comes with a small cost (1.77% and 0.85% for top-1 and top-5 accuracy, respectively). Also, building a model by reusing or replacing modules can be done with a 2.3% and 0.5% average loss of accuracy. Furthermore, reusing and replacing these modules reduces CO2e emission by ∼37 times compared to training the model from scratch.
Tue 10 MayDisplayed time zone: Eastern Time (US & Canada) change
21:00 - 22:00 | Machine Learning with and for SE 6Technical Track at ICSE room 3-odd hours Chair(s): Ali Ouni ETS Montreal, University of Quebec | ||
21:00 5mTalk | DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs Technical Track Jialun Cao Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Meiziniu LI Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Xiao Chen Huazhong University of Science and Technology, Ming Wen Huazhong University of Science and Technology, Yongqiang Tian University of Waterloo, Bo Wu MIT-IBM Watson AI Lab in Cambridge, Shing-Chi Cheung Hong Kong University of Science and Technology DOI Pre-print Media Attached | ||
21:05 5mTalk | Fast Changeset-based Bug Localization with BERT Technical Track Agnieszka Ciborowska Virginia Commonwealth University, Kostadin Damevski Virginia Commonwealth University Pre-print Media Attached | ||
21:10 5mTalk | Multilingual training for Software Engineering Technical Track Toufique Ahmed University of California at Davis, Prem Devanbu Department of Computer Science, University of California, Davis DOI Pre-print Media Attached | ||
21:15 5mTalk | NeuronFair: Interpretable White-Box Fairness Testing through Biased Neuron Identification Technical Track haibin zheng Zhejiang University of Technology, Zhiqing Chen Zhejiang University of Technology, Tianyu Du Zhejiang University, Xuhong Zhang Zhejiang University, Yao Cheng Huawei International, Shouling Ji Zhejiang University, Jingyi Wang Zhejiang University, Yue Yu College of Computer, National University of Defense Technology, Changsha 410073, China, Jinyin Chen College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China DOI Pre-print Media Attached | ||
21:20 5mTalk | Type4Py: Practical Deep Similarity Learning-Based Type Inference for Python Technical Track Amir Mir Delft University of Technology, Evaldas Latoskinas Delft University of Technology, Sebastian Proksch Delft University of Technology, Netherlands, Georgios Gousios Endor Labs & Delft University of Technology DOI Pre-print Media Attached | ||
21:25 5mTalk | Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules Technical Track Pre-print Media Attached |