Effective Model Replacement for Solving Objective Mismatches in Pre-trained Model Compositions
Pre-trained models (PTMs) have revolutionized machine learning by significantly enhancing reusability and reducing resources required for model training. Despite their advantages, selecting appropriate PTMs for specific system requirements remains difficult due to application heterogeneity and variable task performance. This has led to the proposal of PTM compositions to enhance capabilities beyond individual models, with an on-the-fly approach being applied further to meet dynamic requirements. However, composing PTMs on-the-fly can result in objective mismatch, leading to inefficiencies and errors in constituent PTMs. This necessitates the timely and efficient replacement of underperforming PTMs. The process is a major challenge due to the vast number of candidates and the extensive time required for evaluation. Therefore, we propose the Sample-Infer-Predict framework for efficient on-the-fly PTM replacement which comprises three phases: sampling, inference, and prediction. First, our novel Density and Diversity sampling algorithm efficiently selects representative user inputs. Second, the inference phase evaluates candidate PTMs from model hubs for the prediction dataset. Third, the prediction phase utilizes the dataset to predict the optimal PTM replacements. We evaluate our methodology using a vehicle detection PTM composition and a dataset of 5849 vehicle images, focusing on the efficiency of the replacement process, the quality of the meta-predictions, and the effectiveness of our sampling technique. Our approach reduces replacement time (155s vs. 28293s), achieves high precision@k values, and lowers mean absolute error values compared to other sampling techniques.
Thu 5 DecDisplayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change
16:00 - 17:30 | Session (12)Technical Track at Room 1 (Zunhui Room) Chair(s): Tao Zhang Macau University of Science and Technology | ||
16:00 30mTalk | SDEFL: A Lightweight Fault Detection and Localization Method for Deep Neural Networks Technical Track Bo Yang Beijing Forestry University, Jiawei Hu Beijing Forestry University, Jialun Cao Hong Kong University of Science and Technology | ||
16:30 30mTalk | A Study of Using Multimodal LLMs for Non-Crash Functional Bug Detection in Android Apps Technical Track Bangyan Ju University of Cincinnati, Jin Yang University of Cincinnati, Tingting Yu University of Connecticut, Tamerlan Abdullayev University of Cincinnati, Yuanyuan Wu University of Cincinati, Dingbang Wang University of Connecticut, Yu Zhao | ||
17:00 30mTalk | Effective Model Replacement for Solving Objective Mismatches in Pre-trained Model Compositions Technical Track Arogya Kharel School of Computing, KAIST, KyeongDeok Baek School of Computing, KAIST, In-Young Ko Korea Advanced Institute of Science and Technology |