Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends
The ubiquity of large-scale Pre-Trained Models (PTMs) is on the rise, sparking interest in model hubs, and dedicated platforms for hosting PTMs. Despite this trend, a comprehensive exploration of the challenges that users encounter and how the community leverages PTMs remains lacking. To address this gap, we conducted an extensive mixed-methods empirical study by focusing on discussion forums and the model hub of HuggingFace, the largest public model hub. Based on our qualitative analysis, we present a taxonomy of the challenges and benefits associated with PTM reuse within this community. We then conduct a quantitative study to track model-type trends and model documentation evolution over time. Our findings highlight prevalent challenges such as limited guidance for beginner users, struggles with model output comprehensibility in training or inference, and a lack of model understanding. We also identified interesting trends among models where some models maintain high upload rates despite a decline in topics related to them. Additionally, we found that despite the introduction of model documentation tools, its quantity has not increased over time, leading to difficulties in model comprehension and selection among users. Our study sheds light on new challenges in reusing PTMs that were not reported before and we provide recommendations for various stakeholders involved in PTM reuse.
Thu 14 MarDisplayed time zone: Athens change
14:00 - 15:30 | |||
14:00 15mTalk | Exploring Markers and Drivers of Gender Bias in Machine Translations Research Papers Pre-print | ||
14:15 15mTalk | Delving into Parameter-Efficient Fine-Tuning in Code Change Learning: An Empirical Study Research Papers Shuo Liu City University of Hong Kong, Jacky Keung City University of Hong Kong, Zhen Yang Shandong University, Fang Liu Beihang University, Qilin Zhou City University of Hong Kong, Yihan Liao City University of Hong Kong | ||
14:30 15mTalk | Catch the Butterfly: Peeking into the Terms and Conflicts among SPDX Licenses Research Papers Liu Tao , Chengwei Liu Nanyang Technological University, Tianwei Liu School of Cyber Engineering, Xidian University, He Wang School of Cyber Engineering, Xidian University, Gaofei Wu School of Cyber Engineering, Xidian University, Yang Liu Nanyang Technological University, Yuqing Zhang University of Chinese Academy of Sciences; Zhongguancun Laboratory | ||
14:45 15mTalk | Comparative Study of Reinforcement Learning in GitHub Pull Request Outcome Predictions Research Papers | ||
15:00 15mTalk | On the Usefulness of Python Structural Pattern Matching: An Empirical Study Research Papers Norbert Vándor University of Szeged, Gabor Antal University of Szeged, Peter Hegedus University of Szeged, Rudolf Ferenc University of Szeged | ||
15:15 15mTalk | Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends Research Papers Mina Taraghi Polytechnique Montréal, Gianolli Dorcelus Polytechnique Montréal, Armstrong Tita Foundjem Ecole Polytechnique de Montreal, Florian Tambon Polytechnique Montréal, Foutse Khomh Polytechnique Montréal Pre-print |