Learning Model Mutations From Faults in Deep Learning
In deep learning (DL) development, faults may originate from training program bugs or issues in the training data. Prior work shows that simulating such faults using pre-training mutation operators can produce realistic mutants. However, this realism comes at high computational cost due to repeated retraining. Post-training mutation avoids retraining, yet relies on a fixed set of predefined operators that struggle to capture the diversity of real faults and are sensitive to training stochasticity. To address these challenges, we propose a new direction for DL mutation testing by framing post-training mutation as a model-to-model transformation problem. Instead of manually designing mutation operators, we treat mutation generation as a learned transformation from an original trained model to its corresponding faulty or realistically mutated trained model. Our approach fine-tunes a pre-trained code model to learn this transformation, enabling direct manipulation of trained model parameters without retraining. As a proof of concept, we fine-tune CodeT5+ on a small neural network using controlled original–mutated model pairs. Even in a few-shot fine-tuning setting, the model successfully generates valid mutated models by modifying the parameters of the original trained model. These results demonstrate the feasibility of learning model-level mutations through model-to-model transformations. However, sequence-based representations face scalability limitations for larger networks. This motivates a transition toward graph neural networks that better align with the structural representation of deep neural models. This work lays the foundation for scalable, efficient, and realistic mutation testing for DL systems.
Mon 13 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
09:45 - 10:30 | |||
09:45 5mTalk | Understanding and Mitigating Library-Related Issues in LLM-Generated Code Journal Ahead Workshop (JAWs) Yacine Majdoub University of Gabes, Rinad Hamid University of Calgary, Canada, Eya Ben Charrada University of Gabes, Ahmad Abdellatif University of Calgary, Haifa Touati IReSCoMath Research Lab, Faculty of Sciences, University Of Gabes, Tunisia | ||
09:50 5mTalk | Magnifying Inefficiency: How LLMs Amplify Performance Anti-Patterns in Mobile Development Journal Ahead Workshop (JAWs) | ||
09:55 5mTalk | BRACE: Unified Benchmarking of Accuracy and Energy for Code Language Models Journal Ahead Workshop (JAWs) Mohammadjavad Mehditabar Dalhousie University, Saurabhsingh Rajput Dalhousie University, Antonio Mastropaolo William and Mary, USA, Tushar Sharma Dalhousie University Pre-print File Attached | ||
10:00 5mTalk | Learning Model Mutations From Faults in Deep Learning Journal Ahead Workshop (JAWs) Zaheed Ahmed Institute of Computer Science, University of Göttingen, Lower Saxony, Germany, Philip Makedonski Institute of Computer Science, University of Göttingen, Lower Saxony, Germany, Jens Grabowski Media Attached | ||
10:05 5mTalk | Artificial or Just Artful? Do LLMs Bend the Rules in Programming? Journal Ahead Workshop (JAWs) Oussama Ben Sghaier Queen's University, Kévin Delcourt Université de Montréal, Houari Sahraoui DIRO, Université de Montréal | ||
10:10 5mTalk | Towards Automated User Story Quality Assessment with LLMs: An Empirical Study on Syntactic and Pragmatic QUS Criteria Journal Ahead Workshop (JAWs) Izabella Silva Federal University of Campina Grande - ISE/VIRTUS, Emanuel Dantas Filho Federal University of Campina Grande - ISE/VIRTUS, Ademar Sousa Neto VIRTUS/UFCG, Mirko Perkusich VIRTUS, Danyllo Albuquerque VIRTUS/UFCG, Kyller Costa Gorgônio Federal University of Campina Grande, Angelo Percusich Federal University of Campina Grande - ISE/VIRTUS | ||
10:15 5mTalk | MARS: Few-Shot Android Malware Detection with RAG-Enhanced LLMs Journal Ahead Workshop (JAWs) Guangquan Xu School of Cybersecurity, Tianjin University, Minhong Dong School of Cybersecurity, Tianjin University, Qi Guo Tianjin University, Hongpeng Bai School of Cybersecurity, Tianjin University, Yao Zhang Tianjin University, Ruitao Feng Southern Cross University, Wenying He Hebei University of Technology, Yude Bai Tianjin University, Ji Zhang University of Southern Queensland | ||
10:20 5mTalk | A Closer Look at the Malicious Pre-Trained Models on Hugging Face Journal Ahead Workshop (JAWs) Junwei Zhang Zhejiang University, Xing Hu Zhejiang University, Xin Xia Zhejiang University, David Lo Singapore Management University, Shanping Li Zhejiang University | ||