Multi-Project Just-in-Time Software Defect Prediction Based on Multi-Task Learning for Mobile Applications
In the rapid development of mobile applications, frequent code commits pose significant challenges for quality assurance (QA). Just-in-Time Software Defect Prediction (JIT-SDP) helps at the commit level but often struggles due to insufficient labeled data, particularly in newer applications. To address this issue, we introduce JMFM, a novel approach leveraging Multi-Task Learning (MTL), Fuzzy C-Means (FCM) clustering, and Multi-Head Attention (MHA) for JIT-SDP. JMFM integrates multiple projects for training under the MTL framework, treating each project as a distinct task and enabling cross-project learning. In JMFM, FCM clustering determines the membership of each data sample to various clusters, which is then used in the MHA module to compute the weights of similarity between samples. By integrating the weighted sum of other samples’ data, each sample is augmented with additional information for shared learning. Simultaneously, each project is trained in a task-specific layer to retain its unique features. We calculate the joint loss by giving greater weight to projects with fewer samples to ensure that they are not overshadowed by larger projects. Experiments on 15 Android mobile applications show that JMFM outperforms existing models on metrics such as F1, MCC and AUC especially for projects with scarce data.
Thu 3 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
11:00 - 12:30 | Testing ML Systems and Fault LocalisationIndustry / Research Papers at Aula Magna (AM) Chair(s): Atif Memon Apple | ||
11:00 15mTalk | On Accelerating Deep Neural Network Mutation Analysis by Neuron and Mutant Clustering Research Papers Pre-print | ||
11:15 15mTalk | Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Research Papers Stefano Carlo Lambertenghi Technische Universität München, fortiss GmbH, Hannes Leonhard Technical University of Munich, Andrea Stocco Technical University of Munich, fortiss Pre-print | ||
11:30 15mTalk | Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code Research Papers Shahin Honarvar Imperial College London, Mark van der Wilk University of Oxford, Alastair F. Donaldson Imperial College London | ||
11:45 15mTalk | Taming Uncertainty for Critical Scenario Generation in Automated Driving Industry Selma Grosse DENSO Automotive GmbH, Dejan Nickovic Austrian Institute of Technology, Cristinel Mateis AIT Austrian Institute of Technology GmbH, Alessio Gambi Austrian Institute of Technology (AIT), Adam Molin DENSO AUTOMOTIVE | ||
12:00 15mTalk | Multi-Project Just-in-Time Software Defect Prediction Based on Multi-Task Learning for Mobile Applications Research Papers Feng Chen Chongqing University of Posts and Telecommunications, Ke Yuxin Chongqing University of Posts and Telecommunications, Liu Xin Chongqing University of Posts and Telecommunications, Wei Qingjie Chongqing University of Posts and Telecommunications | ||
12:15 15mTalk | Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces Industry Neetha Jambigi University of Cologne, Bartosz Bogacz SAP SE, Moritz Mueller SAP SE, Thomas Bach SAP, Michael Felderer German Aerospace Center (DLR) & University of Cologne |