ICSE 2024
Fri 12 - Sun 21 April 2024 Lisbon, Portugal
Fri 19 Apr 2024 11:30 - 11:45 at Amália Rodrigues - Analytics 4 Chair(s): Gabriele Bavota

Nowadays, software analytics tools using machine learning (ML) models to, for example, predict the risk of a code change are well established. However, as the goals of a project shift over time, and developers and their habits change, the performance of said models tends to degrade (drift) over time. Current retraining practices typically require retraining a new model from scratch on a large updated dataset when performance decay is observed, thus incurring a computational cost; also there is no continuity between the models as the past model is discarded and ignored during the new model training. Even though the literature has taken interest in online learning approaches, those have rarely been integrated and evaluated in industrial environments.

This paper evaluates the use of lifelong learning (LL) for industrial use cases at Ubisoft, evaluating both the performance and the required computational effort in comparison to the retraining-from-scratch approaches commonly used by the industry. LL is used to continuously build and maintain ML-based software analytics tools using an incremental learner that progressively updates the old model using new data. To avoid so-called ``catastrophic forgetting'' of important older data points, we adopt a replay buffer of older data, which still allows us to drastically reduce the size of the overall training dataset, and hence model training time.

Empirical evaluation of our LL approach on two industrial use cases, i.e., a brown build detector and a just-in-time risk prediction tool, shows how LL in practice manages to at least match traditional retraining-from-scratch performance in terms of F1-score, while using 3.3-13.7x less data at each update, thus considerably speeding up the model updating process. Considering both the computational effort of updates and the time between model updates, the LL setup needs 2-40x less computational effort than retraining-from-scratch setups, thus clearly showing the potential of LL setups in the industry.

Fri 19 Apr

Displayed time zone: Lisbon change

11:00 - 12:30
Analytics 4Demonstrations / Software Engineering in Practice / Journal-first Papers / Research Track at Amália Rodrigues
Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana
11:00
15m
Research paper
Shedding Light on Software Engineering-specific Metaphors and Idioms
Research Track
Mia Mohammad Imran Virginia Commonwealth University, Preetha Chatterjee Drexel University, USA, Kostadin Damevski Virginia Commonwealth University
Pre-print
11:15
15m
Talk
MiniMon: Minimizing Android Applications with Intelligent Monitoring-Based Debloating
Research Track
Jiakun Liu Singapore Management University, Zicheng Zhang School of Computing and Information Systems, Singapore Management University, Xing Hu Zhejiang University, Ferdian Thung Singapore Management University, Shahar Maoz Tel Aviv University, Debin Gao Singapore Management University, Eran Toch Tel Aviv University, Zhipeng Zhao Singapore Management University, David Lo Singapore Management University
11:30
15m
Talk
On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics - Empirical Study on Brown Build and Risk Prediction
Software Engineering in Practice
Doriane Olewicki Queen's University, Sarra Habchi Ubisoft Montréal, Mathieu Nayrolles Ubisoft Montreal, Mojtaba Faramarzi Université de Montréal, Sarath Chandar Polytechnique Montréal, Bram Adams Queen's University
Pre-print
11:45
15m
Talk
An Ethnographic Study on the CI of A Large Scale Project
Software Engineering in Practice
Zikuan Wang Nanjing University, Bohan Liu Nanjing University, Zeye Zhan Nanjing University, He Zhang Nanjing University, Gongyuan Li Nanjing University
12:00
7m
Talk
An Empirical Study of Refactoring Rhythms and Tactics in the Software Development Process
Journal-first Papers
Shayan Noei Queen's University, Heng Li Polytechnique Montréal, Stefanos Georgiou Queen's University, Ying Zou Queen's University, Kingston, Ontario
12:07
7m
Talk
Insights into Software Development Approaches: Mining Q&A Repositories [Journal-first]
Journal-first Papers
Arif Ali Khan University of Oulu, Javed Ali Khan University of Hertforshire Hertfordshire, UK, Muhammad Azeem Akbar LUT University, Zhou Peng Nanjing University of Aeronautics and Astronautics Nanjing, China, Mahdi Fahmideh University of Southern Queensland, Arif Ali Khan University of Oulu, Arif Ali Khan University of Oulu
Link to publication DOI
12:14
7m
Talk
Can My Microservice Tolerate an Unreliable Database? Resilience Testing with Fault Injection and Visualization
Demonstrations
Michael Assad Technical University of Munich, Christopher S. Meiklejohn Carnegie Mellon University, Heather Miller Carnegie Mellon University and Two Sigma, Stephan Krusche Technical University of Munich
DOI Pre-print Media Attached
12:21
7m
Talk
CATMA: Conformance Analysis Tool For Microservice Applications
Demonstrations
Clinton Cao Delft University of Technology, Simon Schneider Hamburg University of Technology, Nicolás E. Díaz Ferreyra Hamburg University of Technology, Sicco Verwer TU Delft, Annibale Panichella Delft University of Technology, Riccardo Scandariato Hamburg University of Technology
Pre-print Media Attached