Are All Code Reviews the Same? Identifying and Assessing the Impact of Merge Request Deviations
Code review is a fundamental practice in software engineering, ensuring code quality, fostering collaboration, and reducing defects. While research has extensively examined various aspects of this process, most studies assume that all code reviews follow a standardized evaluation workflow. However, our industrial partner, which uses the Merge Request (MR) mechanism for code review, reports that this assumption does not always hold in practice. Many MRs serve alternative purposes beyond rigorous code evaluation and often bypass the standard review process, requiring minimal oversight. We refer to these cases as deviations, as they disrupt expected workflow patterns. For example, work-in-progress (WIP) MRs may be used as draft implementations without the intention of being reviewed, MRs with very large changes are often created for code rebases, and library updates typically involve dependency version changes that require little or no review effort. We hypothesize that overlooking MR deviations can lead to biased analytics and reduce the reliability of machine learning (ML) models used to explain the code review process.
This study addresses these challenges by first identifying MR deviations. Our findings show that deviations occur in up to 37.02% of MRs, spanning seven distinct categories. In addition, we develop a detection approach leveraging few-shot learning, achieving up to 91% accuracy in identifying these deviations. Furthermore, we examine the impact of removing MR deviations on ML models that predict code review completion time. Removing deviations significantly improves model performance in 53.33% of cases, with gains of up to 2.25 times. Their exclusion also significantly affects model interpretation, strongly altering overall feature importance rankings in 47% of cases and top-k rankings in 60% of cases.
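To make the few-shot idea concrete, the sketch below shows one possible way such a detector could operate: a handful of labeled example MR titles per deviation category are embedded, and a new MR is assigned to the category whose prototype (mean embedding) is most similar. The model name, example texts, and category labels are illustrative assumptions for this sketch, not the detection approach actually used in the study.

```python
# Hypothetical sketch of few-shot MR deviation detection via embedding prototypes.
# Assumes the sentence-transformers package; categories and texts are illustrative.
import numpy as np
from sentence_transformers import SentenceTransformer

# A few labeled examples per (hypothetical) deviation category.
FEW_SHOT_EXAMPLES = {
    "wip_draft": [
        "WIP: prototype of new caching layer, do not review yet",
        "Draft: exploring alternative API design",
    ],
    "rebase": [
        "Rebase feature branch onto main after release 3.2",
        "Sync branch with upstream master (rebase only)",
    ],
    "library_update": [
        "Bump lodash from 4.17.20 to 4.17.21",
        "Update dependency versions in requirements.txt",
    ],
    "regular_review": [
        "Fix null pointer exception in payment service",
        "Add input validation to user registration endpoint",
    ],
}

model = SentenceTransformer("all-MiniLM-L6-v2")

# Build one prototype vector (mean embedding) per category.
prototypes = {
    label: model.encode(examples).mean(axis=0)
    for label, examples in FEW_SHOT_EXAMPLES.items()
}

def classify_mr(title_and_description: str) -> str:
    """Assign an MR to the category with the most similar prototype (cosine)."""
    vec = model.encode([title_and_description])[0]

    def cosine(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    return max(prototypes, key=lambda label: cosine(vec, prototypes[label]))

print(classify_mr("chore: bump numpy to 1.26.4"))         # likely "library_update"
print(classify_mr("WIP: refactor scheduler, not ready"))  # likely "wip_draft"
```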
Our contributions include: (1) a clear definition and categorization of MR deviations, (2) a novel AI-based detection method leveraging few-shot learning, and (3) an empirical analysis of how excluding MR deviations affects ML models that explain code review completion time. These contributions improve the efficiency of code review analysis while increasing the accuracy of software engineering research and industrial process optimization.