The upper bound of information diffusion in code review (ESEIW 2024 - ESEM Journal-First Papers)

Who

Michael Dorner, Daniel Mendez, Krzysztof Wnuk, Ehsan Zabardast, Jacek Czerwonka

Track

ESEIW 2024 ESEM Journal-First Papers

Time Zone

The program is currently displayed in (GMT+02:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+02:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Thu 24 Oct 2024 15:15 - 15:30 at Multimedia (B3 Building - Hall) - Repository mining Chair(s): Apostolos Ampatzoglou

Abstract

Background: Code review, the discussion around a code change among humans, forms a communication network that enables its participants to exchange and spread information. Although reported by qualitative studies, our understanding of the capability of code review as a communication network is still limited.

Objective: In this article, we report on a first step towards understanding and evaluating the capability of code review as a communication network by quantifying how fast and how far information can spread through code review: the upper bound of information diffusion in code review.

Method: In an in-silico experiment, we simulate an artificial information diffusion within large (Microsoft), mid-sized (Spotify), and small code review systems (Trivago) modelled as communication networks. We then measure the minimal topological and temporal distances between the participants to quantify how far and how fast information can spread in code review.

Results: An average code review participants in the small and mid-sized code review systems can spread information to between 72% and 85% of all code review participants within four weeks independently of network size and tooling; for the large code review systems, we found an absolute boundary of about 11000 reachable participants. On average (median), information can spread between two participants in code review in less than five hops and less than five days.

Conclusion: We found evidence that the communication network emerging from code review scales well and spreads information fast and broadly, corroborating the findings of prior qualitative work. The study lays the foundation for understanding and improving code review as a communication network.

Link to Publication

https://link.springer.com/article/10.1007/s10664-024-10442-y

Link to Preprint

https://arxiv.org/abs/2306.08980

DOI

https://doi.org/10.1007/s10664-024-10442-y

Michael Dorner

Blekinge Institute of Technology

Sweden

Daniel Mendez

Blekinge Institute of Technology and fortiss

Sweden

Krzysztof Wnuk

Ehsan Zabardast

Blekinge Institute of Technology

Sweden

Jacek Czerwonka

Developer Services, Microsoft

United States

Time Zone

The program is currently displayed in (GMT+02:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+02:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Thu 24 Oct
Displayed time zone: Brussels, Copenhagen, Madrid, Paris change

14:00 - 15:30	Repository miningESEM Journal-First Papers / ESEM IGC / ESEM Technical Papers at Multimedia (B3 Building - Hall) Chair(s): Apostolos Ampatzoglou University of Macedonia

14:00 20m Full-paper		Decoding Android Permissions: A Study of Developer Challenges and Solutions on Stack Overflow ESEM Technical Papers Sahrima Jannat Oishwee University of Saskatchewan, Zadia Codabux University of Saskatchewan, Natalia Stakhanova University of Saskatchewan
14:20 20m Full-paper		Negative Results of Image Processing for Identifying Duplicate Questions on Stack Overflow ESEM Technical Papers Faiz Ahmed York University, Suprakash Datta York University, Maleknaz Nayebi York University
14:40 20m Full-paper		Understanding Fairness in Software Engineering: Insights from Stack Exchange Sites ESEM Technical Papers Emeralda Sesari University of Groningen, Federica Sarro University College London, Ayushi Rastogi University of Groningen, The Netherlands DOI Pre-print
15:00 15m Industry talk		Reducing Events to Augment Log-based Anomaly Detection Models: An Empirical Study ESEM IGC Lingzhe Zhang Peking University, China, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Kangjin Wang Alibaba Group, Mengxi Jia Peking University, Yong Yang , Ying Li School of Software and Microelectronics, Peking University, Beijing, China
15:15 15m Journal Early-Feedback		The upper bound of information diffusion in code review ESEM Journal-First Papers Michael Dorner Blekinge Institute of Technology, Daniel Mendez Blekinge Institute of Technology and fortiss, Krzysztof Wnuk , Ehsan Zabardast Blekinge Institute of Technology, Jacek Czerwonka Developer Services, Microsoft Link to publication DOI Pre-print