Write a Blog >>
MSR 2022
Mon 23 - Tue 24 May 2022
co-located with ICSE 2022

Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. However, unlike the other well-known code review environments, such as Gerrit or GitHub, there does not exist a readily accessible public dataset of Phabricator. This paper describes our experience mining code reviews from five different projects that use Phabricator (Blender, FreeBSD, KDE, LLVM, and Mozilla) and discusses the problems associated with the data retrieval process. We publish a dataset with details of 317,476 code reviews conducted via Phabricator. Our dataset is available in both JSON and MySQL database dump formats. The dataset enables analyses of the history of code reviews at a finer granular level than is possible on the other platforms. In addition, given that the projects we mined are accessible via the Conduit API, our dataset can be used as a foundation to fetch additional details and insights.

Wed 18 May

Displayed time zone: Eastern Time (US & Canada) change

21:00 - 21:50
Session 7: Developer Wellbeing & Project CommunicationTechnical Papers / Data and Tool Showcase Track / Industry Track at MSR Main room - odd hours
Chair(s): Bram Adams Queen's University, Kingston, Ontario
21:00
7m
Talk
On the Violation of Honesty in Mobile Apps: Automated Detection and CategoriesDistinguished Paper Award
Technical Papers
Humphrey Obie Monash University, Idowu Oselumhe Ilekura Data Science Nigeria, Hung Du Applied Artificial Intelligence Institute, Deakin University, Mojtaba Shahin RMIT University, Australia, John Grundy Monash University, Li Li Monash University, Jon Whittle CSIRO's Data61 and Monash University, Burak Turhan University of Oulu
Pre-print
21:07
7m
Talk
How heated is it? Understanding GitHub locked issues
Technical Papers
Isabella Ferreira Polytechnique Montréal, Bram Adams Queen's University, Kingston, Ontario, Jinghui Cheng Polytechnique Montreal
Pre-print Media Attached
21:14
4m
Talk
The OCEAN mailing list data set: Network analysis spanning mailing lists and code repositories
Data and Tool Showcase Track
Melanie Warrick University of Vermont, Samuel F. Rosenblatt University of Vermont, Jean-Gabriel Young University of Vermont, amanda casari Open Source Programs Office, Google, Laurent Hébert-Dufresne University of Vermont, James P. Bagrow University of Vermont
DOI Pre-print Media Attached
21:18
4m
Talk
The Unexplored Treasure Trove of Phabricator Code Reviews
Data and Tool Showcase Track
Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
DOI Pre-print
21:22
4m
Talk
The Unsolvable Problem or the Unheard Answer? A Dataset of 24,669 Open-Source Software Conference Talks
Data and Tool Showcase Track
Kimberly Truong Oregon State University, Courtney Miller Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Christian Kästner Carnegie Mellon University
DOI Pre-print
21:26
4m
Talk
Exploring Apache Incubator Project Trajectories with APEX
Data and Tool Showcase Track
Anirudh Ramchandran University of California, Davis, Likang Yin University of California, Davis, Vladimir Filkov University of California at Davis
21:30
7m
Talk
A Culture of Productivity: Maximizing Productivity by Maximizing Wellbeing
Industry Track
Brian Houck Microsoft Research
21:37
13m
Live Q&A
Discussions and Q&A
Technical Papers

Mon 23 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
Blended Technical Session 1 (Integration, Large-scale mining, and Software Ecosystems)Technical Papers / Data and Tool Showcase Track at Room 315+316
Chair(s): Bogdan Vasilescu Carnegie Mellon University, USA
11:00
15m
Talk
Do Small Code Changes Merge Faster? A Multi-Language Empirical Investigation
Technical Papers
Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
DOI Pre-print
11:15
15m
Talk
Mining Code Review Data to Understand Waiting Times Between Acceptance and Merging: An Empirical Analysis
Technical Papers
Gunnar Kudrjavets University of Groningen, Aditya Kumar Snap, Inc., Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
DOI Pre-print
11:30
8m
Talk
Dataset: Dependency Networks of Open Source Libraries Available Through CocoaPods, Carthage and Swift PM
Data and Tool Showcase Track
Kristiina Rahkema University of Tartu, Dietmar Pfahl University of Tartu
Pre-print Media Attached
11:38
8m
Talk
A Large-scale Dataset of (Open Source) License Text VariantsData and Tool Showcase Award
Data and Tool Showcase Track
Stefano Zacchiroli Télécom Paris, Polytechnic Institute of Paris
DOI Pre-print
11:46
8m
Talk
TSSB-3M: Mining single statement bugs at massive scale
Data and Tool Showcase Track
Cedric Richter Carl von Ossietzky Universität Oldenburg / University of Oldenburg, Heike Wehrheim Carl von Ossietzky Universität Oldenburg / University of Oldenburg
Pre-print Media Attached
11:54
8m
Talk
LAGOON: An Analysis Tool for Open Source Communities
Data and Tool Showcase Track
Sourya Dey Galois, Inc., Walt Woods Galois, Inc.
Pre-print Media Attached
12:02
8m
Talk
The Unexplored Treasure Trove of Phabricator Code Reviews
Data and Tool Showcase Track
Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands
DOI Pre-print
12:10
20m
Live Q&A
Discussions and Q&A
Technical Papers


Information for Participants
Wed 18 May 2022 21:00 - 21:50 at MSR Main room - odd hours - Session 7: Developer Wellbeing & Project Communication Chair(s): Bram Adams
Info for room MSR Main room - odd hours:

Click here to go to the room on Midspace