The Unexplored Treasure Trove of Phabricator Code Reviews
Mon 23 May 2022 12:02 - 12:10 at Room 315+316 - Blended Technical Session 1 (Integration, Large-scale mining, and Software Ecosystems) Chair(s): Bogdan Vasilescu
Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. However, unlike the other well-known code review environments, such as Gerrit or GitHub, there does not exist a readily accessible public dataset of Phabricator. This paper describes our experience mining code reviews from five different projects that use Phabricator (Blender, FreeBSD, KDE, LLVM, and Mozilla) and discusses the problems associated with the data retrieval process. We publish a dataset with details of 317,476 code reviews conducted via Phabricator. Our dataset is available in both JSON and MySQL database dump formats. The dataset enables analyses of the history of code reviews at a finer granular level than is possible on the other platforms. In addition, given that the projects we mined are accessible via the Conduit API, our dataset can be used as a foundation to fetch additional details and insights.
Wed 18 MayDisplayed time zone: Eastern Time (US & Canada) change
21:00 - 21:50 | Session 7: Developer Wellbeing & Project CommunicationTechnical Papers / Data and Tool Showcase Track / Industry Track at MSR Main room - odd hours Chair(s): Bram Adams Queen's University, Kingston, Ontario | ||
21:00 7mTalk | On the Violation of Honesty in Mobile Apps: Automated Detection and CategoriesDistinguished Paper Award Technical Papers Humphrey Obie Monash University, Idowu Oselumhe Ilekura Data Science Nigeria, Hung Du Applied Artificial Intelligence Institute, Deakin University, Mojtaba Shahin RMIT University, Australia, John Grundy Monash University, Li Li Monash University, Jon Whittle CSIRO's Data61 and Monash University, Burak Turhan University of Oulu Pre-print | ||
21:07 7mTalk | How heated is it? Understanding GitHub locked issues Technical Papers Isabella Ferreira Polytechnique Montréal, Bram Adams Queen's University, Kingston, Ontario, Jinghui Cheng Polytechnique Montreal Pre-print Media Attached | ||
21:14 4mTalk | The OCEAN mailing list data set: Network analysis spanning mailing lists and code repositories Data and Tool Showcase Track Melanie Warrick University of Vermont, Samuel F. Rosenblatt University of Vermont, Jean-Gabriel Young University of Vermont, amanda casari Open Source Programs Office, Google, Laurent Hébert-Dufresne University of Vermont, James P. Bagrow University of Vermont DOI Pre-print Media Attached | ||
21:18 4mTalk | The Unexplored Treasure Trove of Phabricator Code Reviews Data and Tool Showcase Track Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands DOI Pre-print | ||
21:22 4mTalk | The Unsolvable Problem or the Unheard Answer? A Dataset of 24,669 Open-Source Software Conference Talks Data and Tool Showcase Track Kimberly Truong Oregon State University, Courtney Miller Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Christian Kästner Carnegie Mellon University DOI Pre-print | ||
21:26 4mTalk | Exploring Apache Incubator Project Trajectories with APEX Data and Tool Showcase Track Anirudh Ramchandran University of California, Davis, Likang Yin University of California, Davis, Vladimir Filkov University of California at Davis | ||
21:30 7mTalk | A Culture of Productivity: Maximizing Productivity by Maximizing Wellbeing Industry Track Brian Houck Microsoft Research | ||
21:37 13mLive Q&A | Discussions and Q&A Technical Papers |