Two Approaches to Survival Analysis of Open Source Python Projects
A recent study applied frequentist survival analysis methods to a subset of the Software Heritage Graph and determined which attributes of an OSS project contribute to its health. This paper serves as a replication of that study. In addition, we apply Bayesian survival analysis methods to the same dataset and investigate an extra project attribute. Both analyses focus on the effects of certain attributes on the survival of open-source software projects as measured by their revision activity. Methods such as the Kaplan-Meier estimator, Cox Proportional-Hazards model, and the visualization of posterior survival functions were used for each of the project attributes. The results show that projects which publish major releases, have repositories on multiple hosting services, possess a large team of developers, and make frequent revisions have a higher likelihood of survival in the long run. The findings were similar to the original study; however, a deeper look revealed quantitative inconsistencies.
Tue 17 MayDisplayed time zone: Eastern Time (US & Canada) change
11:50 - 12:20 | Session 15: Understanding Development Practices and Challenges 2Research / Replications and Negative Results (RENE) at ICPC room Chair(s): Julia Lawall Inria | ||
11:50 7mTalk | Backports: Change Types, Challenges and Strategies Research Debasish Chakroborti University of Saskatchewan, Kevin Schneider University of Saskatchewan, Chanchal K. Roy University of Saskatchewan | ||
11:57 7mTalk | How do I model my system? A Qualitative Study on the Challenges that Modelers Experience Research Pre-print | ||
12:04 7mTalk | Two Approaches to Survival Analysis of Open Source Python Projects Replications and Negative Results (RENE) Derek Robinson University of Victoria, Keanelek Enns University of Victoria, Neha Koulecar University of Victoria, Manish Sihag University of Victoria Media Attached | ||
12:11 9mLive Q&A | Q&A-Paper Session 15 Research |