From Reinvention to Reuse: An Empirical Example Study On Technical Debt Dataset
This program is tentative and subject to change.
Self-Admitted Technical Debt (SATD) is a subset of Technical Debt (TD), where the developer leaves a comment on the source, thus marking the place where debt has been taken. Previous research on SATD relies on either the creation of new datasets or the reuse of existing ones. One seminal SATD dataset containing over 4,000 SATD comments and their classification into five different TD categories was published by Maldonado et al. The drawback of the dataset is its lack of any other information, e.g. static analysis, seriously limiting its possible use cases. We remedy this situation by reforming the dataset. We combine the original comments with contextual information and static analysis from the source codes and recreate the dataset as an SQLite database. Our reformed dataset contains over 13,000 files, nearly 14,000 classes, almost 100,000 methods, and over 650,000 code violation instances. The reformed dataset allows varied and detailed analyses in the future, which we demonstrate by examining the relationship of SATD comments to code violations. The results show that on the method level, the most important predictors are the number of code violations in total as well as the number of violations labelled as Priority 3 or belonging to the Documentation Rule Set. On the file level, LOC is an important predictor alongside the number of violations from the Documentation Rule Set or having a Priority 2 classification. Overall, our example study demonstrates the potential of what reforming existing datasets can have.
This program is tentative and subject to change.
Tue 3 DecDisplayed time zone: Athens change
16:00 - 17:30 | |||
16:00 18mResearch paper | Defining Security Debt: a case study based on practice Research Papers Maren Maritsdatter Kruke Visma software international AS, Antonio Martini University of Oslo, Norway, Daniela S. Cruzes NTNU, Monica Iovan Visma | ||
16:18 18mResearch paper | From Reinvention to Reuse: An Empirical Example Study On Technical Debt Dataset Research Papers Leevi Rantala University of Oulu, Mika Mäntylä University of Helsinki and University of Oulu, Murali Sridharan | ||
16:36 18mIndustry talk | An Automated Approach to Identify Source Code Files Affected by Architectural Technical Debt Industry Papers Armando Soares Sousa , Lincoln Rocha Federal University of Ceará, Ricardo Britto Ericsson / Blekinge Institute of Technology, Guilherme Amaral Avelino Federal University of Piaui | ||
16:54 36mTalk | Session 6 Discussion Research Papers |