Write a Blog >>
MSR 2022
Mon 23 - Tue 24 May 2022
co-located with ICSE 2022

Software evolution is the process of developing, maintaining, and updating software systems. It is known that the software systems tend to increase their complexity and size over their evolution to meet the demands required by the users. Due to this fact, researchers have increasingly carried out studies on software evolution to understand the systems’ evolution pattern and propose techniques to overcome inherent problems in software evolution. Many of these works collect data but do not make them publicly available. Many datasets on software evolution are outdated, and/or are small, and some of them do not provide time series from software metrics. We propose an extensive software evolution dataset with temporal information about open-source Java systems. To build this dataset, we proposed a methodology of four steps: selecting the systems using a criterion, extracting and measuring their releases, and generating their time series. Our dataset contains time series of 46 software metrics extracted from 46 open-source Java systems, and we make it publicly available.

Fri 20 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 11:50
Session 15: Collaboration & Open SourceRegistered Reports / Data and Tool Showcase Track / Technical Papers / Industry Track at MSR Main room - odd hours
Chair(s): Massimiliano Di Penta University of Sannio, Italy, Fiorella Zampetti University of Sannio, Italy
11:00
7m
Talk
Code Review Practices for Refactoring Changes: An Empirical Study on OpenStack
Technical Papers
Eman Abdullah AlOmar Stevens Institute of Technology, Moataz Chouchen ETS, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec
Pre-print
11:07
4m
Talk
FixJS: A Dataset of Bug-fixing JavaScript Commits
Data and Tool Showcase Track
Viktor Csuvik Department of Software Engineering, MTA-SZTE Research Group on Artificial Intelligence, University of Szeged, Szeged, Hungary, László Vidács University of Szeged, Hungary
File Attached
11:11
4m
Talk
A Time Series-Based Dataset of Open-Source Software Evolution
Data and Tool Showcase Track
Bruno L. Sousa UFMG, Mariza Bigonha Professor at Federal University of Minas Gerais, Kecia A. M. Ferreira CEFET-MG, Glaura C. Franco UFMG
DOI Pre-print Media Attached
11:15
4m
Talk
LAGOON: An Analysis Tool for Open Source Communities
Data and Tool Showcase Track
Sourya Dey Galois, Inc., Walt Woods Galois, Inc.
Pre-print Media Attached
11:19
4m
Talk
A Versatile Dataset of Agile Open Source Software Projects
Data and Tool Showcase Track
Vali Tawosi University College London, Afnan Al-Subaihin University College London, Rebecca Moussa University College London, Federica Sarro University College London
Link to publication DOI Pre-print Media Attached
11:23
7m
Talk
Automatically Prioritizing and Assigning Tasks from Code Repositories in Puzzle Driven Development
Industry Track
Ayomide Bakare Innopolis University, Yegor Bugayenko Huawei, Arina Cheverda Innopolis University, Mirko Farina Innopolis University, Artem Kruglov Innopolis University, Witold Pedrycz University of Alberta, Giancarlo Succi Innopolis University
11:30
4m
Talk
Towards Understanding Barriers and Mitigation Strategies of Software Engineers with Non-traditional Educational and Occupational Backgrounds
Registered Reports
Tavian Barnes University of Waterloo, Ken Jen Lee University of Waterloo, Cristina Tavares University of Waterloo, Gema Rodríguez-Pérez University of British Columbia (UBC), Mei Nagappan University of Waterloo
Pre-print
11:34
4m
Talk
Can instability variations warn developers when open-source projects boost?
Registered Reports
Alejandro Valezate Rey Juan Carlos University, Rafael Capilla Universidad Rey Juan Carlos, Gregorio Robles Universidad Rey Juan Carlos, Victor Salamanca Rey Juan Carlos University
Pre-print
11:38
12m
Live Q&A
Discussions and Q&A
Technical Papers


Information for Participants