MSR 2022
Mon 23 - Tue 24 May 2022
co-located with ICSE 2022

The field of Automated Program Repair (APR) has received increasing attention in recent years from both academia and leading IT companies. Its main goal is to repair software bugs automatically, thus significantly reducing the cost of development and maintenance. Recent works use state-of-the-art deep learning models to predict correct patches; training these models on a large amount of data is unavoidable in almost every scenario. Despite this, readily accessible data in the field is very scarce. To contribute to related research, we present FixJS, a dataset containing bug-fixing information from ~2 million commits. The commits were gathered from GitHub and processed locally to obtain both the buggy (before the bug-fixing commit) and fixed (after the fix) versions of the same program. We focused on JavaScript functions, as JavaScript is one of the most popular programming languages globally and functions are first-class objects in it. The data includes more than 300,000 samples of such functions, including commit information, before/after states, and 3 source code representations.

MSR22 presentation (MSR_backup_presentation.mkv), 5.16 MiB

Fri 20 May

Displayed time zone: Eastern Time (US & Canada)

11:00 - 11:50
Session 15: Collaboration & Open Source
Registered Reports / Data and Tool Showcase Track / Technical Papers / Industry Track at MSR Main room - odd hours
Chair(s): Massimiliano Di Penta University of Sannio, Italy, Fiorella Zampetti University of Sannio, Italy
11:00
7m
Talk
Code Review Practices for Refactoring Changes: An Empirical Study on OpenStack
Technical Papers
Eman Abdullah AlOmar Stevens Institute of Technology, Moataz Chouchen ETS, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec
Pre-print
11:07
4m
Talk
FixJS: A Dataset of Bug-fixing JavaScript Commits
Data and Tool Showcase Track
Viktor Csuvik Department of Software Engineering, MTA-SZTE Research Group on Artificial Intelligence, University of Szeged, Szeged, Hungary, László Vidács University of Szeged, Hungary
File Attached
11:11
4m
Talk
A Time Series-Based Dataset of Open-Source Software Evolution
Data and Tool Showcase Track
Bruno L. Sousa UFMG, Mariza Bigonha Professor at Federal University of Minas Gerais, Kecia A. M. Ferreira CEFET-MG, Glaura C. Franco UFMG
DOI Pre-print Media Attached
11:15
4m
Talk
LAGOON: An Analysis Tool for Open Source Communities
Data and Tool Showcase Track
Sourya Dey Galois, Inc., Walt Woods Galois, Inc.
Pre-print Media Attached
11:19
4m
Talk
A Versatile Dataset of Agile Open Source Software Projects
Data and Tool Showcase Track
Vali Tawosi University College London, Afnan Al-Subaihin University College London, Rebecca Moussa University College London, Federica Sarro University College London
Link to publication DOI Pre-print Media Attached
11:23
7m
Talk
Automatically Prioritizing and Assigning Tasks from Code Repositories in Puzzle Driven Development
Industry Track
Ayomide Bakare Innopolis University, Yegor Bugayenko Huawei, Arina Cheverda Innopolis University, Mirko Farina Innopolis University, Artem Kruglov Innopolis University, Witold Pedrycz University of Alberta, Giancarlo Succi Innopolis University
11:30
4m
Talk
Towards Understanding Barriers and Mitigation Strategies of Software Engineers with Non-traditional Educational and Occupational Backgrounds
Registered Reports
Tavian Barnes University of Waterloo, Ken Jen Lee University of Waterloo, Cristina Tavares University of Waterloo, Gema Rodríguez-Pérez University of British Columbia (UBC), Mei Nagappan University of Waterloo
Pre-print
11:34
4m
Talk
Can instability variations warn developers when open-source projects boost?
Registered Reports
Alejandro Valezate Rey Juan Carlos University, Rafael Capilla Universidad Rey Juan Carlos, Gregorio Robles Universidad Rey Juan Carlos, Victor Salamanca Rey Juan Carlos University
Pre-print
11:38
12m
Live Q&A
Discussions and Q&A
Technical Papers

