Write a Blog >>
MSR 2019
Sun 26 - Mon 27 May 2019 Montreal, QC, Canada
co-located with ICSE 2019

The Mining Software Repositories (MSR) field analyzes the rich data associated with software production and operations to uncover interesting and actionable information about software systems and projects. Software repositories such as source control systems, defect tracking systems, code review repositories, archived communications between project personnel, question-and-answer sites, and CI build servers are used to help manage the progress of software projects. Software practitioners and researchers are recognizing the benefits of mining this information to support the maintenance of software systems, improve software design/reuse, and empirically validate novel ideas and techniques. As such, research is now proceeding to uncover the ways in which mining these repositories can help to understand software development and software evolution, to support predictions about software development, and to exploit this knowledge in planning future development.

The goal of this two-day international conference is to advance the science and practice of software engineering via the analysis of data stored in software repositories. The 16th International Conference on Mining Software Repositories is co-located with ICSE 2019 in sunny Montréal, QC, Canada and will be held on May 26-27, 2019.

The important dates for the Technical Papers are:

  • Abstract deadline: January 15, 2019, 23:59 AoE
  • Papers deadline: January 22, 2019, 23:59 AoE
  • Grace period for paper updates: January 23-24, 2019, 23:59 AoE
  • Author Notification: March 1, 2019, 23:59 AoE
  • Camera Ready: March 15, 2019, 23:59 AoE

Please see the Call for Papers for all details.

Dates
Tracks
You're viewing the program in a time zone which is different from your device's time zone change time zone

Sun 26 May

Displayed time zone: Eastern Time (US & Canada) change

09:05 - 10:30
Keynote + Q&A + DiscussionMSR 2019 Keynote at Place du Canada
09:05
45m
Talk
Keynote: We Won! Now What?
MSR 2019 Keynote
A: Robert DeLine Microsoft Research
09:50
10m
Q&A for Keynote
MSR 2019 Keynote

10:00
30m
Discussion: Ethical MSR
MSR 2019 Keynote
Thomas Zimmermann Microsoft Research, Alexander Serebrenik Eindhoven University of Technology
11:00 - 11:45
Session II: Defect Prediction and Testing (Part 1)MSR 2019 Technical Papers at Centre-Ville
Chair(s): Patanamon Thongtanunam The University of Melbourne
11:00
15m
Full-paper
DeepJIT: An End-To-End Deep LearningFramework for Just-In-Time Defect Prediction
MSR 2019 Technical Papers
Thong Hoang Singapore Management University, Singapore, Hoa Khanh Dam University of Wollongong, Yasutaka Kamei Kyushu University, David Lo Singapore Management University, Naoyasu Ubayashi Kyushu University
11:16
15m
Full-paper
Lessons learned from using a deep tree-based model for software defect prediction in practice
MSR 2019 Technical Papers
Hoa Khanh Dam University of Wollongong, Trang Pham Deakin University, Shien Wee Ng University of Wollongong, Truyen Tran , John Grundy Monash University, Aditya Ghose , Taeksu Kim , Chul-Joo Kim
11:32
6m
Short-paper
Empirical study in using version histories for change risk classification
MSR 2019 Technical Papers
11:39
6m
Short-paper
Snoring: a Noise in Defect Prediction Datasets
MSR 2019 Technical Papers
Aalok Ahluwalia , Davide Falessi California Polytechnic State University, Massimiliano Di Penta University of Sannio
11:00 - 11:45
Session I: Representations for Mining (Part 1)MSR 2019 Technical Papers / MSR 2019 Data Showcase at Place du Canada
Chair(s): Chanchal K. Roy University of Saskatchewan
11:00
15m
Full-paper
SCOR: Source Code Retrieval With Semantics and Order
MSR 2019 Technical Papers
Pre-print Media Attached
11:16
6m
Short-paper
PathMiner : A Library for Mining of Path-Based Representations of Code
MSR 2019 Technical Papers
Vladimir Kovalenko TU Delft, Egor Bogomolov Higher School of Economics, JetBrains Research, Timofey Bryksin , Alberto Bacchelli University of Zurich
DOI Pre-print Media Attached
11:23
15m
Full-paper
Import2vec: learning embeddings for software libraries
MSR 2019 Technical Papers
Bart Theeten Nokia Bell Labs, Belgium, Frederik Vandeputte , Tom Van Cutsem Nokia Bell Labs
Pre-print
11:39
6m
Talk
Semantic Source Code Models Using Identifier Embeddings
MSR 2019 Data Showcase
Vasiliki Efstathiou Athens University of Economics and Business, Diomidis Spinellis Athens University of Economics and Business
Pre-print
11:55 - 12:30
Session IV: Defect Prediction and Testing (Part 2)MSR 2019 Technical Papers / MSR 2019 Data Showcase at Centre-Ville
Chair(s): Jesus M. Gonzalez-Barahona Universidad Rey Juan Carlos
11:55
6m
Talk
A Dataset of Non-Functional Bugs
MSR 2019 Data Showcase
Aida Radu , Sarah Nadi University of Alberta
Pre-print
12:01
6m
Short-paper
Does UML Modeling Associate with Higher Software Quality in Open-Source Software?
MSR 2019 Technical Papers
Adithya Raghuraman , Truong Ho-Quang , Michel Chaudron Chalmers University of Technology, Alexander Serebrenik Eindhoven University of Technology, Bogdan Vasilescu Carnegie Mellon University
Pre-print
12:07
6m
Short-paper
STRAIT: A Tool for Automated Software Reliability Growth Analysis
MSR 2019 Technical Papers
Stanislav Chren Masaryk University, Radoslav Micko , Barbora Buhnova Masaryk University, Bruno Rossi Masaryk University
Pre-print
12:13
6m
Talk
A Data Set of Program Invariants and Error Paths
MSR 2019 Data Showcase
Dirk Beyer LMU Munich
DOI Pre-print Media Attached
12:19
6m
Short-paper
Test Coverage in Python Programs
MSR 2019 Technical Papers
Hongyu Zhai , Casey Casalnuovo University of California at Davis, USA, Prem Devanbu University of California
12:25
6m
Short-paper
On the Effectiveness of Manual and Automatic Unit Test Generation: Ten Years Later
MSR 2019 Technical Papers
Domenico Serra , Giovanni Grano University of Zurich, Fabio Palomba , Filomena Ferrucci University of Salerno, Harald Gall University of Zurich, Alberto Bacchelli University of Zurich
DOI Pre-print Media Attached
11:55 - 12:30
Session III: Representations for Mining (Part 2)MSR 2019 Technical Papers / MSR 2019 Data Showcase at Place du Canada
Chair(s): Nicole Novielli University of Bari
11:55
15m
Full-paper
Exploring Word Embedding Techniques to Improve Sentiment Analysis of Software Engineering Texts
MSR 2019 Technical Papers
Eeshita Biswas , K. Vijay-Shanker , Lori Pollock University of Delaware, USA
Pre-print
12:10
6m
Talk
Cleaning StackOverflow for Machine Translation
MSR 2019 Data Showcase
Musfiqur Rahman Concordia University, Montreal, Canada, Peter Rigby Concordia University, Montreal, Canada, Dharani Palani Concordia University, Tien N. Nguyen University of Texas at Dallas
12:16
15m
Full-paper
Predicting Good Configurations for GitHub and Stack Overflow Topic Models
MSR 2019 Technical Papers
Christoph Treude The University of Adelaide, Markus Wagner
Pre-print
13:50 - 14:35
Discussion: Data vs. Theory-driven ResearchMSR 2019 Paper Presentations at Place du Canada
Chair(s): Michael W. Godfrey University of Waterloo, Canada, Andy Zaidman TU Delft
14:45 - 15:30
Session VI: Energy and EconomicsMSR 2019 Data Showcase / MSR 2019 Technical Papers at Centre-Ville
Chair(s): Maleknaz Nayebi Polytechnique Montréal
14:45
15m
Full-paper
Recommending Energy-Efficient Java Collections
MSR 2019 Technical Papers
Pre-print
15:01
6m
Talk
GreenHub Farmer: Real-world data for Android Energy Mining
MSR 2019 Data Showcase
Rui Pereira HASLab/INESC TEC & Universidade do Minho & Universidade da Beira Interior, Marco Couto HASLab/INESC TEC & Universidade do Minho, João Paulo Fernandes Release/LISP, CISUC, Bruno Cabral , Hugo Matalonga University of Minho, Simão Melo de Sousa , Fernando Castor Federal University of Pernambuco (UFPE)
Pre-print
15:08
6m
Talk
GreenSource: a large-scale collection of Android code, tests and energy metrics
MSR 2019 Data Showcase
Rui Rua HASLab/INESC TEC & Universidade do Minho, Marco Couto HASLab/INESC TEC & Universidade do Minho, João Saraiva University of Minho, Portugal
15:15
6m
Short-paper
Striking Gold in Software Repositories? An Econometric Study of Cryptocurrencies on GitHub
MSR 2019 Technical Papers
Asher Trockman University of Evansville, Rijnard van Tonder Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University
Pre-print
15:22
6m
Talk
Panel Data of Cryptocurrency Development Activity on GitHub
MSR 2019 Data Showcase
Rijnard van Tonder Carnegie Mellon University, Asher Trockman University of Evansville, Claire Le Goues Carnegie Mellon University
14:45 - 15:30
Session V: Large-Scale MiningMSR 2019 Technical Papers / MSR 2019 Data Showcase at Place du Canada
Chair(s): Robert Dyer Bowling Green State University
14:45
15m
Full-paper
Time Present and Time Past: Analyzing the Evolution of JavaScript Code in the Wild
MSR 2019 Technical Papers
Dimitris Mitropoulos , Panos Louridas , Vitalis Salis , Diomidis Spinellis Athens University of Economics and Business
Pre-print
15:01
6m
Talk
The Software Heritage Graph Dataset: public software development under one roof
MSR 2019 Data Showcase
Antoine Pietri Inria, Diomidis Spinellis Athens University of Economics and Business, Stefano Zacchiroli University Paris Diderot and Inria, France
Pre-print
15:08
15m
Full-paper
World of Code: An Infrastructure for Mining the Universe of Open Source VCS Data
MSR 2019 Technical Papers
Yuxing Ma , Christopher Bogart Carnegie Mellon University, Sadika Amreen , Russell Zaretzki , Audris Mockus University of Tennessee - Knoxville
15:24
6m
Short-paper
Crossflow: A Framework for Distributed Mining of Software Repositories
MSR 2019 Technical Papers
Dimitris Kolovos University of York, Patrick Neubauer University of York, UK, Konstantinos Barmpis , Nicholas Matragkas , Richard Paige McMaster University
Pre-print
16:00 - 18:00
Mining Challenge presentationsMSR 2019 Mining Challenge at Place du Canada
16:00
10m
Talk
SOTorrent: Studying the Origin, Evolution, and Usage of Stack Overflow Code Snippets
MSR 2019 Mining Challenge
A: Sebastian Baltes University of Trier, A: Christoph Treude The University of Adelaide, A: Stephan Diehl Computer Science, University Trier, Germany
Pre-print
16:10
7m
Talk
Mining Rule Violations in JavaScript Code Snippets
MSR 2019 Mining Challenge
Pre-print
16:17
7m
Talk
Snakes in Paradise?: Insecure Python-related Coding Practices in Stack Overflow
MSR 2019 Mining Challenge
Akond Rahman North Carolina State University, Effat Farhana , Nasif Imtiaz North Carolina State University
Pre-print
16:24
7m
Talk
Man vs Machine -- A Study into language identification of Stackoverflow code snippets
MSR 2019 Mining Challenge
Jens Dietrich Victoria University of Wellington, Markus Luczak-Roesch , Elroy Dalefield
Pre-print
16:31
7m
Talk
Python Coding Style Compliance on Stack Overflow
MSR 2019 Mining Challenge
Nikolaos Bafatakis , Niels Boecker , Wenjie Boon , Martin Cabello Salazar , Jens Krinke University College London, Gazi Oznacar , Robert White University College London, UK
Pre-print Media Attached
16:38
7m
Talk
Towards Mining Answer Edits to Extract Evolution Patterns in Stack Overflow
MSR 2019 Mining Challenge
Themistoklis Diamantopoulos Electrical and Computer Engineering Dept, Aristotle University of Thessaloniki, Maria-Ioanna Sifaki Electrical and Computer Engineering Dept, Aristotle University of Thessaloniki, Andreas Symeonidis Aristotle University of Thessaloniki
Pre-print Media Attached
16:45
7m
Talk
Analyzing Comment-induced Updates on Stack Overflow
MSR 2019 Mining Challenge
Abhishek Soni , Sarah Nadi University of Alberta
Pre-print
16:52
7m
Talk
What Edits Are Done on Highly Answered Stack Overflow Questions? An Empirical Study
MSR 2019 Mining Challenge
Xianhao Jin Virginia Tech, USA, Francisco Servant Virginia Tech
Pre-print
16:59
7m
Talk
Can Duplicate Posts on Stack Overflow Benefit the Software Development Community?
MSR 2019 Mining Challenge
Durham Abric McGill University, Oliver Clark , Matthew Caminiti , Keheliya Gallaba McGill University, Shane McIntosh McGill University
Pre-print
17:06
7m
Talk
How Often and What StackOverflow Posts Do Developers Reference in Their GitHub Projects?
MSR 2019 Mining Challenge
Saraj Singh Manes , Olga Baysal Carleton University
Pre-print
17:13
7m
Talk
Characterizing Duplicate Code Snippets between Stack Overflow and Tutorials
MSR 2019 Mining Challenge
Manziba Nishi , Agnieszka Ciborowska , Kostadin Damevski Virginia Commonwealth University
Pre-print
17:20
7m
Talk
Challenges with Responding to Static Analysis Tool Alerts
MSR 2019 Mining Challenge
Nasif Imtiaz North Carolina State University, Akond Rahman North Carolina State University, Effat Farhana , Laurie Williams North Carolina State University
Pre-print
17:27
7m
Talk
Impact of stack overflow code snippets on software cohesion: a preliminary study
MSR 2019 Mining Challenge
DOI Pre-print
17:34
7m
Talk
We Need to Talk about Microservices: an Analysis from the Discussions on StackOverflow
MSR 2019 Mining Challenge
Alan Bandeira , Carlos Filho , Matheus Paixao State University of Ceara, Brazil, Paulo Maia State University of Ceará
Pre-print Media Attached
17:41
7m
Talk
What do developers know about machine learning: a study of ML discussions on StackOverflow
MSR 2019 Mining Challenge
Hareem-e-Sahar , Abdul Ali Bangash University of Alberta, Canada, Alexander William Wong , Shaiful Chowdhury University of Alberta, Abram Hindle University of Alberta, Karim Ali University of Alberta
17:48
12m
Recap + voting
MSR 2019 Mining Challenge

Mon 27 May

Displayed time zone: Eastern Time (US & Canada) change

08:45 - 09:30
Session II: Automatic SummarizationMSR 2019 Technical Papers at Centre-Ville
Chair(s): Xin Xia Monash University
08:45
15m
Full-paper
Generating Commit Messages from Diffs using Pointer-generator Network
MSR 2019 Technical Papers
Qin Liu , Zihe Liu School of Software Engineering, Tongji University, Shanghai, China, Hongming Zhu , Hongfei Fan , Bowen Du , Yu Qian
09:00
15m
Full-paper
Automatically Generating Documentation for Lambda Expressions in Java
MSR 2019 Technical Papers
Anwar Alqaimi , Patanamon Thongtanunam The University of Melbourne, Christoph Treude The University of Adelaide
Pre-print
09:15
15m
Full-paper
Extracting API Tips from Developer Question and Answer Websites
MSR 2019 Technical Papers
Shaohua Wang New Jersey Institute of Technology, USA, Nhathai Phan , Yan Wang , Yong Zhao
08:45 - 09:30
Session I: APIs & Dependencies (Part 1)MSR 2019 Technical Papers at Place du Canada
Chair(s): Philipp Leitner Chalmers University of Technology & University of Gothenburg
08:45
15m
Full-paper
Investigating Next-Steps in Static API-Misuse Detection
MSR 2019 Technical Papers
Sven Amann CQSE GmbH, Hoan Nguyen Iowa State University, Sarah Nadi University of Alberta, Tien N. Nguyen University of Texas at Dallas, Mira Mezini TU Darmstadt, Germany
Pre-print
09:00
15m
Full-paper
Identifying Experts in Software Libraries and Frameworks among GitHub Users
MSR 2019 Technical Papers
João Eduardo Montandon Universidade Federal de Minas Gerais (UFMG), Luciana L. Silva , Marco Tulio Valente Federal University of Minas Gerais, Brazil
Pre-print
09:15
15m
Full-paper
Data-Driven Solutions to Detect API Compatibility Issues in Android: An Empirical Study
MSR 2019 Technical Papers
Simone Scalabrino University of Molise, Gabriele Bavota Università della Svizzera italiana (USI), Mario Linares-Vasquez Universidad de los Andes, Michele Lanza Universita della Svizzera italiana (USI), Rocco Oliveto University of Molise
09:40 - 10:30
Session IV: SecurityMSR 2019 Data Showcase / MSR 2019 Technical Papers at Centre-Ville
Chair(s): Sarah Nadi University of Alberta
09:40
15m
Full-paper
Automated Software Vulnerability Assessment with Concept Drift
MSR 2019 Technical Papers
Triet Le Huynh Minh The University of Adelaide, Bushra Sabir , Muhammad Ali Babar
09:55
6m
Talk
A Manually-Curated Dataset of Fixes to Vulnerabilities of Open-Source Software
MSR 2019 Data Showcase
10:01
15m
Full-paper
Negative Results on Mining Crypto-API Usage Rules in Android Apps
MSR 2019 Technical Papers
Jun Gao University of Luxembourg, SnT, Pingfan Kong Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Li Li Monash University, Australia, Tegawendé F. Bissyandé SnT, University of Luxembourg, Jacques Klein University of Luxembourg, SnT
10:16
6m
Talk
A Dataset of Parametric Cryptographic Misuses
MSR 2019 Data Showcase
Anna-Katharina Wickert TU Darmstadt, Germany, Michael Reif TU Darmstadt, Germany, Michael Eichberg TU Darmstadt, Germany, Anam Dodhy , Mira Mezini TU Darmstadt, Germany
Link to publication DOI Pre-print Media Attached
10:22
6m
Talk
RmvDroid: Towards A Reliable Android Malware Dataset with App Metadata
MSR 2019 Data Showcase
Haoyu Wang Beijing University of Posts and Telecommunications, China, Junjun Si , Hao Li , Yao Guo Peking University
09:40 - 10:30
Session III: APIs & Dependencies (Part 2)MSR 2019 Data Showcase / MSR 2019 Technical Papers at Place du Canada
Chair(s): Georgios Gousios TU Delft
09:40
6m
Talk
The Maven Dependency Graph: a Temporal Graph-based Representation of Maven Centra
MSR 2019 Data Showcase
Amine Benelallam , Nicolas Harrand , César Soto-Valero KTH Royal Institute of Technology, Benoit Baudry KTH Royal Institute of Technology, Sweden, Olivier Barais
Pre-print
09:46
15m
Full-paper
The Emergence of Software Diversity in Maven Central
MSR 2019 Technical Papers
César Soto-Valero KTH Royal Institute of Technology, Amine Benelallam , Nicolas Harrand , Olivier Barais , Benoit Baudry KTH Royal Institute of Technology, Sweden
Pre-print
10:01
15m
Full-paper
Dependency Versioning in the Wild
MSR 2019 Technical Papers
Jens Dietrich Victoria University of Wellington, David J. Pearce Victoria University of Wellington, New Zealand, Jacob Stringer , Amjed Tahir Massey University, Kelly Blincoe University of Auckland
Pre-print
10:16
15m
Full-paper
Splitting APIs: An Exploratory Study of Software Unbundling
MSR 2019 Technical Papers
11:00 - 11:45
Session VI: Software Quality (part 1)MSR 2019 Technical Papers at Centre-Ville
Chair(s): Fabio Palomba University of Zurich
11:00
15m
Full-paper
The Rise of Android Code Smells: Who Is to Blame?
MSR 2019 Technical Papers
Sarra Habchi University of Lille, Romain Rouvoy University Lille 1 and INRIA, Naouel Moha University of Montreal
11:15
15m
Full-paper
Assessing Diffusion and Perception of Test Smells in Scala Projects
MSR 2019 Technical Papers
Jonas De Bleser Sofware Languages Lab, Vrije Universiteit Brussel, Dario Di Nucci Vrije Universiteit Brussel, Coen De Roover Vrije Universiteit Brussel
Pre-print
11:30
15m
Full-paper
style-analyzer: fixing code style inconsistencies with interpretable unsupervised algorithms
MSR 2019 Technical Papers
Pre-print
11:00 - 11:45
Session V: Collaboration & Communication (Part 1)MSR 2019 Technical Papers at Place du Canada
Chair(s): Peter Rigby Concordia University, Montreal, Canada
11:00
15m
Full-paper
An Empirical Study of Multiple Names and Email Addresses in OSS Version Control Repositories
MSR 2019 Technical Papers
Jiaxin Zhu Institute of Software at Chinese Academy of Sciences, China, Jun Wei Institute of Software, Chinese Academy of Sciences, China
11:15
15m
Full-paper
Characterizing the Roles of Contributors in Open-source Scientific Software Projects
MSR 2019 Technical Papers
Reed Milewicz Sandia National Laboratories, Gustavo Pinto UFPA, Paige Rodeghero University of Notre Dame
Pre-print
11:30
15m
Full-paper
git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories
MSR 2019 Technical Papers
Christoph Gote Chair of Systems Design, ETH Zurich, Ingo Scholtes , Frank Schweitzer
DOI Pre-print
11:55 - 12:30
Session VIII: Software Quality (part 2)MSR 2019 Technical Papers / MSR 2019 Data Showcase at Centre-Ville
Chair(s): Yasutaka Kamei Kyushu University
11:55
15m
Full-paper
A Large-scale Study about Quality and Reproducibility of Jupyter Notebooks
MSR 2019 Technical Papers
João Felipe Pimentel , Leonardo Murta Universidade Federal Fluminense (UFF), Vanessa Braganholo , Juliana Freire
Pre-print
12:10
15m
Full-paper
Cross-language clone detection by learning over abstract syntax trees
MSR 2019 Technical Papers
Daniel Perez Imperial College London, Shigeru Chiba University of Tokyo, Japan
Pre-print
12:25
6m
Talk
SeSaMe: A Data Set of Semantically Similar Java Methods
MSR 2019 Data Showcase
Marius Kamp , Patrick Kreutzer , Michael Philippsen Friedrich-Alexander University Erlangen-Nürnberg (FAU)
11:55 - 12:30
Session VII: Collaboration & Communication (Part 2)MSR 2019 Technical Papers at Place du Canada
Chair(s): Kelly Blincoe University of Auckland
11:55
15m
Full-paper
Can Issues Reported at Stack Overflow Questions be Reproduced? An Exploratory Study
MSR 2019 Technical Papers
Saikat Mondal University of Saskatchewan, Masud Rahman University of Saskatchewan , Chanchal K. Roy University of Saskatchewan
Pre-print
12:10
15m
Full-paper
Exploratory Study of Slack Q&A Chats as a Mining Source for Software Engineering Tools
MSR 2019 Technical Papers
Preetha Chatterjee University of Delaware, USA, Kostadin Damevski Virginia Commonwealth University, Lori Pollock University of Delaware, USA, Vinay Augustine , Nicholas A. Kraft ABB Corporate Research
Pre-print
12:25
6m
Short-paper
Impacts of Daylight Saving Time on Software Development
MSR 2019 Technical Papers
Junichi Hayashi Osaka University, Yoshiki Higo Osaka University, Shinsuke Matsumoto Osaka University, Shinji Kusumoto Osaka University
Pre-print
13:50 - 14:35
13:50
45m
Tutorial
Qualitative Data Analysis in Software Engineering: A Hands-on Tutorial
MSR 2019 Education
A: Christoph Treude The University of Adelaide
13:50 - 14:35
Discussion: SE for AI for SEMSR 2019 Paper Presentations at Place du Canada
Chair(s): Neil Ernst University of Victoria, Tim Menzies North Carolina State University
14:45 - 15:30
Session X: Building on DataMSR 2019 Data Showcase / MSR 2019 Technical Papers at Centre-Ville
Chair(s): Cor-Paul Bezemer University of Alberta, Canada
14:45
15m
Full-paper
Standing on Shoulders or Feet? The Usage of the MSR Data Papers
MSR 2019 Technical Papers
Zoe Kotti Athens University of Economics and Business, Diomidis Spinellis Athens University of Economics and Business
Pre-print
15:00
6m
Talk
Boa Meets Python: A Boa Dataset of Data Science Software in Python Language
MSR 2019 Data Showcase
Sumon Biswas Iowa State University, Md Johirul Islam Iowa State University, Yijia Huang , Hridesh Rajan Iowa State University
Pre-print Media Attached
15:06
6m
Talk
A Benchmark of Data Loss Bugs for Android Apps
MSR 2019 Data Showcase
Oliviero Riganelli , Marco Mobilio , Daniela Micucci University of Milano-Bicocca, Italy, Leonardo Mariani University of Milano Bicocca
15:12
6m
Talk
RapidRelease - A Dataset of Projects and Issues on GitHub with Rapid Release
MSR 2019 Data Showcase
Saket Joshi Indian Institute of Technology Tirupati, Sridhar Chimalakonda Indian Institute of Technology Tirupati
15:18
6m
Short-paper
A Tool to Analyze Packages in Software Containers
MSR 2019 Technical Papers
Ahmed Zerouali UMONS, Valerio Cosentino Bitergia, Jesus M. Gonzalez-Barahona Universidad Rey Juan Carlos, Gregorio Robles Universidad Rey Juan Carlos, Tom Mens University of Mons
Pre-print
15:24
6m
Talk
An Empirical History of Permission Requests and Mistakes in Open Source Android Apps
MSR 2019 Data Showcase
Gian Luca Scoccia , Anthony Peruma Rochester Institute of Technology, Virginia Pujols , Ben Christians , Daniel Krutz Rochester Institute of Technology
14:45 - 15:30
Session IX: TraceabilityMSR 2019 Technical Papers at Place du Canada
Chair(s): Francisco Servant Virginia Tech
14:45
15m
Full-paper
Predicting Co-Changes between Functionality Specifications and Source Code in Behavior Driven Development
MSR 2019 Technical Papers
Aidan Z.H. Yang Queen's University, Canada, Daniel Alencar Da Costa Queen's University, Kingston, Ontario, Ying Zou Queen's University, Kingston, Ontario
15:01
6m
Short-paper
Tracing Back Log Data to its Log Statement: From Research to Practice
MSR 2019 Technical Papers
Daan Schipper , Maurício Aniche Delft University of Technology, Netherlands, Arie van Deursen Delft University of Technology
Pre-print
15:08
6m
Short-paper
Beyond GumTree: A hybrid approach to generate edit scripts
MSR 2019 Technical Papers
Junnosuke Matsumoto , Yoshiki Higo Osaka University, Shinji Kusumoto Osaka University
Pre-print
15:15
6m
Short-paper
The Impact of Systematic Edits in History Slicing
MSR 2019 Technical Papers
Ryosuke Funaki , Shinpei Hayashi Tokyo Institute of Technology, Motoshi Saeki Tokyo Institute of Technology
Pre-print
15:22
6m
Short-paper
Scalable Software Merging Studies with MERGANSER
MSR 2019 Technical Papers
Moein Owhadi-Kareshk , Sarah Nadi University of Alberta

Not scheduled yet

Not scheduled yet
Break
Break
MSR 2019 Technical Papers

Not scheduled yet
Break
mini-break
MSR 2019 Technical Papers

Accepted Papers

Title
A Large-scale Study about Quality and Reproducibility of Jupyter Notebooks
MSR 2019 Technical Papers
Pre-print
An Empirical Study of Multiple Names and Email Addresses in OSS Version Control Repositories
MSR 2019 Technical Papers
Assessing Diffusion and Perception of Test Smells in Scala Projects
MSR 2019 Technical Papers
Pre-print
A Tool to Analyze Packages in Software Containers
MSR 2019 Technical Papers
Pre-print
Automated Software Vulnerability Assessment with Concept Drift
MSR 2019 Technical Papers
Automatically Generating Documentation for Lambda Expressions in Java
MSR 2019 Technical Papers
Pre-print
Beyond GumTree: A hybrid approach to generate edit scripts
MSR 2019 Technical Papers
Pre-print
Can Issues Reported at Stack Overflow Questions be Reproduced? An Exploratory Study
MSR 2019 Technical Papers
Pre-print
Characterizing the Roles of Contributors in Open-source Scientific Software Projects
MSR 2019 Technical Papers
Pre-print
Crossflow: A Framework for Distributed Mining of Software Repositories
MSR 2019 Technical Papers
Pre-print
Cross-language clone detection by learning over abstract syntax trees
MSR 2019 Technical Papers
Pre-print
Data-Driven Solutions to Detect API Compatibility Issues in Android: An Empirical Study
MSR 2019 Technical Papers
DeepJIT: An End-To-End Deep LearningFramework for Just-In-Time Defect Prediction
MSR 2019 Technical Papers
Dependency Versioning in the Wild
MSR 2019 Technical Papers
Pre-print
Does UML Modeling Associate with Higher Software Quality in Open-Source Software?
MSR 2019 Technical Papers
Pre-print
Empirical study in using version histories for change risk classification
MSR 2019 Technical Papers
Exploratory Study of Slack Q&A Chats as a Mining Source for Software Engineering Tools
MSR 2019 Technical Papers
Pre-print
Exploring Word Embedding Techniques to Improve Sentiment Analysis of Software Engineering Texts
MSR 2019 Technical Papers
Pre-print
Extracting API Tips from Developer Question and Answer Websites
MSR 2019 Technical Papers
Generating Commit Messages from Diffs using Pointer-generator Network
MSR 2019 Technical Papers
git2net - Mining Time-Stamped Co-Editing Networks from Large git Repositories
MSR 2019 Technical Papers
DOI Pre-print
Identifying Experts in Software Libraries and Frameworks among GitHub Users
MSR 2019 Technical Papers
Pre-print
Impacts of Daylight Saving Time on Software Development
MSR 2019 Technical Papers
Pre-print
Import2vec: learning embeddings for software libraries
MSR 2019 Technical Papers
Pre-print
Investigating Next-Steps in Static API-Misuse Detection
MSR 2019 Technical Papers
Pre-print
Lessons learned from using a deep tree-based model for software defect prediction in practice
MSR 2019 Technical Papers
mini-break
MSR 2019 Technical Papers

Negative Results on Mining Crypto-API Usage Rules in Android Apps
MSR 2019 Technical Papers
On the Effectiveness of Manual and Automatic Unit Test Generation: Ten Years Later
MSR 2019 Technical Papers
DOI Pre-print Media Attached
PathMiner : A Library for Mining of Path-Based Representations of Code
MSR 2019 Technical Papers
DOI Pre-print Media Attached
Predicting Co-Changes between Functionality Specifications and Source Code in Behavior Driven Development
MSR 2019 Technical Papers
Predicting Good Configurations for GitHub and Stack Overflow Topic Models
MSR 2019 Technical Papers
Pre-print
Recommending Energy-Efficient Java Collections
MSR 2019 Technical Papers
Pre-print
Scalable Software Merging Studies with MERGANSER
MSR 2019 Technical Papers
SCOR: Source Code Retrieval With Semantics and Order
MSR 2019 Technical Papers
Pre-print Media Attached
Snoring: a Noise in Defect Prediction Datasets
MSR 2019 Technical Papers
Splitting APIs: An Exploratory Study of Software Unbundling
MSR 2019 Technical Papers
Standing on Shoulders or Feet? The Usage of the MSR Data Papers
MSR 2019 Technical Papers
Pre-print
STRAIT: A Tool for Automated Software Reliability Growth Analysis
MSR 2019 Technical Papers
Pre-print
Striking Gold in Software Repositories? An Econometric Study of Cryptocurrencies on GitHub
MSR 2019 Technical Papers
Pre-print
style-analyzer: fixing code style inconsistencies with interpretable unsupervised algorithms
MSR 2019 Technical Papers
Pre-print
Test Coverage in Python Programs
MSR 2019 Technical Papers
The Emergence of Software Diversity in Maven Central
MSR 2019 Technical Papers
Pre-print
The Impact of Systematic Edits in History Slicing
MSR 2019 Technical Papers
Pre-print
The Rise of Android Code Smells: Who Is to Blame?
MSR 2019 Technical Papers
Time Present and Time Past: Analyzing the Evolution of JavaScript Code in the Wild
MSR 2019 Technical Papers
Pre-print
Tracing Back Log Data to its Log Statement: From Research to Practice
MSR 2019 Technical Papers
Pre-print
World of Code: An Infrastructure for Mining the Universe of Open Source VCS Data
MSR 2019 Technical Papers

Call for Papers

Scope

The technical track of MSR 2019 solicits novel, high quality submissions on a wide range of topics, including (but not limited to):

  • Analysis of change patterns and trends to assist in future development
  • Analysis of natural language artifacts in software repositories
  • Analysis of software ecosystems and mining of repositories across multiple projects
  • Approaches, applications, and tools for software repository mining
  • Characterization, classification, and prediction of software defects based on analysis of software repositories
  • Characterization of bias in mining and guidelines to ensure quality results
  • Empirical studies on extracting data from repositories of large long-lived and/or industrial projects
  • Energy efficiency of software
  • Meta-models, exchange formats, and infrastructure tools to facilitate the sharing of extracted data and to encourage reuse and repeatability
  • Methods of integrating mined data from various historical sources
  • Mining code review repositories
  • Mining execution traces and logs
  • Mining human and social aspects of development
  • Mining interaction data repositories
  • Mining mobile app stores and app reviews
  • Mining multimedia repositories
  • Mining software licensing and copyrights
  • Models for social and development processes in large software projects
  • Models of software project evolution based on historical repository data
  • Prediction and modeling of software quality
  • Privacy and ethics in mining software repositories
  • Release engineering, including continuous integration, delivery and deployment
  • Search-driven software development, including search techniques to assist developers in finding suitable components and code fragments for reuse, and software search engines
  • Software science
  • Studies of programming language features and their usage
  • Techniques and tools for capturing new forms of data for storage in software repositories, such as effort data, fine-grained changes, and refactoring
  • Techniques to model reliability and defect occurrences
  • Visualization techniques and models of mined data

Types of Technical Track Submissions

We accept both full (10 pages plus 2 additional pages of references) and short (4 pages plus 2 additional pages of references) papers. Furthermore, in order to facilitate the reviewing process of your paper’s contribution, you should select one of the following paper categories:

1. Research Paper

Full research papers are expected to describe new mining methodologies and/or provide novel research results, and should be evaluated scientifically. While a high degree of technical rigor is expected for long papers, short research papers should discuss controversial issues in the field, or describe interesting or thought provoking ideas that are not yet fully developed. Accepted short papers will be presented in a short lightning talk.

Relevant review criteria:

  • relevance to field of MSR (+ clarity of relation with related work)
  • novelty
  • soundness of approach
  • quality of evaluation [for long papers]
  • ability to replicate the evaluation [for long papers]
  • quality of presentation (e.g., paper is readable, easy to follow, figures are clear, etc.)

2. Practice Experience

Adapting existing algorithms or proposing new algorithms or approaches for practical use are considered a plus.

Full practice papers are expected to evaluate the application of mining repository algorithms in an industry/open source organization context and discuss the actionable results of the evaluation (whether positive or negative) as well as any adaptations and/or modifications to the algorithms. While a detailed account of an empirical evaluation is expected for long papers, short practice papers can focus more on presenting adaptations of existing approaches or proposals of new approaches/variants for their application in an industry/open source organization context. Accepted short papers will be presented in a short lightning talk.

Relevant review criteria:

  • relevance to field of MSR (+ clarity of relation with related work)
  • explicit discussion of any adaptations or variants required by the application of the existing/new approach in practice
  • quality of empirical evaluation [long papers]
  • explicit discussion on the usefulness/impact of the approach in practice [long papers]
  • quality of presentation (e.g., paper is readable, easy to follow, figures are clear, etc.)

3. Tool Track

Given that MSR has reached its 16th edition in 2019, it is essential to promote and recognize the creation and use of tools that are designed and built not only for a specific research project, but for the MSR community as a whole. Those tools enable other researchers to jumpstart their own research efforts, and also enable reproducibility of earlier work.

Short Tool papers should include descriptions and promising use cases of previously unpublished tools built by the authors that are publicly available for reuse by other researchers and practitioners. Long tool papers should also provide an empirical evaluation (qualitative and/or quantitative) of a tool.

Since the tool track is double blind, any citations that might reveal the tool’s name or authors should be marked as [removed for double-blind]. Furthermore, we do not request the tool to be available for review, since blinding out tool names in the code or documentation is a hassle. Instead, we require a short appendix (counted in the 2 pages of bibliography) explaining how you will be making the tool available for the next 5 years, how people will be able to install and run the tool, and what kinds of tutorials/sample data will be made available with the tool. After acceptance, the tool should be made available according to those specifications.

Relevant review criteria:

  • relevance to field of MSR, including potential for reuse by other research teams and practitioners
  • novelty (+ clarity of relation with related work)
  • presence of comprehensive details on tool’s internals and usage
  • evaluation of usefulness/usability of the tool [for long papers]
  • quality of presentation (e.g., paper is readable, easy to follow, figures are clear, etc.)
  • before acceptance: short appendix explaining plans for public access to the tool for the next 5 years, for documentation on hot to install and use the tool, and for sample data included with the tool
  • after acceptance: tool, documentation and sample data should be made available according to the appendix

Submission Process

Papers must be submitted electronically through EasyChair. All types of technical papers will be peer-reviewed according to the specified review criteria, hence it is required to choose the right type of paper according to the paper’s major contributions. Submissions should follow the IEEE Conference Proceedings Formatting Guidelines, with title in 24pt font and full text in 10pt type. LaTEX users must use \documentclass[10pt,conference]{IEEEtran} without including the compsoc or compsocconf option.

Papers submitted for consideration should not have been published elsewhere and should not be under review or submitted for review elsewhere for the duration of consideration. ACM plagiarism policy and procedures shall be followed for cases of double submission. The submission must also comply with the IEEE Policy on Authorship.

Upon notification of acceptance, all authors of accepted papers will be asked to complete a copyright form and will receive further instructions for preparing their camera ready versions. At least one author of each paper is expected to register and present the results at the MSR 2019 conference. All accepted contributions will be published in the conference electronic proceedings.

The official publication date is the date the proceedings are made available in the ACM or IEEE Digital Libraries. This date may be up to two weeks prior to the first day of ICSE 2019. The official publication date affects the deadline for any patent filings related to the published work. Purchases of additional pages in the proceedings is not allowed.

A selection of the best papers will be invited to an EMSE Special Issue. All accepted technical papers in 2019 have a chance to win the "MSR FOSS Impact Paper Award“.

IMPORTANT: The main technical track of MSR 2019 follows the double-blind submission model. Submissions should not reveal the identity of the authors in any way. This means that authors should:

  • leave out author names and affiliations from the body and metadata of the submitted pdf
  • ensure that any citations to related work by themselves are written in the third person, for example “the prior work of XYZ” as opposed to “our prior work [2]”
  • not refer to their personal, lab or university website; similarly, care should be taken with personal accounts on github, bitbucket, Google Drive, etc.
  • not upload unblinded versions of their paper on archival websites during bidding/reviewing, however uploading unblinded versions prior to submission is allowed and sometimes unavoidable (e.g., thesis)

However, mind that it is easily possible to anonymously share replication packages with reviewers through the process described below in the “Open Science Policy” section or through anonymous sharing via Dropbox, Google Drive, etc. We highly recommend doing so, except for confidentiality or privacy reasons.

Open Science Policy

MSR encourages authors to submit replication packages and/or data sets with their papers, since access to data and scripts is essential during peer review. The following guidelines are recommendations and not mandatory. Your choice to use open science or not will not affect the review process for your paper.

If you decide to share data, we strongly encourage you to archive data sets and scripts on online archival sites such as zenodo.org, figshare.com, or archive.org. Note that zenodo.org accepts up to 50GB per dataset (more upon request), while archive.org allows to upload terabytes of data. During peer review the data set or scripts should be privately shared via an anonymous link in your manuscript. Once accepted, an option can be toggled to publish the data and scripts with an official DOI.

Apart from openness during peer review, the above archival sites also enable openness to the reader once the paper is accepted and to cite the archived content via an automatically generated DOI. A clearly illustrated walkthrough to archive data and scripts on zenodo.org and figshare.com can be found here.

Finally, MSR encourages authors to self-archive a preprint of your accepted manuscript in open, preserved repositories such as arXiv.org. This is legal and allowed by all major publishers including ACM and IEEE, enabling open access to any interested party. Note that the final version of the paper, as laid out by the publisher, cannot be self-archived. Instead, use the manuscript with reviewer comments addressed, but before applying the camera-ready instructions and templates. Feel free to contact the MSR 2019 PC or proceedings chairs for more details.

Deadlines

Abstract deadline: January 15, 2019, 23:59 AoE

Papers deadline: January 22, 2019, 23:59 AoE

Grace period for paper updates: January 23-24, 2019, 23:59 AoE

Author Notification: March 1, 2019, 23:59 AoE

Camera Ready: March 15, 2019, 23:59 AoE

Organization

Bram Adams (PC co-chair), Polytechnique Montreal, Canada

Sonia Haiduc (PC co-chair), Florida State University, USA

Margaret-Anne Storey (General Chair), University of Victoria, Canada