Tue 17 May | Displayed time zone: Eastern Time (US & Canada)
20:00 - 20:50 | MSR 2022 Opening Technical Papers / Shadow PC / Hackathon / FOSS Award / MSR Awards / Mining Challenge / Registered Reports / Keynotes / Industry Track / MIP Award / Tutorials / Vision and Reflection / Data and Tool Showcase Track at MSR Plenary room | ||
21:00 - 21:50 | Newcomer Orientation I Technical Papers / Shadow PC / Hackathon / FOSS Award / MSR Awards / Mining Challenge / Registered Reports / Keynotes / Industry Track / MIP Award / Tutorials / Vision and Reflection / Data and Tool Showcase Track at MSR Newcomer Orientation room Chair(s): Yuan Tian Queen's University, Kingston, Canada, Gias Uddin University of Calgary, Canada Mentors: Bram Adams, Fatemeh Fard, Li Li, Ali Ouni, Tianyi Zhang | ||
Wed 18 May
04:00 - 04:50 | Newcomer Orientation II Technical Papers at MSR Newcomer Orientation room Chair(s): Tegawendé F. Bissyandé SnT, University of Luxembourg, Chaiyong Ragkhitwetsagul Mahidol University, Thailand Mentors: Bodin Chinthanet, Raula Gaikovina Kula, Christoph Treude, Xin Xia | ||
11:00 - 11:50 | Keynote: Christian Kästner – From Models to Systems: Rethinking the Role of Software Engineering for Machine Learning Technical Papers at MSR Plenary room Chair(s): Nicole Novielli University of Bari | ||
12:00 - 12:50 | Mining Challenge Mining Challenge / Technical Papers at MSR Main room - even hours Chair(s): Steffen Herbold TU Clausthal | ||
12:00 4mTalk | An Exploratory Study on Refactoring Documentation in Issues Handling Mining Challenge Eman Abdullah AlOmar Stevens Institute of Technology, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Christian D. Newman Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
12:04 4mTalk | Between JIRA and GitHub: ASFBot and its Influence on Human Comments in Issue Trackers Mining Challenge Ambarish Moharil Eindhoven University of Technology, Dmitrii Orlov Eindhoven University of Technology, Samar Jameel Eindhoven University of Technology, Tristan Trouwen Eindhoven University of Technology, Nathan Cassee Eindhoven University of Technology, Alexander Serebrenik Eindhoven University of Technology Pre-print | ||
12:08 4mTalk | Is Refactoring Always a Good Egg? Exploring the Interconnection Between Bugs and Refactorings Mining Challenge File Attached | ||
12:12 4mTalk | On the Co-Occurrence of Refactoring of Test and Source Code Mining Challenge Pre-print Media Attached | ||
12:16 4mTalk | Refactoring Debt: Myth or Reality? An Exploratory Study on the Relationship Between Technical Debt and Refactoring (Best Mining Challenge Paper Award) Mining Challenge Anthony Peruma Rochester Institute of Technology, Eman Abdullah AlOmar Stevens Institute of Technology, Christian D. Newman Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print Media Attached | ||
12:20 4mTalk | Studying the Impact of Continuous Delivery Adoption on Bug-Fixing Time in Apache’s Open-Source Projects Mining Challenge Carlos Diego Andrade de Almeida Federal University of Ceará, Diego N. Feijó Federal University of Ceará, Lincoln Rocha Federal University of Ceará Media Attached | ||
12:24 4mTalk | Which bugs are missed in code reviews: An empirical study on SmartSHARK dataset Mining Challenge Fatemeh Khoshnoud Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Ali Rezaei Nasab Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Zahra Toudeji Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Ashkan Sami Shiraz University | ||
12:28 22mLive Q&A | Discussions and Q&A Technical Papers |
13:00 - 13:50 | Session 4: Software Quality (Bugs & Smells) Data and Tool Showcase Track / Technical Papers at MSR Main room - odd hours Chair(s): Maxime Lamothe Polytechnique Montreal, Montreal, Canada, Mahmoud Alfadel University of Waterloo | ||
13:00 7mTalk | Dazzle: Using Optimized Generative Adversarial Networks to Address Security Data Class Imbalance Issue Technical Papers Rui Shu North Carolina State University, Tianpei Xia North Carolina State University, Laurie Williams North Carolina State University, Tim Menzies North Carolina State University | ||
13:07 7mTalk | To What Extent do Deep Learning-based Code Recommenders Generate Predictions by Cloning Code from the Training Set? Technical Papers Matteo Ciniselli Università della Svizzera Italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print | ||
13:14 7mTalk | How to Improve Deep Learning for Software Analytics (a case study with code smell detection) Technical Papers Pre-print | ||
13:21 7mTalk | Using Active Learning to Find High-Fidelity Builds Technical Papers Harshitha Menon Lawrence Livermore National Lab, Konstantinos Parasyris Lawrence Livermore National Laboratory, Todd Gamblin Lawrence Livermore National Laboratory, Tom Scogland Lawrence Livermore National Laboratory Pre-print | ||
13:28 4mTalk | ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction Data and Tool Showcase Track Hossein Keshavarz David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada, Mei Nagappan University of Waterloo Pre-print | ||
13:32 4mTalk | ReCover: a Curated Dataset for Regression Testing Research Data and Tool Showcase Track Francesco Altiero Università degli Studi di Napoli Federico II, Anna Corazza Università degli Studi di Napoli Federico II, Sergio Di Martino Università degli Studi di Napoli Federico II, Adriano Peron Università degli Studi di Napoli Federico II, Luigi Libero Lucio Starace Università degli Studi di Napoli Federico II | ||
13:36 14mLive Q&A | Discussions and Q&A Technical Papers |
13:00 - 13:50 | |||
13:00 50mTutorial | Empirical Standards for Repository Mining Tutorials Paul Ralph Dalhousie University, Tushar Sharma Dalhousie University, Preetha Chatterjee Drexel University, USA Pre-print |
14:00 - 14:50 | Session 5: Communication & Domains Data and Tool Showcase Track / Technical Papers at MSR Main room - even hours Chair(s): Masud Rahman Dalhousie University, Mahmoud Alfadel University of Waterloo | ||
14:00 7mTalk | Painting the Landscape of Automotive Software in GitHub Technical Papers Sangeeth Kochanthara Eindhoven University of Technology, Yanja Dajsuren Eindhoven University of Technology, Loek Cleophas Eindhoven University of Technology (TU/e) and Stellenbosch University (SU), Mark van den Brand Eindhoven University of Technology Pre-print Media Attached | ||
14:07 7mFull-paper | Mining the Usage of Reactive Programming APIs: A Study on GitHub and Stack Overflow Technical Papers Carlos Zimmerle Federal University of Pernambuco, Kiev Gama Federal University of Pernambuco, Fernando Castor Utrecht University & Federal University of Pernambuco, José Murilo Filho Federal University of Pernambuco DOI Pre-print | ||
14:14 4mTalk | SoCCMiner: A Source Code-Comments and Comment-Context Miner Data and Tool Showcase Track Murali Sridharan University of Oulu, Mika Mäntylä University of Oulu, Maëlick Claes University of Oulu, Leevi Rantala University of Oulu Pre-print | ||
14:18 4mTalk | SLNET: A Redistributable Corpus of 3rd-party Simulink Models Data and Tool Showcase Track Sohil Lal Shrestha The University of Texas at Arlington, Shafiul Azam Chowdhury University of Texas at Arlington, Christoph Csallner University of Texas at Arlington DOI Pre-print Media Attached | ||
14:22 4mTalk | SOSum: A Dataset of Stack Overflow Post Summaries Data and Tool Showcase Track Bonan Kou Purdue University, Yifeng Di Purdue University, Muhao Chen University of Southern California, Tianyi Zhang Purdue University | ||
14:26 4mTalk | Inspect4py: A Knowledge Extraction Framework for Python Code Repositories Data and Tool Showcase Track | ||
14:30 4mTalk | DISCO: A Dataset of Discord Chat Conversations for Software Engineering Research Data and Tool Showcase Track Keerthana Muthu Subash Carleton University, Canada, Lakshmi Prasanna Kumar Carleton University, Canada, Sri Lakshmi Vadlamani Carleton University, Canada, Preetha Chatterjee Drexel University, USA, Olga Baysal Carleton University DOI Pre-print Media Attached | ||
14:34 16mLive Q&A | Discussions and Q&A Technical Papers |
14:00 - 14:50 | |||
14:00 50mTutorial | Mining the Ethereum Blockchain Platform: Best Practices and Pitfalls Tutorials Gustavo A. Oliva Queen's University |
21:00 - 21:50 | Session 7: Developer Wellbeing & Project Communication Technical Papers / Data and Tool Showcase Track / Industry Track at MSR Main room - odd hours Chair(s): Bram Adams Queen's University, Kingston, Ontario | ||
21:00 7mTalk | On the Violation of Honesty in Mobile Apps: Automated Detection and Categories (Distinguished Paper Award) Technical Papers Humphrey Obie Monash University, Idowu Oselumhe Ilekura Data Science Nigeria, Hung Du Applied Artificial Intelligence Institute, Deakin University, Mojtaba Shahin RMIT University, Australia, John Grundy Monash University, Li Li Monash University, Jon Whittle CSIRO's Data61 and Monash University, Burak Turhan University of Oulu Pre-print | ||
21:07 7mTalk | How heated is it? Understanding GitHub locked issues Technical Papers Isabella Ferreira Polytechnique Montréal, Bram Adams Queen's University, Kingston, Ontario, Jinghui Cheng Polytechnique Montreal Pre-print Media Attached | ||
21:14 4mTalk | The OCEAN mailing list data set: Network analysis spanning mailing lists and code repositories Data and Tool Showcase Track Melanie Warrick University of Vermont, Samuel F. Rosenblatt University of Vermont, Jean-Gabriel Young University of Vermont, amanda casari Open Source Programs Office, Google, Laurent Hébert-Dufresne University of Vermont, James P. Bagrow University of Vermont DOI Pre-print Media Attached | ||
21:18 4mTalk | The Unexplored Treasure Trove of Phabricator Code Reviews Data and Tool Showcase Track Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands DOI Pre-print | ||
21:22 4mTalk | The Unsolvable Problem or the Unheard Answer? A Dataset of 24,669 Open-Source Software Conference Talks Data and Tool Showcase Track Kimberly Truong Oregon State University, Courtney Miller Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Christian Kästner Carnegie Mellon University DOI Pre-print | ||
21:26 4mTalk | Exploring Apache Incubator Project Trajectories with APEX Data and Tool Showcase Track Anirudh Ramchandran University of California, Davis, Likang Yin University of California, Davis, Vladimir Filkov University of California at Davis | ||
21:30 7mTalk | A Culture of Productivity: Maximizing Productivity by Maximizing Wellbeing Industry Track Brian Houck Microsoft Research | ||
21:37 13mLive Q&A | Discussions and Q&A Technical Papers |
Thu 19 May
04:00 - 04:50 | Session 9: Scaling & Cloud Industry Track / Registered Reports / Data and Tool Showcase Track / Technical Papers at MSR Main room - even hours Chair(s): Lwin Khin Shar Singapore Management University | ||
04:00 4mTalk | SniP: An Efficient Stack Tracing Framework for Multi-threaded Programs Data and Tool Showcase Track Arun KP Indian Institute of Technology Kanpur, Saurabh Kumar Indian Institute of Technology Kanpur, Debadatta Mishra, Biswabandan Panda Indian Institute of Technology Bombay DOI Pre-print | ||
04:04 4mTalk | Tooling for Time- and Space-efficient git Repository Mining Data and Tool Showcase Track Fabian Heseding Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Willy Scheibel Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Jürgen Döllner Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam | ||
04:08 4mTalk | TSSB-3M: Mining single statement bugs at massive scale Data and Tool Showcase Track Cedric Richter Carl von Ossietzky Universität Oldenburg / University of Oldenburg, Heike Wehrheim Carl von Ossietzky Universität Oldenburg / University of Oldenburg Pre-print Media Attached | ||
04:12 7mTalk | Improved Business Outcomes from Cloud Applications – using Integrated Process and Runtime Product Data Mining Industry Track | ||
04:19 7mTalk | Improve Quality of Cloud Serverless Architectures through Software Repository Mining Industry Track | ||
04:26 4mTalk | Toward Granular Automatic Unit Test Case Generation Registered Reports Fabiano Pecorelli Tampere University, Giovanni Grano LocalStack, Fabio Palomba University of Salerno, Harald C. Gall University of Zurich, Andrea De Lucia University of Salerno Pre-print | ||
04:30 20mLive Q&A | Discussions and Q&A Technical Papers |
10:00 - 10:50 | Virtual Coffee Technical Papers at MSR Main room - even hours This session will be for informal conversations on Midspace. | ||
11:00 - 11:50 | Session 11: Machine Learning & Information Retrieval Technical Papers at MSR Main room - odd hours Chair(s): Phuong T. Nguyen University of L’Aquila | ||
11:00 4mShort-paper | On the Naturalness of Fuzzer Generated Code Technical Papers Rajeswari Hita Kambhamettu Carnegie Mellon University, John Billos Wake Forest University, Carolyn "Tomi" Oluwaseun-Apo Pennsylvania State University, Benjamin Gafford Carnegie Mellon University, Rohan Padhye Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
11:04 7mTalk | Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes Technical Papers DOI Pre-print Media Attached | ||
11:11 7mTalk | Multimodal Recommendation of Messenger Channels Technical Papers Ekaterina Koshchenko JetBrains Research, Egor Klimov JetBrains Research, Vladimir Kovalenko JetBrains Research | ||
11:18 7mTalk | Senatus: A Fast and Accurate Code-to-Code Recommendation Engine Technical Papers Fran Silavong JP Morgan Chase & Co., Sean Moran JP Morgan Chase & Co., Antonios Georgiadis JP Morgan Chase & Co., Rohan Saphal JP Morgan Chase & Co., Robert Otter JP Morgan Chase & Co. DOI Pre-print Media Attached | ||
11:25 7mTalk | Challenges in Migrating Imperative Deep Learning Programs to Graph Execution: An Empirical Study Technical Papers Tatiana Castro Vélez City University of New York (CUNY) Graduate Center, Raffi Khatchadourian City University of New York (CUNY) Hunter College, Mehdi Bagherzadeh Oakland University, Anita Raja City University of New York (CUNY) Hunter College Pre-print Media Attached | ||
11:32 7mTalk | GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses Technical Papers Wei Ma SnT, University of Luxembourg, Mengjie Zhao LMU Munich, Ezekiel Soremekun SnT, University of Luxembourg, Qiang Hu University of Luxembourg, Jie M. Zhang King's College London, Mike Papadakis University of Luxembourg, Luxembourg, Maxime Cordy University of Luxembourg, Luxembourg, Xiaofei Xie Singapore Management University, Singapore, Yves Le Traon University of Luxembourg, Luxembourg Pre-print | ||
11:39 11mLive Q&A | Discussions and Q&A Technical Papers |
12:00 - 12:51 | Vision & Reflections Track: Past Technical Papers at MSR Plenary room Chair(s): Bram Adams Queen's University, Kingston, Ontario, Shaowei Wang University of Manitoba | ||
12:00 10mTalk | Back to the future: Empirical Revolution(s) in Software Engineering Technical Papers Audris Mockus The University of Tennessee | ||
12:10 10mTalk | Engineering the MSR Field and the Joy of Research Technical Papers Ahmed E. Hassan Queen's University | ||
12:20 10mTalk | It's all in your network: How mining developer collaboration allowed us to peer into complex socio-technical aspects of software development Technical Papers Daniela Damian University of Victoria | ||
12:30 21mOther | Discussion Technical Papers |
13:00 - 13:51 | Vision & Reflections Track: Future Technical Papers at MSR Plenary room Chair(s): Bram Adams Queen's University, Kingston, Ontario, Shaowei Wang University of Manitoba | ||
13:00 10mTalk | Bias in MSR research Technical Papers Alexander Serebrenik Eindhoven University of Technology | ||
13:10 10mTalk | The Next Generation of Software Developers Technical Papers Denae Ford Microsoft Research | ||
13:20 10mTalk | Mining Software Repositories in the age of AI Technical Papers Foutse Khomh Polytechnique Montréal | ||
13:30 21mOther | Discussion Technical Papers |
14:00 - 14:50 | MIP Award Session MIP Award at MSR Plenary room Chair(s): Massimiliano Di Penta University of Sannio, Italy Most Influential Paper: “GHTorrent: Github’s data from a firehose” by Georgios Gousios and Diomidis Spinellis (MSR 2012) for conceiving and maintaining the GHTorrent archive, extensively leveraged by the MSR community. | ||
14:00 50mTalk | MIP Award Talk MIP Award Georgios Gousios Endor Labs & Delft University of Technology, Diomidis Spinellis Athens University of Economics and Business; Delft University of Technology |
22:00 - 22:50 | Foundational Contribution Award Session Technical Papers at MSR Plenary room Chair(s): Miryung Kim University of California at Los Angeles, USA | ||
22:00 50mAwards | MSR Foundational Contribution Award Technical Papers |
Fri 20 May
05:00 - 05:30 | Closing Session of Virtual MSR 2022 + Introduction of MSR 2023 Technical Papers at MSR Plenary room Speakers: David Lo, Shane McIntosh, Nicole Novielli, Emad Shihab | ||
10:00 - 10:50 | Shadow PC Retrospective Technical Papers at MSR Main room - even hours Chair(s): Eleni Constantinou Eindhoven University of Technology, Sarah Nadi University of Alberta Closed to Shadow PC Members. | ||
12:00 - 12:50 | |||
12:00 50mTutorial | Using Datalore for Reproducible Research Tutorials Jodie Burchell JetBrains |
13:00 - 13:50 | Tutorial: Software Bots in Software Engineering: Benefits and Challenges Tutorials at MSR Tutorials room | ||
13:00 50mTutorial | Software Bots in Software Engineering: Benefits and Challenges Tutorials Mairieli Wessel Delft University of Technology, Marco Gerosa Northern Arizona University, USA, Emad Shihab Concordia University |
14:00 - 15:00 | Session 16: Non-functional Properties (Availability, Security, Legal Aspects) Industry Track / Technical Papers / Registered Reports / Data and Tool Showcase Track at MSR Main room - even hours Chair(s): Maxime Lamothe Polytechnique Montreal, Montreal, Canada, Jin L.C. Guo McGill University | ||
14:00 7mTalk | A Deep Study of the Effects and Fixes of Server-Side Request Races in Web Applications Technical Papers Zhengyi Qiu North Carolina State University, Shudi Shao North Carolina State University, Qi Zhao North Carolina State University, Hassan Ali Khan North Carolina State University, Xinning Hui North Carolina State University, Guoliang Jin North Carolina State University Media Attached | ||
14:07 4mTalk | A Large-scale Dataset of (Open Source) License Text Variants (Data and Tool Showcase Award) Data and Tool Showcase Track Stefano Zacchiroli Télécom Paris, Polytechnic Institute of Paris DOI Pre-print | ||
14:11 7mTalk | SECOM: Towards a convention for security commit messages (FOSS Impact Paper Award) Industry Track Sofia Reis Instituto Superior Técnico, U. Lisboa & INESC-ID, Rui Abreu Faculty of Engineering, University of Porto, Portugal, Hakan Erdogmus Carnegie Mellon University, Corina S. Păsăreanu Carnegie Mellon University Pre-print | ||
14:18 7mTalk | Varangian: A Git Bot for Augmented Static Analysis Industry Track Saurabh Pujar IBM Research, Yunhui Zheng IBM Research, Luca Buratti IBM Research, Burn Lewis IBM Research, Alessandro Morari IBM Research, Jim A. Laredo IBM Research, Kevin Postlethwait Red Hat, Christoph Görn Red Hat | ||
14:25 7mTalk | Detecting Privacy-Sensitive Code Changes with Language Modeling Industry Track Gökalp Demirci Meta Platforms, Inc., Vijayaraghavan Murali Meta Platforms, Inc., Imad Ahmad Meta Platforms, Inc., Rajeev Rao Meta Platforms, Inc., Gareth Ari Aye Meta Platforms, Inc. | ||
14:32 4mTalk | Is GitHub's Copilot as Bad As Humans at Introducing Vulnerabilities in Code? Registered Reports Owura Asare University of Waterloo, Mei Nagappan University of Waterloo, N. Asokan University of Waterloo Pre-print | ||
14:36 7mTalk | Finding the Fun in Fundraising: Public Issues and Pull Requests in VC-backed Open-Core Companies Industry Track Kevin Xu GitHub | ||
14:43 17mLive Q&A | Discussions and Q&A Technical Papers |
Mon 23 May
09:00 - 10:30 | In-Person MSR Opening, Keynote and MIP Session Technical Papers / MIP Award at Room 315+316 Chair(s): David Lo Singapore Management University | ||
09:00 20mTalk | In-Person MSR 2022 Opening Session Technical Papers David Lo Singapore Management University, Shane McIntosh University of Waterloo, Nicole Novielli University of Bari | ||
09:20 35mKeynote | From Models to Systems: Rethinking the Role of Software Engineering for Machine Learning Technical Papers Christian Kästner Carnegie Mellon University | ||
09:55 35mTalk | MIP Award Talk MIP Award Georgios Gousios Endor Labs & Delft University of Technology, Diomidis Spinellis Athens University of Economics and Business; Delft University of Technology |
10:30 - 11:00 | |||
12:30 - 13:30 | |||
13:30 - 15:00 | Blended Technical Session 2 (Machine Learning and Information Retrieval) Technical Papers / Data and Tool Showcase Track at Room 315+316 Chair(s): Preetha Chatterjee Drexel University, USA | ||
13:30 15mTalk | Methods for Stabilizing Models across Large Samples of Projects (with case studies on Predicting Defect and Project Health) Technical Papers Suvodeep Majumder North Carolina State University, Tianpei Xia North Carolina State University, Rahul Krishna North Carolina State University, Tim Menzies North Carolina State University Pre-print Media Attached | ||
13:45 15mTalk | GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses Technical Papers Wei Ma SnT, University of Luxembourg, Mengjie Zhao LMU Munich, Ezekiel Soremekun SnT, University of Luxembourg, Qiang Hu University of Luxembourg, Jie M. Zhang King's College London, Mike Papadakis University of Luxembourg, Luxembourg, Maxime Cordy University of Luxembourg, Luxembourg, Xiaofei Xie Singapore Management University, Singapore, Yves Le Traon University of Luxembourg, Luxembourg Pre-print | ||
14:00 15mTalk | Senatus: A Fast and Accurate Code-to-Code Recommendation Engine Technical Papers Fran Silavong JP Morgan Chase & Co., Sean Moran JP Morgan Chase & Co., Antonios Georgiadis JP Morgan Chase & Co., Rohan Saphal JP Morgan Chase & Co., Robert Otter JP Morgan Chase & Co. DOI Pre-print Media Attached | ||
14:15 8mShort-paper | Comments on Comments: Where Code Review and Documentation Meet Technical Papers Nikitha Rao Carnegie Mellon University, Jason Tsay IBM Research, Martin Hirzel IBM Research, Vincent J. Hellendoorn Carnegie Mellon University DOI Pre-print File Attached | ||
14:23 8mShort-paper | On the Naturalness of Fuzzer Generated Code Technical Papers Rajeswari Hita Kambhamettu Carnegie Mellon University, John Billos Wake Forest University, Carolyn "Tomi" Oluwaseun-Apo Pennsylvania State University, Benjamin Gafford Carnegie Mellon University, Rohan Padhye Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
14:31 8mTalk | SOSum: A Dataset of Stack Overflow Post Summaries Data and Tool Showcase Track Bonan Kou Purdue University, Yifeng Di Purdue University, Muhao Chen University of Southern California, Tianyi Zhang Purdue University | ||
14:39 21mLive Q&A | Discussions and Q&A Technical Papers |
15:00 - 15:30 | |||
15:30 - 17:00 | Networking & Poster Session Technical Papers at Room 315+316 Chair(s): Miikka Kuutila University of Oulu The following are the registered posters: | ||
Tue 24 May
10:30 - 11:00 | |||
12:15 - 12:30 | Brainstorming / Discussion I Technical Papers at Room 315+316 Chair(s): Shane McIntosh University of Waterloo | ||
12:30 - 13:30 | |||
13:30 - 15:00 | Brainstorming / Discussion II Technical Papers at Room 315+316 Chair(s): Shane McIntosh University of Waterloo | ||
15:00 - 15:30 | |||
15:30 - 17:00 | Blended Technical Session 5 (Miscellaneous) Technical Papers / Data and Tool Showcase Track / Mining Challenge at Room 315+316 Chair(s): Luís Cruz Delft University of Technology | ||
15:30 15mTalk | Code Review Practices for Refactoring Changes: An Empirical Study on OpenStack Technical Papers Eman Abdullah AlOmar Stevens Institute of Technology, Moataz Chouchen ETS, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
15:45 15mTalk | Painting the Landscape of Automotive Software in GitHub Technical Papers Sangeeth Kochanthara Eindhoven University of Technology, Yanja Dajsuren Eindhoven University of Technology, Loek Cleophas Eindhoven University of Technology (TU/e) and Stellenbosch University (SU), Mark van den Brand Eindhoven University of Technology Pre-print Media Attached | ||
16:00 8mTalk | SLNET: A Redistributable Corpus of 3rd-party Simulink Models Data and Tool Showcase Track Sohil Lal Shrestha The University of Texas at Arlington, Shafiul Azam Chowdhury University of Texas at Arlington, Christoph Csallner University of Texas at Arlington DOI Pre-print Media Attached | ||
16:08 8mTalk | SoCCMiner: A Source Code-Comments and Comment-Context Miner Data and Tool Showcase Track Murali Sridharan University of Oulu, Mika Mäntylä University of Oulu, Maëlick Claes University of Oulu, Leevi Rantala University of Oulu Pre-print | ||
16:16 8mTalk | An Exploratory Study on Refactoring Documentation in Issues Handling Mining Challenge Eman Abdullah AlOmar Stevens Institute of Technology, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Christian D. Newman Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
16:24 8mTalk | Between JIRA and GitHub: ASFBot and its Influence on Human Comments in Issue Trackers Mining Challenge Ambarish Moharil Eindhoven University of Technology, Dmitrii Orlov Eindhoven University of Technology, Samar Jameel Eindhoven University of Technology, Tristan Trouwen Eindhoven University of Technology, Nathan Cassee Eindhoven University of Technology, Alexander Serebrenik Eindhoven University of Technology Pre-print | ||
16:32 28mLive Q&A | Discussions and Q&A Technical Papers |
17:00 - 17:30 | |||
Unscheduled Events
Not scheduled Talk | MIP Award Talk Technical Papers Georgios Gousios Endor Labs & Delft University of Technology | ||
Not scheduled Talk | MSR Foundational Contribution Award Technical Papers |
Call for Papers
The Mining Software Repositories (MSR) conference is the premier conference for data science, machine learning, and artificial intelligence in software engineering. The goal of the conference is to improve software engineering practices by uncovering interesting and actionable information about software systems and projects, using the vast amounts of software data found in sources such as source control systems, defect tracking systems, code review repositories, archived communications between project personnel, question-and-answer sites, CI build servers, and run-time telemetry. Mining this information can help to understand software development and evolution, software users, and runtime behavior; support the maintenance of software systems; improve software design/reuse; empirically validate novel ideas and techniques; support predictions about software development; and exploit this knowledge in planning future development. The goal of this two-day international conference is to advance the science and practice of software engineering with data-driven techniques. The 19th International Conference on Mining Software Repositories will be held on May 23-24, 2022.
Evaluation Criteria
Research papers are expected to describe new methodologies and/or provide novel research results, and should be evaluated scientifically. While a high degree of technical rigor is expected for long papers, short research papers should discuss controversial issues in the field, or describe interesting or thought-provoking ideas that are not yet fully developed. Accepted short papers will be presented in a short lightning talk. Relevant review criteria:
- soundness of approach
- relevance to software engineering
- clarity of relation with related work
- quality of presentation
- quality of evaluation [for long papers]
- ability to replicate [for long papers]
- novelty
Submission Process
All authors should use the official “ACM Primary Article Template”, as can be obtained from the ACM Proceedings Template page. LaTeX users should use the sigconf option, as well as the review (to produce line numbers for easy reference by the reviewers) and anonymous (omitting author names) options. To that end, the following LaTeX code can be placed at the start of the LaTeX document:
\documentclass[sigconf,review,anonymous]{acmart}
\acmConference[MSR 2022]{MSR '22: Proceedings of the 19th International Conference on Mining Software Repositories}{May 23–24, 2022}{Pittsburgh, PA, USA}
Submissions to the Technical Track can be made via the submission site by the submission deadline. We encourage authors to upload their paper info early (the PDF can be submitted later) to properly enter conflicts for anonymous reviewing. All submissions must adhere to the following requirements:
- Submissions must not exceed the page limit (10 pages plus 2 additional pages of references for full papers; 4 pages plus 1 additional page of references for short papers). The page limit is strict, and it will not be possible to purchase additional pages at any point in the process (including after acceptance).
- Submissions must strictly conform to the ACM formatting instructions. Alterations of spacing, font size, and other changes that deviate from the instructions may result in desk rejection without further review.
- Submissions must not reveal the authors’ identities. The authors must make every effort to honor the double-anonymous review process. In particular, the authors’ names must be omitted from the submission and references to their prior work should be in the third person. Further advice, guidance, and explanation about the double-anonymous review process can be found in the Q&A page for ICSE 2022.
Any submission that does not comply with these requirements is likely to be desk rejected by the PC Chairs without further review. In addition, by submitting to the MSR Technical Track, the authors acknowledge that they are aware of and agree to be bound by the following policies:
- The ACM Policy and Procedures on Plagiarism and the IEEE Plagiarism FAQ. In particular, papers submitted to MSR 2022 must not have been published elsewhere and must not be under review or submitted for review elsewhere whilst under consideration for MSR 2022. Contravention of this concurrent submission policy will be deemed a serious breach of scientific ethics, and appropriate action will be taken in all such cases (including immediate rejection and reporting of the incident to ACM/IEEE). To check for double submission and plagiarism issues, the chairs reserve the right to (1) share the list of submissions with the PC Chairs of other conferences with overlapping review periods and (2) use external plagiarism detection software, under contract to the ACM or IEEE, to detect violations of these policies.
- The authorship policy of the ACM and the authorship policy of the IEEE.
Upon notification of acceptance, all authors of accepted papers will be asked to fill out a copyright form and will receive further instructions for preparing the camera-ready version of their papers. At least one author of each paper is expected to register and present the paper at the MSR 2022 conference. All accepted contributions will be published in the electronic proceedings of the conference.
A selection of the best papers will be invited to an Empirical Software Engineering (EMSE) Special Issue. The authors of accepted papers that show outstanding contributions to the FOSS community will have a chance to self-nominate their paper for the MSR FOSS Impact Paper Award. Please note that providing a replication package is strongly recommended even in double-anonymous submissions, since not providing one effectively hinders the peer-review process. Since access to data and scripts is essential during peer review, we strongly recommend archiving data sets on online archival sites such as dropbox.com, zenodo.org or figshare.com (instructions are available in the Open Science Policy below). The latter two also issue a DOI, which makes the data citable.
Submission Link
Papers must be submitted through HotCRP: https://msr2022-technical.hotcrp.com/
Shadow PC
We are continuing the Shadow PC process established at MSR 2021. The Shadow PC provides an opportunity to recruit and train the next generation of MSR researchers in reviewing MSR papers. The primary audience is PhD students and postdocs who have not yet served on the MSR PC. They will review papers submitted to MSR and form a parallel program committee. Experienced MSR researchers will look at the reviews and comments and give feedback to the Shadow PC. This has been done at other venues such as EuroSys and USENIX.
We ask authors to volunteer their papers for review by the Shadow PC; there will be an option on the submission page for this. This is purely a learning experience: the reviews from the Shadow PC will not be visible to the regular PC and will not affect the decision on the paper at MSR. The authors may request the reviews from the Shadow PC. The reviews and comments will be kept in a completely separate HotCRP installation to prevent any cross-over. By volunteering, you not only help future generations of MSR researchers become better reviewers, but you also get more feedback on your work.
Important Dates
- Abstract Deadline: Jan 17
- Paper Deadline: Jan 20
- Author Response Period: Feb 22 – Feb 24
- Author Notification: March 8
- Camera Ready Deadline: Late March
Open Science Policy
Openness in science is key to fostering progress via transparency, reproducibility and replicability. Our steering principle is that all research output should be accessible to the public and that empirical studies should be reproducible. In particular, we actively support the adoption of open data and open source principles. The following guidelines are recommendations and not mandatory. Your choice to use open science or not will not affect the review process for your paper. However, to increase reproducibility and replicability, we encourage all contributing authors to disclose:
- the source code of relevant software used or proposed in the paper, including that used to retrieve and analyze data
- the data used in the paper (e.g., evaluation data, anonymized survey data, etc.)
- instructions for other researchers describing how to reproduce or replicate the results
Already upon submission, authors can privately share their anonymized data and software on preserved archives, such as Zenodo or Figshare (a tutorial is available; please make sure that any links shared during peer review are anonymized). Zenodo accepts up to 50GB per dataset (more upon request). There is no need to use Dropbox or Google Drive. Once the paper is accepted, an option can be toggled to publish the data and scripts with an official DOI. Zenodo and Figshare accounts can easily be linked with GitHub repositories to automatically archive software releases. In the unlikely case that authors need to upload terabytes of data, Archive.org may be used.
After acceptance, we encourage authors to self-archive pre-prints of their papers in open, preserved repositories such as arXiv.org. This is legal and allowed by all major publishers including ACM and IEEE, and it lets anybody in the world reach your paper. Note that you are usually not allowed to self-archive the PDF of the published article (that is, the publisher proof or the Digital Library version). Instead, use the version of the manuscript that addresses the reviewer comments but predates the camera-ready instructions and templates. Feel free to contact the MSR 2022 PC or proceedings chairs for more details.
We recognize that anonymizing artifacts such as source code is more difficult than preserving anonymity in a paper. We ask authors to make a best effort not to reveal their identities. We will also ask reviewers to avoid trying to identify authors by looking at commit histories and other such information that is not easily anonymized. Authors wanting to share GitHub repositories may want to look into https://anonymous.4open.science/, an open source tool that helps you quickly double-blind your repository.
Please note that the success of the open science initiative depends on the willingness (and possibilities) of authors to disclose their data and that all submissions will undergo the same review process independent of whether or not they disclose their analysis code or data. We encourage authors who cannot disclose industrial or otherwise non-public data, for instance due to non-disclosure agreements, to provide an explicit (short) statement in the paper.
Accepted Papers and Attendance Expectation
Accepted papers will be permitted an additional page of content to allow authors to incorporate review feedback. The page limit for published papers will therefore be 11 pages for full papers (or 5 pages, for short papers), plus 2 pages which may only contain references.
- The official publication date is the date the proceedings are made available in the ACM or IEEE Digital Libraries. This date may be up to two weeks prior to the first day of ICSE 2022. The official publication date affects the deadline for any patent filings related to published work.
- Purchasing additional pages in the proceedings is not allowed.
After acceptance, the list of paper authors cannot be changed under any circumstances, and the list of authors on camera-ready papers must be identical to that on submitted papers. After acceptance, paper titles cannot be changed except by permission of the Program Co-Chairs, and only when referees have recommended a change for clarity or accuracy with respect to the paper content.
If a submission is accepted, at least one author of the paper is required to register for MSR 2022 and present the paper. [We will add more info on this as soon as the MSR 2022 format is finalized.]
Scope
The technical track of MSR 2022 solicits high-quality submissions on a wide range of topics related to artificial intelligence (AI), machine learning (ML), and data science (DS) in one or more of the following three main themes.
1. AI/ML/DS and SE
The analysis should aim to improve understanding of development processes and practices or aid in the development of new techniques or models to support software developers. This includes (but is not limited to) analysis or models for:
- commits,
- execution traces and logs,
- interaction data,
- code review data,
- natural language artifacts,
- software licenses and copyrights,
- app store data,
- programming language features,
- release information,
- CI logs,
- deployment and delivery,
- test data,
- runtime information,
- software ecosystems,
- defect and software quality data,
- human and social aspects of development,
- development process,
- energy profile data.
2. New techniques, tools, and models
The techniques, tools, and models should facilitate new ways to mine, analyze, or model software data. A submission could include (but is not limited to) techniques, tools, or models to:
- capture new forms of data,
- integrate data from multiple sources,
- visualize software data,
- model software data,
- solve SE problems,
- improve AI/ML/DS.
3. Considerations related to AI/ML/DS and SE
These submissions should reflect on the current state-of-the-art research methods or current practices in mining, analyzing, or modeling software data. These submissions can also propose new research methods or guidelines. This theme includes (but is not limited to) topics such as:
- privacy of collected data,
- ethics of mining, analyzing, or modeling software data,
- biases in software data, analyses, and tools,
- fairness in software data, analyses, and tools,
- replication studies.