MSR 2022 features the new “Vision and Reflection” track, which seeks to invite MSR experts to give mini-keynotes reflecting on how the MSR community has gotten to its current state and on providing a vision about the future of MSR. More concretely, this year’s edition of the track consists of 2 sessions, i.e., one reflecting on the past and one focusing on the future. The session about the “past” involves looking back at popular ideas and common expectations about MSR research from the early beginning of the field, to see the extent to which those expectations have been met or have evolved. The session about the “future” explores emerging topics and themes in MSR research and the extent to which the field is able to address those.
For the past session, we will hold the following mini keynotes:
Title: Back to the future: Empirical Revolution(s) in Software Engineering
Audris Mockus
University of Tennessee
Abstract: The desire to better understand software development lead to numerous attempts to quantify it. Easy-to-measure artifacts, such as source code, could provide only the most basic understanding of the entire development process and the attempts to directly measure quality and effort were cost-prohibitive, error-prone, and rarely shared with researchers or made public. The rise of open source not only provided a reliable software infrastructure but also the rich data source for the software engineering community to finally measure aspects of software development never seen before. Over time and with increased use of open source software, actual developers increasingly need to deal not just with their own project but with projects upstream, downstream, or sideways in the huge open source software supply chain. Measures derived from the entire software supply chain are now likely to bring about the next software engineering revolution.
Bio: Audris Mockus received BS and MS degrees in applied mathematics from the Moscow Institute of Physics and Technology in 1988, the MS degree in 1991 and a PhD degree in statistics from Carnegie Mellon University in 1994. He studies software developers’ culture and behavior through the recovery, documentation, and analysis of digital remains. These digital traces reflect projections of collective and individual activity. He reconstructs the reality from these projections by designing data mining methods to summarize and augment these digital traces, interactive visualization techniques to inspect, present, and control the behavior of teams and individuals, and statistical models and optimization techniques to understand the nature of individual and collective behavior. Mockus is the Ericsson-Harlan Mills Chair Professor in the Min H. Kao Department of Electrical Engineering and Computer Science at UT. He also continues to work part-time at Avaya Labs Research. Previously he worked in the Software Production Research Department at Bell Labs. He is a member of the IEEE and ACM.
Title: Engineering the MSR Field and the Joy of Research
Ahmed E. Hassan
Queen’s University
Abstract: The joy of exploring the unknowns and trailblazing new directions is what drives many researchers. However such activities come with great risks. It is of great importance for us to encourage such activities for the community as a whole to grow and thrive.
For the past 20 years, the MSR community has led many innovations (e.g., the data track, the challenge track, registered reports, and the MSR awards) with an eye on encouraging risks and bringing joy to MSR researchers. Over the years, the SE community has adopted many MSR innovations that have had an enormous impact on the rigour and sophistication of SE research.
In this brief talk, I will explain the motivation behind many of these MSR innovations while discussing challenges in the road ahead, as the rigour and sophistication expectations of published research threaten the freedoms and joys of researchers, in turn slowing the overall progress of our field.
Bio: Ahmed E. Hassan is the NSERC/RIM Industrial Research Chair in Software Engineering for Ultra Large Scale systems and the Canada Research Chair in Software Analytics at the School of Computing in Queen’s University. Dr. Hassan spearheaded the organization and creation of the Mining Software Repositories (MSR) conference and its research community. He co-edited special issues of the IEEE Transaction on Software Engineering and the Journal of Empirical Software Engineering on the MSR topic. Early tools and techniques developed by Dr. Hassan’s team are already integrated into products used by millions of users worldwide. Dr. Hassan industrial experience includes helping architect the Blackberry wireless platform at RIM, and working for IBM Research at the Almaden Research Lab and the Computer Research Lab at Nortel Networks. Dr. Hassan is the named inventor of patents at several jurisdictions around the world including the United States, Europe, India, Canada, and Japan. Dr. Hassan received the Ph.D., MMath, and BMath degrees from the School of Computer Science at the University of Waterloo in Canada.
Title: It’s all in your network: How mining developer collaboration allowed us to peer into complex socio-technical aspects of software development
Daniela Damian
University of Victoria
Abstract: The way in which developers collaborate and use collaborative tools has changed significantly in the last two decades. So did the way in which software is being developed. A number of techniques have been developed to mine developer collaboration from project repositories, from mailing lists in the early open source projects, to the integrated development environments that followed, to GitHub that has offered the case of open, large scale collaborations with unprecedented modes of implicit coordination. This talk will review some of these techniques, as well as the rich and fruitful empirical research they facilitated into the socio-technical aspects of software development. This includes properties and patterns of developer collaboration and knowledge flow in open source and proprietary software projects. The software engineering community now has a much better understanding of the social structures in software projects, intricacies of socio-technical congruence in co-located and distributed software development, as well as their complicated relationships to software quality and performance.
Bio: Daniela Damian is a Professor of Software Engineering in University of Victoria’s Department of Computer Science, where she leads research in the Software Engineering Global interAction Laboratory (SEGAL, thesegalgroup.org). Her work has studied socio-technical coordination in large, distributed projects, requirements engineering, global software development, software engineering education, and platform-based software ecosystems. Her recent interests include diversity and inclusion in software development. Daniela has served on the program committee boards or Program Co-Chair of several software engineering conferences, as well as on the editorial boards of Transactions on Software Engineering and the Journal of Requirements Engineering. She is serving on the Advisory Board at the Empirical Software Engineering Journal and as the Human Aspects Area Editor for the Journal of Software and Systems. She is ICSE 2022 Program co-Chair, together with Andreas Zeller. Contact her at danielad@uvic.ca, danieladamian.ca, @DanaHDamian
For the future session, we will hold the following mini keynotes:
Title: Bias in MSR research
Alexander Serebrenik
TU Eindhoven
Abstract: Rather than talking about the topics we study as is customary in scientific presentations, in this talk I would like to reflect on what we as the MSR research community usually do not study, what kind of voices we rarely hear, what kind of projects we rarely see, how these voices and projects influence results of our studies and how can we try and improve this situation.
Bio: Alexander Serebrenik (PhD, KU Leuven, 2003) is a Full Professor of Social Software Engineering at Eindhoven University of Technology, The Netherlands. His research goal is to facilitate evolution of software by taking into account social aspects of software development. He has co-authored a book Evolving Software Systems (Springer Verlag, 2014) and circa 200 scientific papers and articles. He has won several distinguished paper and distinguished review awards, as well as acted as steering committee chair, general chair, program (co-)chair and track (co-)chair of such software engineering conferences as ICSE, ICSME, ICPC and SANER.
Title: The Next Generation of Software Developers
Denae Ford Robinson
Microsoft Research
Abstract: Microsoft is home to the world’s largest developer communities and ecosystems with Azure, GitHub, and Visual Studio. Thus, having sustainable and inclusive communities is of strategic importance as it has the potential to transform society by enabling more people to develop software. Developers in these communities and others (e.g., Stack Overflow, YouTube, Twitter) often intersect aspects of their professional work with their personal life on social media platforms which allow them to feel more comfortable engaging. Therefore, understanding how developers operate at these intersections helps practitioners to better prepare for the evolution of online professional communities and continue to bridge its enterprise and consumer markets. In this talk, I will cover recent research on evolving developer communities and outline opportunities on how we can usher in the next generation of software developers by fostering healthy and inclusive communities.
Bio: Dr. Denae Ford Robinson is a Senior Researcher at Microsoft Research in the SAINTes group and an Affiliate Assistant Professor in the Human Centered Design and Engineering Department at the University of Washington. Her research lies at the intersection of Human-Computer Interaction and Software Engineering. In her work she identifies and dismantles cognitive and social barriers by designing mechanisms to support software developer participation in online socio-technical ecosystems. She is best known for her research on just-in-time mentorship as a mode to empower welcoming engagement in collaborative Q&A for online programming communities including open-source software and work to empower marginalized software developers in online communities. She received her B.S. and M.S. in Computer Science from North Carolina State University. She also received her Ph.D. in Computer Science and Graduate Minor in Cognitive Science from North Carolina State University. She is also a recipient of the National GEM Consortium Fellowship, National Science Foundation Graduate Research Fellowship, and Microsoft Research Ph.D. Fellowship. Her research publications can be found under her pen name ‘Denae Ford’. More information about her latest research can be found on her website: http://denaeford.me/
Title: Mining Software Repositories in the age of AI
Foutse Khomh
Polytechnique Montreal
Abstract: Artificial Intelligence (AI), in particular Machine Learning (ML) is increasingly deployed in large-scale and critical systems thanks to recent breakthroughs in deep learning and reinforcement learning. Traditionally, software systems are constructed deductively, by writing down the rules that govern the behavior of the system as program code. However, with ML, these rules are inferred from training data (i.e., they are generated inductively). This paradigm shift in application development makes it difficult to reason about the behavior of software systems with ML components. This talk will explore how mining software repositories techniques can help support the quality assurance of AI-empowered software systems, and discuss some potential research avenues on this topic.
Bio: Foutse Khomh is a Full Professor of Software Engineering at Polytechnique Montréal, Canada CIFAR AI Chair on Trustworthy Machine Learning Software Systems, and FRQ-IVADO Research Chair on Software Quality Assurance for Machine Learning Applications. He received a Ph.D. in Software Engineering from the University of Montreal in 2011, with the Award of Excellence. He also received a CS-Can/Info-Can Outstanding Young Computer Science Researcher Prize for 2019. His research interests include software maintenance and evolution, machine learning systems engineering, cloud engineering, and dependable and trustworthy ML/AI. His work has received four ten-year Most Influential Paper (MIP) Awards, and six Best/Distinguished Paper Awards. He also served on the steering committee of SANER (chair), MSR, PROMISE, ICPC (chair), and ICSME (vice-chair). He initiated and co-organized the Software Engineering for Machine Learning Applications (SEMLA) symposium and the RELENG (Release Engineering) workshop series. He is co-founder of the NSERC CREATE SE4AI: A Training Program on the Development, Deployment, and Servicing of Artificial Intelligence-based Software Systems, and one of the Principal Investigators of the DEpendable Explainable Learning (DEEL) project. He is on the editorial board of multiple international software engineering journals and is a Senior Member of IEEE.
Tue 17 MayDisplayed time zone: Eastern Time (US & Canada) change
20:00 - 20:50 | MSR 2022 OpeningTechnical Papers / Shadow PC / Hackathon / FOSS Award / MSR Awards / Mining Challenge / Registered Reports / Keynotes / Industry Track / MIP Award / Tutorials / Vision and Reflection / Data and Tool Showcase Track at MSR Plenary room | ||
21:00 - 21:50 | Newcomer Orientation ITechnical Papers / Shadow PC / Hackathon / FOSS Award / MSR Awards / Mining Challenge / Registered Reports / Keynotes / Industry Track / MIP Award / Tutorials / Vision and Reflection / Data and Tool Showcase Track at MSR Newcomer Orientation room Chair(s): Yuan Tian Queens University, Kingston, Canada, Gias Uddin University of Calgary, Canada Mentors: Bram Adams, Fatemeh Fard, Li Li, Ali Ouni, Tianyi Zhang | ||
Wed 18 MayDisplayed time zone: Eastern Time (US & Canada) change
04:00 - 04:50 | Newcomer Orientation IITechnical Papers at MSR Newcomer Orientation room Chair(s): Tegawendé F. Bissyandé SnT, University of Luxembourg, Chaiyong Rakhitwetsagul Mahidol University, Thailand Mentors: Bodin Chinthanet, Raula Gaikovina Kula, Christoph Treude, Xin Xia | ||
11:00 - 11:50 | Keynote: Christian Kästner – From Models to Systems: Rethinking the Role of Software Engineering for Machine LearningTechnical Papers at MSR Plenary room Chair(s): Nicole Novielli University of Bari | ||
12:00 - 12:50 | Mining ChallengeMining Challenge / Technical Papers at MSR Main room - even hours Chair(s): Steffen Herbold TU Clausthal | ||
12:00 4mTalk | An Exploratory Study on Refactoring Documentation in Issues Handling Mining Challenge Eman Abdullah AlOmar Stevens Institute of Technology, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Christian D. Newman Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
12:04 4mTalk | Between JIRA and GitHub: ASFBot and its Influence on Human Comments in Issue Trackers Mining Challenge Ambarish Moharil Eindhoven University of Technology, Dmitrii Orlov Eindhoven University of Technology, Samar Jameel Eindhoven University of Technology, Tristan Trouwen Eindhoven University of Technology, Nathan Cassee Eindhoven University of Technology, Alexander Serebrenik Eindhoven University of Technology Pre-print | ||
12:08 4mTalk | Is Refactoring Always a Good Egg? Exploring the Interconnection Between Bugs and Refactorings Mining Challenge File Attached | ||
12:12 4mTalk | On the Co-Occurrence of Refactoring of Test and Source Code Mining Challenge Pre-print Media Attached | ||
12:16 4mTalk | Refactoring Debt: Myth or Reality? An Exploratory Study on the Relationship Between Technical Debt and RefactoringBest Mining Challenge Paper Award Mining Challenge Anthony Peruma Rochester Institute of Technology, Eman Abdullah AlOmar Stevens Institute of Technology, Christian D. Newman Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print Media Attached | ||
12:20 4mTalk | Studying the Impact of Continuous Delivery Adoption on Bug-Fixing Time in Apache’s Open-Source Projects Mining Challenge Carlos Diego Andrade de Almeida Federal University of Ceará, Diego N. Feijó Federal University of Ceará, Lincoln Rocha Federal University of Ceará Media Attached | ||
12:24 4mTalk | Which bugs are missed in code reviews: An empirical study on SmartSHARK dataset Mining Challenge fatemeh khoshnoud Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Ali Rezaei Nasab Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Zahra Toudeji Department of Computer Science and Engineering and IT; School of Electrical and Computer Engineering, Shiraz University, Ashkan Sami Shiraz University | ||
12:28 22mLive Q&A | Discussions and Q&A Technical Papers |
13:00 - 13:50 | Session 4: Software Quality (Bugs & Smells)Data and Tool Showcase Track / Technical Papers at MSR Main room - odd hours Chair(s): Maxime Lamothe Polytechnique Montreal, Montreal, Canada, Mahmoud Alfadel University of Waterloo | ||
13:00 7mTalk | Dazzle: Using Optimized Generative Adversarial Networks to Address Security Data Class Imbalance Issue Technical Papers Rui Shu North Carolina State University, Tianpei Xia North Carolina State University, Laurie Williams North Carolina State University, Tim Menzies North Carolina State University | ||
13:07 7mTalk | To What Extent do Deep Learning-based Code Recommenders Generate Predictions by Cloning Code from the Training Set? Technical Papers Matteo Ciniselli Università della Svizzera Italiana, Luca Pascarella Università della Svizzera italiana (USI), Gabriele Bavota Software Institute, USI Università della Svizzera italiana Pre-print | ||
13:14 7mTalk | How to Improve Deep Learning for Software Analytics (a case study with code smell detection) Technical Papers Pre-print | ||
13:21 7mTalk | Using Active Learning to Find High-Fidelity Builds Technical Papers Harshitha Menon Lawrence Livermore National Lab, Konstantinos Parasyris Lawrence Livermore National Laboratory, Todd Gamblin Lawrence Livermore National Laboratory, Tom Scogland Lawrence Livermore National Laboratory Pre-print | ||
13:28 4mTalk | ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction Data and Tool Showcase Track Hossein Keshavarz David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada, Mei Nagappan University of Waterloo Pre-print | ||
13:32 4mTalk | ReCover: a Curated Dataset for Regression Testing Research Data and Tool Showcase Track Francesco Altiero Università degli Studi di Napoli Federico II, Anna Corazza Università degli Studi di Napoli Federico II, Sergio Di Martino Università degli Studi di Napoli Federico II, Adriano Peron Università degli Studi di Napoli Federico II, Luigi Libero Lucio Starace Università degli Studi di Napoli Federico II | ||
13:36 14mLive Q&A | Discussions and Q&A Technical Papers |
13:00 - 13:50 | |||
13:00 50mTutorial | Empirical Standards for Repository Mining Tutorials Paul Ralph Dalhousie University, Tushar Sharma Dalhousie University, Preetha Chatterjee Drexel University, USA Pre-print |
14:00 - 14:50 | Session 5: Communication & Domains Data and Tool Showcase Track / Technical Papers at MSR Main room - even hours Chair(s): Masud Rahman Dalhousie University, Mahmoud Alfadel University of Waterloo | ||
14:00 7mTalk | Painting the Landscape of Automotive Software in GitHub Technical Papers Sangeeth Kochanthara Eindhoven University of Technology, Yanja Dajsuren Eindhoven University of Technology, Loek Cleophas Eindhoven University of Technology (TU/e) and Stellenbosch University (SU), Mark van den Brand Eindhoven University of Technology Pre-print Media Attached | ||
14:07 7mFull-paper | Mining the Usage of Reactive Programming APIs: A Study on GitHub and Stack Overflow Technical Papers Carlos Zimmerle Federal University of Pernambuco, Kiev Gama Federal University of Pernambuco, Fernando Castor Utrecht University & Federal University of Pernambuco, José Murilo Filho Federal University of Pernambuco DOI Pre-print | ||
14:14 4mTalk | SoCCMiner: A Source Code-Comments and Comment-Context Miner Data and Tool Showcase Track Murali Sridharan University of Oulu, Mika Mäntylä University of Oulu, Maëlick Claes University of Oulu, Leevi Rantala University of Oulu Pre-print | ||
14:18 4mTalk | SLNET: A Redistributable Corpus of 3rd-party Simulink Models Data and Tool Showcase Track Sohil Lal Shrestha The University of Texas at Arlington, Shafiul Azam Chowdhury University of Texas at Arlington, Christoph Csallner University of Texas at Arlington DOI Pre-print Media Attached | ||
14:22 4mTalk | SOSum: A Dataset of Stack Overflow Post Summaries Data and Tool Showcase Track Bonan Kou Purdue University, Yifeng Di Purdue University, Muhao Chen University of Southern California, Tianyi Zhang Purdue University | ||
14:26 4mTalk | Inspect4py: A Knowledge Extraction Framework for Python Code Repositories Data and Tool Showcase Track | ||
14:30 4mTalk | DISCO: A Dataset of Discord Chat Conversations for Software Engineering Research Data and Tool Showcase Track Keerthana Muthu Subash Carleton University, Canada, Lakshmi Prasanna Kumar Carleton University, Canada, Sri Lakshmi Vadlamani Carleton University, Canada, Preetha Chatterjee Drexel University, USA, Olga Baysal Carleton University DOI Pre-print Media Attached | ||
14:34 16mLive Q&A | Discussions and Q&A Technical Papers |
14:00 - 14:50 | |||
14:00 50mTutorial | Mining the Ethereum Blockchain Platform: Best Practices and Pitfalls Tutorials Gustavo A. Oliva Queen's University |
21:00 - 21:50 | Session 7: Developer Wellbeing & Project CommunicationTechnical Papers / Data and Tool Showcase Track / Industry Track at MSR Main room - odd hours Chair(s): Bram Adams Queen's University, Kingston, Ontario | ||
21:00 7mTalk | On the Violation of Honesty in Mobile Apps: Automated Detection and CategoriesDistinguished Paper Award Technical Papers Humphrey Obie Monash University, Idowu Oselumhe Ilekura Data Science Nigeria, Hung Du Applied Artificial Intelligence Institute, Deakin University, Mojtaba Shahin RMIT University, Australia, John Grundy Monash University, Li Li Monash University, Jon Whittle CSIRO's Data61 and Monash University, Burak Turhan University of Oulu Pre-print | ||
21:07 7mTalk | How heated is it? Understanding GitHub locked issues Technical Papers Isabella Ferreira Polytechnique Montréal, Bram Adams Queen's University, Kingston, Ontario, Jinghui Cheng Polytechnique Montreal Pre-print Media Attached | ||
21:14 4mTalk | The OCEAN mailing list data set: Network analysis spanning mailing lists and code repositories Data and Tool Showcase Track Melanie Warrick University of Vermont, Samuel F. Rosenblatt University of Vermont, Jean-Gabriel Young University of Vermont, amanda casari Open Source Programs Office, Google, Laurent Hébert-Dufresne University of Vermont, James P. Bagrow University of Vermont DOI Pre-print Media Attached | ||
21:18 4mTalk | The Unexplored Treasure Trove of Phabricator Code Reviews Data and Tool Showcase Track Gunnar Kudrjavets University of Groningen, Nachiappan Nagappan Microsoft Research, Ayushi Rastogi University of Groningen, The Netherlands DOI Pre-print | ||
21:22 4mTalk | The Unsolvable Problem or the Unheard Answer? A Dataset of 24,669 Open-Source Software Conference Talks Data and Tool Showcase Track Kimberly Truong Oregon State University, Courtney Miller Carnegie Mellon University, Bogdan Vasilescu Carnegie Mellon University, USA, Christian Kästner Carnegie Mellon University DOI Pre-print | ||
21:26 4mTalk | Exploring Apache Incubator Project Trajectories with APEX Data and Tool Showcase Track Anirudh Ramchandran University of California, Davis, Likang Yin University of California, Davis, Vladimir Filkov University of California at Davis | ||
21:30 7mTalk | A Culture of Productivity: Maximizing Productivity by Maximizing Wellbeing Industry Track Brian Houck Microsoft Research | ||
21:37 13mLive Q&A | Discussions and Q&A Technical Papers |
Thu 19 MayDisplayed time zone: Eastern Time (US & Canada) change
04:00 - 04:50 | Session 9: Scaling & CloudIndustry Track / Registered Reports / Data and Tool Showcase Track / Technical Papers at MSR Main room - even hours Chair(s): Lwin Khin Shar Singapore Management University | ||
04:00 4mTalk | SniP: An Efficient Stack Tracing Framework for Multi-threaded Programs Data and Tool Showcase Track Arun KP Indian Institute of Technology Kanpur, Saurabh Kumar Indian Institute of Technology Kanpur, Debadatta Mishra , Biswabandan Panda Indian Institute of Technology Bombay DOI Pre-print | ||
04:04 4mTalk | Tooling for Time- and Space-efficient git Repository Mining Data and Tool Showcase Track Fabian Heseding Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Willy Scheibel Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Jürgen Döllner Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam | ||
04:08 4mTalk | TSSB-3M: Mining single statement bugs at massive scale Data and Tool Showcase Track Cedric Richter Carl von Ossietzky Universität Oldenburg / University of Oldenburg, Heike Wehrheim Carl von Ossietzky Universität Oldenburg / University of Oldenburg Pre-print Media Attached | ||
04:12 7mTalk | Improved Business Outcomes from Cloud Applications – using Integrated Process and Runtime Product Data Mining Industry Track | ||
04:19 7mTalk | Improve Quality of Cloud Serverless Architectures through Software Repository Mining Industry Track | ||
04:26 4mTalk | Toward Granular Automatic Unit Test Case Generation Registered Reports Fabiano Pecorelli Tampere University, Giovanni Grano LocalStack, Fabio Palomba University of Salerno, Harald C. Gall University of Zurich, Andrea De Lucia University of Salerno Pre-print | ||
04:30 20mLive Q&A | Discussions and Q&A Technical Papers |
10:00 - 10:50 | Virtual CoffeeTechnical Papers at MSR Main room - even hours This session will be for informal conversations on Midspace. | ||
11:00 - 11:50 | Session 11: Machine Learning & Information RetrievalTechnical Papers at MSR Main room - odd hours Chair(s): Phuong T. Nguyen University of L’Aquila | ||
11:00 4mShort-paper | On the Naturalness of Fuzzer Generated Code Technical Papers Rajeswari Hita Kambhamettu Carnegie Mellon University, John Billos Wake Forest University, Carolyn "Tomi" Oluwaseun-Apo Pennsylvania State University, Benjamin Gafford Carnegie Mellon University, Rohan Padhye Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
11:04 7mTalk | Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes Technical Papers DOI Pre-print Media Attached | ||
11:11 7mTalk | Multimodal Recommendation of Messenger Channels Technical Papers Ekaterina Koshchenko JetBrains Research, Egor Klimov JetBrains Research, Vladimir Kovalenko JetBrains Research | ||
11:18 7mTalk | Senatus: A Fast and Accurate Code-to-Code Recommendation Engine Technical Papers Fran Silavong JP Morgan Chase & Co., Sean Moran JP Morgan Chase & Co., Antonios Georgiadis JP Morgan Chase & Co., Rohan Saphal JP Morgan Chase & Co., Robert Otter JP Morgan Chase & Co. DOI Pre-print Media Attached | ||
11:25 7mTalk | Challenges in Migrating Imperative Deep Learning Programs to Graph Execution: An Empirical Study Technical Papers Tatiana Castro Vélez City University of New York (CUNY) Graduate Center, Raffi Khatchadourian City University of New York (CUNY) Hunter College, Mehdi Bagherzadeh Oakland University, Anita Raja City University of New York (CUNY) Hunter College Pre-print Media Attached | ||
11:32 7mTalk | GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses Technical Papers Wei Ma SnT, University of Luxembourg, Mengjie Zhao LMU Munich, Ezekiel Soremekun SnT, University of Luxembourg, Qiang Hu University of Luxembourg, Jie M. Zhang King's College London, Mike Papadakis University of Luxembourg, Luxembourg, Maxime Cordy University of Luxembourg, Luxembourg, Xiaofei Xie Singapore Management University, Singapore, Yves Le Traon University of Luxembourg, Luxembourg Pre-print | ||
11:39 11mLive Q&A | Discussions and Q&A Technical Papers |
12:00 - 12:51 | Vision & Reflections Track: PastTechnical Papers at MSR Plenary room Chair(s): Bram Adams Queen's University, Kingston, Ontario, Shaowei Wang University of Manitoba | ||
12:00 10mTalk | Back to the future: Empirical Revolution(s) in Software Engineering Technical Papers Audris Mockus The University of Tennessee | ||
12:10 10mTalk | Engineering the MSR Field and the Joy of Research Technical Papers Ahmed E. Hassan Queen's University | ||
12:20 10mTalk | It's all in your network: How mining developer collaboration allowed us to peer into complex socio-technical aspects of software development Technical Papers Daniela Damian University of Victoria | ||
12:30 21mOther | Discussion Technical Papers |
13:00 - 13:51 | Vision & Reflections Track: FutureTechnical Papers at MSR Plenary room Chair(s): Bram Adams Queen's University, Kingston, Ontario, Shaowei Wang University of Manitoba | ||
13:00 10mTalk | Bias in MSR research Technical Papers Alexander Serebrenik Eindhoven University of Technology | ||
13:10 10mTalk | The Next Generation of Software Developers Technical Papers Denae Ford Microsoft Research | ||
13:20 10mTalk | Mining Software Repositories in the age of AI Technical Papers Foutse Khomh Polytechnique Montréal | ||
13:30 21mOther | Discussion Technical Papers |
14:00 - 14:50 | MIP Award SessionMIP Award at MSR Plenary room Chair(s): Massimiliano Di Penta University of Sannio, Italy Most Influential Paper: “GHTorrent: Github’s data from a firehose” by Georgios Gousios and Diomidis Spinellis (MSR 2012) for conceiving and maintaining the GHTorrent archive, extensively leveraged by the MSR community. | ||
14:00 50mTalk | MIP Award Talk MIP Award Georgios Gousios Endor Labs & Delft University of Technology, Diomidis Spinellis Athens University of Economics and Business; Delft University of Technology |
22:00 - 22:50 | Foundational Contribution Award SessionTechnical Papers at MSR Plenary room Chair(s): Miryung Kim University of California at Los Angeles, USA | ||
22:00 50mAwards | MSR Foundational Contribution Award Technical Papers |
Fri 20 MayDisplayed time zone: Eastern Time (US & Canada) change
05:00 - 05:30 | Closing Session of Virtual MSR 2022 + Introduction of MSR 2023Technical Papers at MSR Plenary room Speakers: David Lo, Shane McIntosh, Nicole Novielli, Emad Shihab | ||
10:00 - 10:50 | Shadow PC RetrospectiveTechnical Papers at MSR Main room - even hours Chair(s): Eleni Constantinou Eindhoven University of Technology, Sarah Nadi University of Alberta Closed to Shadow PC Members. | ||
12:00 - 12:50 | |||
12:00 50mTutorial | Using Datalore for Reproducible Research Tutorials Jodie Burchell JetBrains |
13:00 - 13:50 | Tutorial: Software Bots in Software Engineering: Benefits and ChallengesTutorials at MSR Tutorials room | ||
13:00 50mTutorial | Software Bots in Software Engineering: Benefits and Challenges Tutorials Mairieli Wessel Delft University of Technology, Marco Gerosa Northern Arizona University, USA, Emad Shihab Concordia University |
14:00 - 15:00 | Session 16: Non-functional Properties (Availability, Security, Legal Aspects)Industry Track / Technical Papers / Registered Reports / Data and Tool Showcase Track at MSR Main room - even hours Chair(s): Maxime Lamothe Polytechnique Montreal, Montreal, Canada, Jin L.C. Guo McGill University | ||
14:00 7mTalk | A Deep Study of the Effects and Fixes of Server-Side Request Races in Web Applications Technical Papers Zhengyi Qiu North Carolina State University, Shudi Shao North Carolina State University, Qi Zhao North Carolina State University, Hassan Ali Khan North Carolina State University, Xinning Hui North Carolina State University, Guoliang Jin North Carolina State University Media Attached | ||
14:07 4mTalk | A Large-scale Dataset of (Open Source) License Text VariantsData and Tool Showcase Award Data and Tool Showcase Track Stefano Zacchiroli Télécom Paris, Polytechnic Institute of Paris DOI Pre-print | ||
14:11 7mTalk | SECOM: Towards a convention for security commit messagesFOSS Impact Paper Award Industry Track Sofia Reis Instituto Superior Técnico, U. Lisboa & INESC-ID, Rui Abreu Faculty of Engineering, University of Porto, Portugal, Hakan Erdogmus Carnegie Mellon University, Corina S. Păsăreanu Carnegie Mellon University Pre-print | ||
14:18 7mTalk | Varangian: A Git Bot for Augmented Static Analysis Industry Track Saurabh Pujar IBM Research, Yunhui Zheng IBM Research, Luca Buratti IBM Research, Burn Lewis IBM Research, Alessandro Morari IBM Research, Jim A. Laredo IBM Research, Kevin Postlethwait Red Hat, Christoph Görn Red Hat | ||
14:25 7mTalk | Detecting Privacy-Sensitive Code Changes with Language Modeling Industry Track Gökalp Demirci Meta Platforms, Inc., Vijayaraghavan Murali Meta Platforms, Inc., Imad Ahmad Meta Platforms, Inc., Rajeev Rao Meta Platforms, Inc., Gareth Ari Aye Meta Platforms, Inc. | ||
14:32 4mTalk | Is GitHub's Copilot as Bad As Humans at Introducing Vulnerabilities in Code? Registered Reports Owura Asare University of Waterloo, Mei Nagappan University of Waterloo, N. Asokan University of Waterloo Pre-print | ||
14:36 7mTalk | Finding the Fun in Fundraising: Public Issues and Pull Requests in VC-backed Open-Core Companies Industry Track Kevin Xu GitHub | ||
14:43 17mLive Q&A | Discussions and Q&A Technical Papers |
Mon 23 MayDisplayed time zone: Eastern Time (US & Canada) change
09:00 - 10:30 | In-Person MSR Opening, Keynote and MIP SessionTechnical Papers / MIP Award at Room 315+316 Chair(s): David Lo Singapore Management University | ||
09:00 20mTalk | In-Person MSR 2022 Opening Session Technical Papers David Lo Singapore Management University, Shane McIntosh University of Waterloo, Nicole Novielli University of Bari | ||
09:20 35mKeynote | From Models to Systems: Rethinking the Role of Software Engineering for Machine Learning Technical Papers Christian Kästner Carnegie Mellon University | ||
09:55 35mTalk | MIP Award Talk MIP Award Georgios Gousios Endor Labs & Delft University of Technology, Diomidis Spinellis Athens University of Economics and Business; Delft University of Technology |
13:30 - 15:00 | Blended Technical Session 2 (Machine Learning and Information Retrieval) Technical Papers / Data and Tool Showcase Track at Room 315+316 Chair(s): Preetha Chatterjee Drexel University, USA | ||
13:30 15mTalk | Methods for Stabilizing Models across Large Samples of Projects(with case studies on Predicting Defect and Project Health) Technical Papers Suvodeep Majumder North Carolina State University, Tianpei Xia North Carolina State University, Rahul Krishna North Carolina State University, Tim Menzies North Carolina State University Pre-print Media Attached | ||
13:45 15mTalk | GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses Technical Papers Wei Ma SnT, University of Luxembourg, Mengjie Zhao LMU Munich, Ezekiel Soremekun SnT, University of Luxembourg, Qiang Hu University of Luxembourg, Jie M. Zhang King's College London, Mike Papadakis University of Luxembourg, Luxembourg, Maxime Cordy University of Luxembourg, Luxembourg, Xiaofei Xie Singapore Management University, Singapore, Yves Le Traon University of Luxembourg, Luxembourg Pre-print | ||
14:00 15mTalk | Senatus: A Fast and Accurate Code-to-Code Recommendation Engine Technical Papers Fran Silavong JP Morgan Chase & Co., Sean Moran JP Morgan Chase & Co., Antonios Georgiadis JP Morgan Chase & Co., Rohan Saphal JP Morgan Chase & Co., Robert Otter JP Morgan Chase & Co. DOI Pre-print Media Attached | ||
14:15 8mShort-paper | Comments on Comments: Where Code Review and Documentation Meet Technical Papers Nikitha Rao Carnegie Mellon University, Jason Tsay IBM Research, Martin Hirzel IBM Research, Vincent J. Hellendoorn Carnegie Mellon University DOI Pre-print File Attached | ||
14:23 8mShort-paper | On the Naturalness of Fuzzer Generated Code Technical Papers Rajeswari Hita Kambhamettu Carnegie Mellon University, John Billos Wake Forest University, Carolyn "Tomi" Oluwaseun-Apo Pennsylvania State University, Benjamin Gafford Carnegie Mellon University, Rohan Padhye Carnegie Mellon University, Vincent J. Hellendoorn Carnegie Mellon University | ||
14:31 8mTalk | SOSum: A Dataset of Stack Overflow Post Summaries Data and Tool Showcase Track Bonan Kou Purdue University, Yifeng Di Purdue University, Muhao Chen University of Southern California, Tianyi Zhang Purdue University | ||
14:39 21mLive Q&A | Discussions and Q&A Technical Papers |
15:30 - 17:00 | Networking & Poster SessionTechnical Papers at Room 315+316 Chair(s): Miikka Kuutila University of Oulu The following are the registered posters:
| ||
Tue 24 MayDisplayed time zone: Eastern Time (US & Canada) change
12:15 - 12:30 | Brainstorming / Discussion ITechnical Papers at Room 315+316 Chair(s): Shane McIntosh University of Waterloo | ||
13:30 - 15:00 | Brainstorming / Discussion II Technical Papers at Room 315+316 Chair(s): Shane McIntosh University of Waterloo | ||
15:30 - 17:00 | Blended Technical Session 5 (Miscellaneous) Technical Papers / Data and Tool Showcase Track / Mining Challenge at Room 315+316 Chair(s): Luís Cruz Deflt University of Technology | ||
15:30 15mTalk | Code Review Practices for Refactoring Changes: An Empirical Study on OpenStack Technical Papers Eman Abdullah AlOmar Stevens Institute of Technology, Moataz Chouchen ETS, Mohamed Wiem Mkaouer Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
15:45 15mTalk | Painting the Landscape of Automotive Software in GitHub Technical Papers Sangeeth Kochanthara Eindhoven University of Technology, Yanja Dajsuren Eindhoven University of Technology, Loek Cleophas Eindhoven University of Technology (TU/e) and Stellenbosch University (SU), Mark van den Brand Eindhoven University of Technology Pre-print Media Attached | ||
16:00 8mTalk | SLNET: A Redistributable Corpus of 3rd-party Simulink Models Data and Tool Showcase Track Sohil Lal Shrestha The University of Texas at Arlington, Shafiul Azam Chowdhury University of Texas at Arlington, Christoph Csallner University of Texas at Arlington DOI Pre-print Media Attached | ||
16:08 8mTalk | SoCCMiner: A Source Code-Comments and Comment-Context Miner Data and Tool Showcase Track Murali Sridharan University of Oulu, Mika Mäntylä University of Oulu, Maëlick Claes University of Oulu, Leevi Rantala University of Oulu Pre-print | ||
16:16 8mTalk | An Exploratory Study on Refactoring Documentation in Issues Handling Mining Challenge Eman Abdullah AlOmar Stevens Institute of Technology, Anthony Peruma Rochester Institute of Technology, Mohamed Wiem Mkaouer Rochester Institute of Technology, Christian D. Newman Rochester Institute of Technology, Ali Ouni ETS Montreal, University of Quebec Pre-print | ||
16:24 8mTalk | Between JIRA and GitHub: ASFBot and its Influence on Human Comments in Issue Trackers Mining Challenge Ambarish Moharil Eindhoven University of Technology, Dmitrii Orlov Eindhoven University of Technology, Samar Jameel Eindhoven University of Technology, Tristan Trouwen Eindhoven University of Technology, Nathan Cassee Eindhoven University of Technology, Alexander Serebrenik Eindhoven University of Technology Pre-print | ||
16:32 28mLive Q&A | Discussions and Q&A Technical Papers |
17:00 - 17:30 | |||