FSE 2025
Mon 23 - Fri 27 June 2025 Trondheim, Norway
Mon 23 Jun 2025 10:30 - 10:50 at Aurora A - Clones Chair(s): Julia Lawall

Business process management is a multi-billion-dollar industry focused on modeling business processes to analyze, improve, and automate them. These processes are interconnected activities an organization follows to achieve its goals. Modeling is essential to understand, manage, and boost organizational efficiency and competitiveness.

Characterizing modeling artifacts is essential in empirical software engineering, with prior studies exploring domains like machine learning, automotive, video games, and bots. Given the rise in organizational use of process models, we focus on business process models. To our knowledge, no characterization of open-source business process models exists in the literature, which we aim to address by answering the following research question:

RQ1: What is the landscape of business process models in open source?

By characterizing business process models in open-source environments, we aim to inform (a) the development of tools suited to the unique needs of open-source business process management projects, (b) influence best practices for model creation and maintenance, and (c) contribute insights for refining business process modeling standardization efforts and future revisions.

Our study shows that business process models on GitHub span at least 16 domains, with machine learning, traditional software, sales, business services, and financial services being the most common. Business process model adoption is rising, reflected by the steady increase in model creation and updates. While organization-owned repositories contribute significantly (approximately 16%), most models (approximately 84%) are hosted in individual-owned repositories. Additionally, we find a strong reliance on tools from industry-leading vendors like Drools, Camunda, Activiti, and SAP Signavio, underscoring their importance in the open-source business process modeling community.

Our characterization explores both the diversity and reuse of business process models, emphasizing cloning as a critical factor impacting model quality. Initial exploration of GitHub models reveals significant duplication, which can complicate maintenance, evolution, and quality assurance. Although clone detection is well-studied in software engineering, especially for source code, an analysis of business process model similarity in open-source settings is still missing. Thus, we expand our business process model characterization by addressing our second research question:

RQ2: What is the extent of cloning in business process models in open source?

We find that the majority of clones (approximately 80%) in business process models were found to be exact clones, with cross-project clones being the most common (approximately 57%) and predominantly associated with industry-owned (as opposed to academia-owned) repositories. An analysis of the model-level clones indicates that machine learning, traditional software, and business services are the top three domains in the dataset. A further clone detection analysis on subprocess elements within the models reveals a substantial degree (approximately 92%) of cloning among these model fragments. It also shows that all the detected subprocess clones have at least one exact clone.

In summary, the primary contributions of this study are:

First study characterizing open-source business process models on GitHub. First study on business process model and subprocess-level cloning on GitHub. A unique dataset with tagged, distinct, non-trivial business process models, their similarities, and distinct subprocesses.

Mon 23 Jun

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

10:30 - 12:30
10:30
20m
Talk
An empirical study of business process models and model clones on GitHub
Journal First
Mahdi Saeedi Nikoo Eindhoven University of Technology, Sangeeth Kochanthara Netherlands' Space Obervatory - ASTRON, Önder Babur Eindhoven University of Technology, Mark van den Brand Eindhoven University of Technology
10:50
20m
Talk
The Struggles of LLMs in Cross-lingual Code Clone Detection
Research Papers
Micheline Bénédicte MOUMOULA University of Luxembourg, Abdoul Kader Kaboré University of Luxembourg, Jacques Klein University of Luxembourg, Tegawendé F. Bissyandé University of Luxembourg
DOI
11:10
20m
Talk
Clone Detection for Smart Contracts: How Far Are We?
Research Papers
Zuobin Wang Zhejiang University, Zhiyuan Wan Zhejiang University, Yujing Chen Zhejiang University, Yun Zhang Hangzhou City University, David Lo Singapore Management University, Difan Xie Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security, Xiaohu Yang Zhejiang University
DOI
11:30
20m
Talk
Measuring Model Alignment for Code Clone Detection Using Causal Interpretation
Journal First
Shamsa Abid National University of Computer and Emerging Sciences, Xuemeng Cai Singapore Management University, Lingxiao Jiang Singapore Management University
11:50
20m
Talk
An Empirical Study of Code Clones from Commercial AI Code Generators
Research Papers
Weibin Wu Sun Yat-sen University, Haoxuan Hu Sun Yat-sen University, China, Zhaoji Fan Sun Yat-sen University, Yitong Qiao Sun Yat-sen University, China, Yizhan Huang The Chinese University of Hong Kong, Yichen LI The Chinese University of Hong Kong, Zibin Zheng Sun Yat-sen University, Michael Lyu Chinese University of Hong Kong
DOI
12:10
20m
Talk
VexIR2Vec: An Architecture-Neutral Embedding Framework for Binary Similarity
Journal First
S. VenkataKeerthy IIT Hyderabad, Soumya Banerjee IIT Hyderabad, Sayan Dey IIT Hyderabad, Yashas Andaluri IIT Hyderabad, Raghul PS IIT Hyderabad, Subrahmanyam Kalyanasundaram IIT Hyderabad, Fernando Magno Quintão Pereira Federal University of Minas Gerais, Ramakrishna Upadrasta IIT Hyderabad

Information for Participants
Mon 23 Jun 2025 10:30 - 12:30 at Aurora A - Clones Chair(s): Julia Lawall
Info for room Aurora A:

Aurora A is the first room in the Aurora wing.

When facing the main Cosmos Hall, access to the Aurora wing is on the right, close to the side entrance of the hotel.