Exploring and Understanding Cross-service Code Clones in Microservice Projects (ICPC 2022 - Research)

Who

Yang Zhao, Ran Mo, Yao Zhang, Siyuan Zhang, Pu Xiong

Track

ICPC 2022 Research

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).


When

Mon 16 May 2022 22:18 - 22:25 at ICPC room - Session 10: Code Clones Chair(s): Chaiyong Ragkhitwetsagul

Abstract

The microservice architecture is a style that decomposes complex software into loosely coupled services that can be developed, maintained, and deployed independently. In recent years, it has drawn increasing attention from both industry and academia. Many companies, such as Google, Netflix, Amazon, and IBM, have applied the microservice architecture in their projects. Researchers have also studied microservices from different directions, such as microservice extraction, fault localization, and code quality analysis. Recent work has shown that cross-service code clones are prevalent in microservice projects and cause considerable co-modification among different services, which undermines the independence of microservices. However, no systematic study has revealed the underlying reasons for the emergence of such clones. In this paper, we first build a dataset consisting of 2,722 pairs of cross-service clones from 22 open-source microservice projects. We then manually inspect the implementations of the files and methods involved in cross-service clones to understand why the clones are introduced. In the file-level analysis, we categorize files into three types: DPFile (Data-processing File), DRFile (Data-related File), and DIFile (Data-irrelevant File), and show that DRFiles are more likely to encounter cross-service clones. For each type of file, we further classify the files into specific cases. Each case describes the characteristics of the involved files and why the clones occur. In the method-level analysis, we extract information from the code of the involved methods. On this basis, we propose a catalog containing 4 categories with 10 subcategories of method-level implementations that result in cross-service clones. We believe our analyses provide fundamental knowledge of cross-service clones, which can help developers better manage and resolve such clones in microservice projects.
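To make the term "cross-service clone" concrete, here is a minimal sketch of how clone pairs reported by some clone detector might be filtered down to those that span two services. It is not the authors' tooling. The `ClonePair` type, the `service_of` helper, the example paths, and the convention that a file's top-level directory names its service are all assumptions made for illustration.

```python
from pathlib import PurePosixPath
from typing import List, NamedTuple


class ClonePair(NamedTuple):
    """Hypothetical record for one clone pair (paths relative to the repo root)."""
    left: str
    right: str


def service_of(path: str) -> str:
    # Assumed convention: the top-level directory is the service name.
    return PurePosixPath(path).parts[0]


def cross_service_pairs(pairs: List[ClonePair]) -> List[ClonePair]:
    # Keep only pairs whose two files belong to different services.
    return [p for p in pairs if service_of(p.left) != service_of(p.right)]


# Made-up example paths, not taken from the paper's dataset.
pairs = [
    ClonePair("orders/src/dto/User.java", "billing/src/dto/User.java"),
    ClonePair("orders/src/util/A.java", "orders/src/util/B.java"),
]
result = cross_service_pairs(pairs)
print(result)
```

In this sketch only the first pair survives, because its two files sit under different top-level directories. Real projects may need a different way to map files to services, for example build modules or deployment descriptors.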

Yang Zhao

Central China Normal University

China

Ran Mo

Central China Normal University

China

Yao Zhang

Central China Normal University

Siyuan Zhang

Central China Normal University

Pu Xiong

Central China Normal University


Session Program

Mon 16 May
Displayed time zone: Eastern Time (US & Canada)

22:00 - 22:50	Session 10: Code Clones (Research / Early Research Achievements (ERA)) at ICPC room. Chair(s): Chaiyong Ragkhitwetsagul, Mahidol University, Thailand

22:00 7m Talk	C4: Contrastive Cross-Language Code Clone Detection (Research)
	Chenning Tao (Zhejiang University), Qi Zhan (Zhejiang University), Xing Hu (Zhejiang University), Xin Xia (Huawei Software Engineering Application Technology Lab)
22:07 7m Talk	Predicting Change Propagation between Code Clone Instances by Graph-based Deep Learning (Research)
	Bin Hu (Fudan University), Yijian Wu (Fudan University), Xin Peng (Fudan University), Chaofeng Sha (Fudan University), Xiaocheng Wang (Fudan University), Baiqiang Fu (Fudan University), Wenyun Zhao (Fudan University, China)
22:14 4m Talk	An Exploratory Study of Analyzing JavaScript Online Code Clones (Early Research Achievements (ERA))
	Md Rakib Hossain Misu (University of California, Irvine), Abdus Satter (University of Dhaka)
22:18 7m Talk	Exploring and Understanding Cross-service Code Clones in Microservice Projects (Research)
	Yang Zhao (Central China Normal University), Ran Mo (Central China Normal University), Yao Zhang (Central China Normal University), Siyuan Zhang (Central China Normal University), Pu Xiong (Central China Normal University)
22:25 7m Talk	MSCCD: Grammar Pluggable Clone Detection Based on ANTLR Parser Generation (Research)
	Wenqing ZHU (Nagoya University), Norihiro Yoshida (Ritsumeikan University), Toshihiro Kamiya (Shimane University), Eunjong Choi (Kyoto Institute of Technology), Hiroaki Takada (Nagoya University)
22:32 7m Talk	Algorithm Identification in Programming Assignments (Research)
	Pranshu Chourasia (Indian Institute of technology - Bombay), Ganesh Ramakrishnan (Indian Institute of technology - Bombay), Varsha Apte (Indian Institute of technology - Bombay), Suraj Kumar (Indian Institute of technology - Bombay)
22:39 11m Live Q&A	Q&A-Paper Session 10 (Research)
