CCGraph: a PDG-based code clone detector with approximate graph matching
Software clone detection is an active research area that is important for software maintenance, bug detection, and related tasks. Two pieces of cloned code exhibit similarity or equivalence in the syntax or structure of their code representations. Code can be represented in many ways, for example as tokens, an AST (abstract syntax tree), or a PDG. The PDG (Program Dependency Graph) of source code can capture both syntactic and structural information. However, most existing PDG-based tools are time-consuming and miss many clones because they detect clones by exact graph matching using subgraph isomorphism. In this paper, we propose a novel PDG-based code clone detector, CCGraph, that uses graph kernels. First, we normalize the structure of PDGs and design a two-stage filtering strategy based on the characteristic vectors of the code. Then we detect code clones with an approximate graph matching algorithm based on a reformed WL (Weisfeiler-Lehman) graph kernel. Experimental results show that CCGraph retains high accuracy, achieves better recall and F1-score, and detects more unique clones than two related state-of-the-art tools. In addition, CCGraph is much more efficient than existing PDG-based tools.
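To make the idea of approximate graph matching with a graph kernel more concrete, here is a minimal sketch of the standard Weisfeiler-Lehman subtree kernel on small labelled directed graphs. It is not CCGraph's reformed kernel, and it omits the PDG normalization and two-stage filtering described in the abstract. The function names (`wl_features`, `wl_similarity`), the graph encoding (a successor map plus a node-label map), and the statement labels in the toy graphs are all assumptions chosen for illustration.

```python
from collections import Counter


def wl_features(adj, labels, iterations=2):
    """Count the node labels produced over several WL refinement rounds.

    adj:    dict mapping each node to a list of its successor nodes
            (every node must appear as a key, even with no successors).
    labels: dict mapping each node to its initial label (a string).
    """
    current = dict(labels)
    features = Counter(current.values())
    for _ in range(iterations):
        refined = {}
        for node, successors in adj.items():
            # New label = own label plus the sorted labels of the successors.
            neighbourhood = tuple(sorted(current[s] for s in successors))
            refined[node] = (current[node], neighbourhood)
        current = refined
        features.update(current.values())
    return features


def wl_similarity(g1, g2, iterations=2):
    """Cosine-normalised WL subtree kernel value in [0, 1]."""
    f1 = wl_features(g1[0], g1[1], iterations)
    f2 = wl_features(g2[0], g2[1], iterations)
    dot = sum(f1[k] * f2[k] for k in f1.keys() & f2.keys())
    norm = (sum(v * v for v in f1.values())
            * sum(v * v for v in f2.values())) ** 0.5
    return dot / norm if norm else 0.0


# Hypothetical toy "PDGs": nodes are statements, edges are dependencies.
g_a = ({0: [1], 1: [2], 2: []},
       {0: "decl", 1: "assign", 2: "return"})
g_b = ({"x": ["y"], "y": ["z"], "z": []},
       {"x": "decl", "y": "assign", "z": "return"})
g_c = ({0: [1], 1: [2], 2: [3], 3: []},
       {0: "decl", 1: "assign", 2: "call", 3: "return"})

same = wl_similarity(g_a, g_b)     # identical structure and labels
similar = wl_similarity(g_a, g_c)  # one extra statement inserted
```

Unlike subgraph isomorphism, which gives a yes-or-no answer, the kernel returns a graded score, so a pair such as `g_a` and `g_c` that differs by one inserted statement still receives partial credit. A detector built this way would report a pair as a clone when the score exceeds a chosen threshold; the threshold value is a tuning decision not specified here.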
Thu 24 Sep. Displayed time zone: (UTC) Coordinated Universal Time
02:20 - 03:20 | Software Analysis (3) | Research Papers / Industry Showcase at Kangaroo | Chair(s): Rahul Purandare (IIIT-Delhi)
02:20 | 20m Talk | CCGraph: a PDG-based code clone detector with approximate graph matching | Research Papers | Yue Zou (Department of Computer Science and Technology, University of Science and Technology of China), Bihuan Ban (School of Data Science, University of Science and Technology of China), Yinxing Xue, Yun Xu (University of Science and Technology of China)
02:40 | 20m Talk | Towards Generating Thread-Safe Classes Automatically | Research Papers | Haichi Wang (College of Intelligence and Computing, Tianjin University), Zan Wang (Tianjin University, China), Jun Sun (Singapore Management University), Shuang Liu (Tianjin University, China), Ayesha Sadiq (Monash University), Yuan-Fang Li (Monash University)
03:00 | 20m Talk | Industry practice of JavaScript Dynamic Analysis on WeChat Mini-Programs | Industry Showcase | Yi Liu (Southern University of Science and Technology), Jinhui Xie (Tencent Inc.), Jianbo Yang (Tencent Inc.), Shiyu Guo (Tencent Inc.), Yuetang Deng (Tencent, Inc.), Shuqing Li (Southern University of Science and Technology), Yechang Wu (Southern University of Science and Technology), Yepang Liu (Southern University of Science and Technology)