Write a Blog >>
ISSTA 2020
Sat 18 - Wed 22 July 2020
Wed 22 Jul 2020 15:50 - 16:10 at Zoom - NUMERICAL SOFTWARE ANALYSIS & CLONE DETECTION Chair(s): Darko Marinov

In order to speed up spreadsheet development productivity, end users can create a spreadsheet table by copying and modifying an existing one. These two tables share the similar computational semantics, and form a table clone. End users may modify the tables in a table clone, e.g., adding new rows and deleting columns, thus introducing structure changes into the table clone. Our empirical study on real-world spreadsheets shows that about 58.5% of table clones involve structure changes. However, existing table clone detection approaches in spreadsheets can only detect table clones with the same structures. Therefore, many table clones with structure changes cannot be detected.

We observe that, although the tables in a table clone may be modified, they usually share the similar structures and formats, e.g., headers, formulas and background colors. Based on this observation, we propose LTC (Learning to detect Table Clones), to automatically detect table clones with or without structure changes. LTC utilizes the structure and format information from labeled table clones and non table clones to train a binary classifier. LTC first identifies tables in spreadsheets, and then uses the trained binary classifier to judge whether every two tables can form a table clone. Our experiments on real-world spreadsheets from the EUSES and Enron corpora show that, LTC can achieve a precision of 97.8% and recall of 92.1% in table clone detection, significantly outperforming the state-of-the-art technique (a precision of 37.5% and recall of 11.1%).

Wed 22 Jul

Displayed time zone: Tijuana, Baja California change

14:50 - 16:10
NUMERICAL SOFTWARE ANALYSIS & CLONE DETECTIONTechnical Papers at Zoom
Chair(s): Darko Marinov University of Illinois at Urbana-Champaign

Public Live Stream/Recording. Registered participants should join via the Zoom link distributed in Slack.

14:50
20m
Talk
Discovering Discrepancies in Numerical LibrariesArtifacts Evaluated – ReusableArtifacts AvailableArtifacts Evaluated – FunctionalDistinguished Artifact
Technical Papers
Jackson Vanover University of California, Davis, Xuan Deng University of California, Davis, Cindy Rubio-González University of California, Davis
DOI Media Attached
15:10
20m
Talk
Testing High Performance Numerical Simulation Programs: Experience, Lessons Learned, and Open Issues
Technical Papers
Xiao He University of Science and Technology Beijing, China, Xingwei Wang , Jia Shi , Yi Liu
DOI Media Attached
15:30
20m
Talk
Functional Code Clone Detection with Syntax and Semantics Fusion LearningArtifacts Evaluated – ReusableArtifacts AvailableArtifacts Evaluated – Functional
Technical Papers
Chunrong Fang Nanjing University, Zixi Liu Nanjing University, Yangyang Shi , Jeff Huang Texas A&M University, Qingkai Shi The Hong Kong University of Science and Technology
DOI Media Attached
15:50
20m
Talk
Learning to Detect Table Clones in Spreadsheets
Technical Papers
Yakun Zhang Institute of software, Chinese academy of sciences, Wensheng Dou Institute of Software, Chinese Academy of Sciences, Jiaxin Zhu Institute of Software at Chinese Academy of Sciences, China, Liang Xu , Zhiyong Zhou Institute of Software, Chinese Academy of Sciences, Jun Wei State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences & University of Chinese Academy of Sciences, Dan Ye , Bo Yang
DOI Media Attached