De-duplicating Silent Compiler Bugs via Deep Semantic Representation
The compiler bug duplication problem (where many test failures are caused by the same compiler bug) can lead to huge waste of time and resource in diagnosing test failures produced by compiler testing. It is particularly challenging with regard to the silent compiler bugs that do not produce any error messages. To address this problem, multiple white-box techniques were proposed, but they are inapplicable in many practical scenarios. Black-box techniques are more practical, but the existing ones are less effective as they often rely on irrelevant syntactic information. To bridge this gap, we propose a novel black-box technique (BLADE), which aims to improve the effectiveness of black-box de-duplication by extracting failure-relevant semantic information from failure-triggering test programs in a black-box manner. It first learns failure-relevant semantic information based on intermediate representation learning by employing the classification of failure-triggering and failure-free test programs as the auxiliary objective, and then extracts such information based on model interpretation. Our experiments on four widely-used datasets (collected from GCC and LLVM) show that BLADE significantly outperforms the two existing black-box techniques with an average improvement of 36% and 12% in identifying unique silent compiler bugs when analyzing the same number of test failures respectively, and achieves competitive effectiveness with the state-of-the-art white-box techniques.
Wed 25 JunDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
11:00 - 12:30 | CompilerResearch Papers / Journal First / Ideas, Visions and Reflections at Cosmos 3C Chair(s): Na Meng Virginia Tech | ||
11:00 20mTalk | De-duplicating Silent Compiler Bugs via Deep Semantic Representation Research Papers Junjie Chen Tianjin University, Xingyu Fan Tianjin University, Chen Yang Tianjin University, Shuang Liu Renmin University of China, Jun Sun Singapore Management University DOI | ||
11:20 20mTalk | DiSCo: Towards Decompiling EVM Bytecode to Source Code using Large Language Models Research Papers Xing Su National Key Lab for Novel Software Technology, Nanjing University, China, Hanzhong Liang National Key Lab for Novel Software Technology, Nanjing University, China, Hao Wu , Ben Niu State Key Laboratory of Information Security, Institute of Information Engineering, China, Fengyuan Xu National Key Lab for Novel Software Technology, Nanjing University, China, Sheng Zhong National Key Lab for Novel Software Technology, Nanjing University, China DOI | ||
11:40 20mTalk | Compiler Autotuning through Multiple Phase Learning Journal First | ||
12:00 20mTalk | PDCAT: Preference-Driven Compiler Auto-Tuning Research Papers Mingxuan Zhu Peking University, Zeyu Sun Institute of Software, Chinese Academy of Sciences, Dan Hao Peking University DOI | ||
12:20 10mTalk | Compiler Optimization Testing Based on Optimization-Guided Equivalence Transformations Ideas, Visions and Reflections Jingwen Wu Shandong University, Jiajing Zheng Shandong University, Zhenyu Yang Shandong University, Zhongxing Yu Shandong University |
Cosmos 3C is the third room in the Cosmos 3 wing.
When facing the main Cosmos Hall, access to the Cosmos 3 wing is on the left, close to the stairs. The area is accessed through a large door with the number “3”, which will stay open during the event.