DiSCo: Towards Decompiling EVM Bytecode to Source Code using Large Language Models
Understanding Ethereum smart contract bytecode is essential for ensuring cryptoeconomic security. However, existing decompilers primarily convert bytecode into pseudocode, which general users cannot easily comprehend. This can lead to misunderstanding of contract behavior and increased vulnerability to scams or exploits. In this paper, we propose DiSCo, the first LLM-based EVM decompilation pipeline, which aims to enable LLMs to understand opaque bytecode and lift it into smart contract source code. DiSCo introduces three core techniques. First, a logic-invariant intermediate representation re-projects the low-level bytecode into high-level abstracted units. Second, semantic enhancement based on a novel type-aware graph model infers the variables stripped during compilation, improving the quality of the lifted code. Third, a flexible method incorporates code specifications to construct LLM-comprehensible prompts for source code generation. Extensive experiments show that the generated code reaches a compilability rate of 75%, with a differential fuzzing pass rate averaging 50%. Manual validation further indicates that the generated Solidity contracts significantly outperform baseline methods in tasks such as code comprehension and attack reproduction.
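To make the "differential fuzzing pass rate" metric concrete, the following is a minimal Python sketch of the general idea: feed the same inputs to a reference implementation and to a regenerated one, and count a pair as passing only if every observed outcome matches. It is not DiSCo's harness. The fee functions, the input set, and the names `observe`, `behaves_the_same`, and `pass_rate` are hypothetical stand-ins chosen for illustration; a real evaluation would execute the original and recompiled bytecode in an EVM.

```python
import random

UINT256_MAX = 2**256 - 1  # largest value that fits in one EVM word


def observe(fn, args):
    """Run fn and record either its result or the kind of failure (a 'revert')."""
    try:
        return ("ok", fn(*args))
    except Exception as exc:
        return ("revert", type(exc).__name__)


def behaves_the_same(reference, candidate, inputs):
    """True if candidate matches reference on every input."""
    return all(observe(reference, a) == observe(candidate, a) for a in inputs)


def pass_rate(pairs, inputs):
    """Fraction of (reference, candidate) pairs that agree on all inputs."""
    inputs = list(inputs)
    passed = sum(behaves_the_same(ref, cand, inputs) for ref, cand in pairs)
    return passed / len(pairs)


# Hypothetical stand-in for the original contract function: a 3% fee.
def original_fee(amount):
    if amount > UINT256_MAX // 3:
        raise OverflowError("multiplication overflow")
    return amount * 3 // 100


# Hypothetical regenerated version that is written differently but equivalent.
def faithful_fee(amount):
    tripled = amount * 3
    if tripled > UINT256_MAX:
        raise OverflowError("multiplication overflow")
    return tripled // 100


# Hypothetical regenerated version with a reordering bug (divides first).
def broken_fee(amount):
    if amount > UINT256_MAX // 3:
        raise OverflowError("multiplication overflow")
    return amount // 100 * 3


# Fixed edge cases plus seeded random 256-bit values, one argument per call.
rng = random.Random(0)
inputs = [(v,) for v in (0, 1, 50, 99, 100, UINT256_MAX)]
inputs += [(rng.randrange(0, UINT256_MAX + 1),) for _ in range(200)]

rate = pass_rate([(original_fee, faithful_fee), (original_fee, broken_fee)], inputs)
print(rate)  # 0.5: the faithful version passes, the broken one fails on amount=50
```

Treating a raised exception as an observable outcome mirrors how a revert in the original contract should also be a revert in the regenerated one. The edge cases are included explicitly because uniformly random 256-bit inputs almost never land on small values where rounding differences show up.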
Wed 25 Jun (displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna)
11:00 - 12:30 | Compiler (Research Papers / Journal First / Ideas, Visions and Reflections) at Cosmos 3C | Chair(s): Na Meng, Virginia Tech
11:00 | 20m | Talk | De-duplicating Silent Compiler Bugs via Deep Semantic Representation | Research Papers | Junjie Chen (Tianjin University), Xingyu Fan (Tianjin University), Chen Yang (Tianjin University), Shuang Liu (Renmin University of China), Jun Sun (Singapore Management University)
11:20 | 20m | Talk | DiSCo: Towards Decompiling EVM Bytecode to Source Code using Large Language Models | Research Papers | Xing Su (National Key Lab for Novel Software Technology, Nanjing University, China), Hanzhong Liang (National Key Lab for Novel Software Technology, Nanjing University, China), Hao Wu, Ben Niu (State Key Laboratory of Information Security, Institute of Information Engineering, China), Fengyuan Xu (National Key Lab for Novel Software Technology, Nanjing University, China), Sheng Zhong (National Key Lab for Novel Software Technology, Nanjing University, China)
11:40 | 20m | Talk | Compiler Autotuning through Multiple Phase Learning | Journal First
12:00 | 20m | Talk | PDCAT: Preference-Driven Compiler Auto-Tuning | Research Papers | Mingxuan Zhu (Peking University), Zeyu Sun (Institute of Software, Chinese Academy of Sciences), Dan Hao (Peking University)
12:20 | 10m | Talk | Compiler Optimization Testing Based on Optimization-Guided Equivalence Transformations | Ideas, Visions and Reflections | Jingwen Wu (Shandong University), Jiajing Zheng (Shandong University), Zhenyu Yang (Shandong University), Zhongxing Yu (Shandong University)
Cosmos 3C is the third room in the Cosmos 3 wing.
When facing the main Cosmos Hall, access to the Cosmos 3 wing is on the left, close to the stairs. The area is accessed through a large door with the number “3”, which will stay open during the event.