Leveraging Artificial Intelligence on Binary Code Comprehension
Understanding binary code is an essential but complex software engineering task for reverse engineering, malware analysis, and compiler optimization. Unlike source code, binary code has limited semantic information, which makes it challenging for human comprehension. At the same time, compiling source to binary code, or transpiling among different programming languages (PLs) can provide a way to introduce external knowledge into binary comprehension. We propose to develop Artificial Intelligence (AI) models that aid human comprehension of binary code. Specifically, we propose to incorporate domain knowledge from large corpora of source code (e.g., variable names, comments) to build AI models that capture a generalizable representation of binary code. Lastly, we will investigate metrics to assess the performance of models that apply to binary code by using human studies of comprehension.
Yifan is a researcher focusing on AI for Software Engineering (AI4SE), Graph Data Mining, and Domain Generalization. For the time being, he is pursuing a Ph.D. in Computer Science at Vanderbilt University, affiliated with Institute for Software Integrated Systems.
We are hiring Ph.D., Post-Doc and Research Intern. Feel free to check the recruitment documents if you are interested in:
Prof. Leach’s Lab: https://kjl.name/recruitment.pdf
Prof. Huang’s Lab: https://yuhuang-lab.github.io/index_files/Huang-Recruitment.pdf
Mon 10 OctDisplayed time zone: Eastern Time (US & Canada) change
10:30 - 12:00 | |||
10:30 30mDoctoral symposium paper | Leveraging Artificial Intelligence on Binary Code Comprehension Doctoral Symposium Yifan Zhang Vanderbilt University | ||
11:00 30mDoctoral symposium paper | Assessment of Automated (Intelligent) Toolchains Doctoral Symposium Aurora Papotti Vrije Universiteit Amsterdam | ||
11:30 30mDoctoral symposium paper | Extraction and Management of Rationale Doctoral Symposium Mouna Dhaouadi University of Montreal |