SANER 2026
Tue 17 - Fri 20 March 2026 Limassol, Cyprus

Large language models have recently improved in code generation and comprehension, but they remain brittle when handling obfuscated code with degraded structural and semantic cues. Code obfuscation, widely used to resist reverse engineering, hampers malware analysis, vulnerability detection, and software maintenance. Recovering the original semantics of obfuscated programs is therefore an important yet understudied problem. This work analyzes how and why model performance degrades on obfuscated code, and further proposes a prototype prompt guard mechanism to validate ideas for improving the reliability of semantic restoration. Using a Java based ProGuard dataset that we constructed from 857 valid samples, we first found that functional or structural degradation occurred in about 32% of baseline outputs. We then applied three progressively constrained prompts (V1–V3) designed to preserve API signatures, control flow, and semantic integrity. When evaluated on 43 low performing baseline samples (< 80), the proportion of low quality outputs decreased from 100% to 34.9%, representing a relative reduction of approximately 65%. This targeted improvement also translated to the full dataset: the global proportion of low performing samples decreased from 30% to 18%, yielding an overall relative reduction of about 40%. Furthermore, the number of high quality outputs (≥ 90) increased from 0 to 18, indicating substantial gains in both stability and readability. Our findings quantitatively reveal the reliability limits of LLM based code de-obfuscation and demonstrate that prompt level structural constraints alone can markedly enhance restoration accuracy and coherence. This work provides empirical evidence and methodological foundations for advancing automated code recovery, reverse engineering, and security analysis using LLMs.

Wed 18 Mar

Displayed time zone: Athens change

16:00 - 17:30
16:00
15m
Talk
Feedback Loops and Code Perturbations in LLM-based Software Engineering: A Case Study on a C-to-Rust Translation System
Industrial Track
16:15
15m
Talk
Efficient Translation of Long Code Blocks using Large Language Models
Research Track
Venkatesan Chakaravarthy IBM Research - India, Anamitra Roy Choudhury IBM, Vini Kanvar IBM Research, Rami Katan IBM Research Haifa, Shivmaran Pandian IBM Research - India, Aditya Raghuvanshi International Institute of Information Technology, Hyderabad, Yogish Sabharwal IBM Research - India
16:30
15m
Talk
Translating Code with Large Language Models and Human-in-the-loop feedback
Journal First Track
Gabriele Dario De Siano University of Naples Federico II, Anna Rita Fasolino Federico II University of Naples, Giancarlo Sperlì University of Naples Federico II, Andrea Vignali University of Naples Federico II
16:45
15m
Talk
Migrating Esope to Fortran 2008 using model transformations
Industrial Track
Younoussa Sow DTIPD Framatome, Nicolas Anquetil University of Lille, Lille, France, Léandre Brault , Stéphane Ducasse Inria; University of Lille; CNRS; Centrale Lille; CRIStAL
17:00
15m
Talk
Refining LLM-based COBOL-to-Java Translation via Natural Language Summary Augmentation
Industrial Track
Aman Bhardwaj IBM Research - India, Vijay Arya IBM Research, Yogish Sabharwal IBM Research - India
17:15
7m
Talk
Toward Reliable Code De-obfuscation with Large Language Models
Short Papers and Posters Track
Yujeong Choi Duksung Women’s University, Dohwan Ji Hanbat National University, Yujin Kwon Duksung Women’s University, Jinyoung Kim Sungkyunkwan University
17:22
7m
Talk
LLMs as Idiomatic Decompilers: Recovering High-Level Code from x86-64 Assembly for Dart
Early Research Achievement (ERA) Track