MARS: Few-Shot Android Malware Detection with RAG-Enhanced LLMs
Android malware threats have expanded significantly in recent years and now endanger user privacy and device security across billions of devices worldwide. Traditional supervised detection methods excel when substantial training data exists but encounter major difficulties in scenarios with limited labeled samples. These approaches fail to generalize in few-shot environments and suffer from both overfitting and underfitting problems. To this end, we present MARS, a novel framework that bridges the knowledge gap between general-purpose language models and domain-specific security expertise for effective malware detection. MARS integrates Core-set optimization with Retrieval-Augmented Generation (RAG)-enhanced large language model inference to address the challenges of malware classification in limited-data scenarios. Our approach implements a three-stage process: it applies Core-set selection to extract representative samples from each malware family, encodes security expert-defined feature-label associations into a structured rule repository, and employs Chain-of-Thought reasoning to interpret malicious behaviors while constraining outputs to align with expert knowledge. When evaluated on benchmark datasets, MARS demonstrates substantial performance improvements over traditional approaches and offers a promising solution to the persistent challenge of malware detection with limited training data. The main source code is publicly available at https://anonymous.4open.science/r/Mars-CDB9.
Mon 13 AprDisplayed time zone: Brasilia, Distrito Federal, Brazil change
09:45 - 10:30 | |||
09:45 5mTalk | Understanding and Mitigating Library-Related Issues in LLM-Generated Code Journal Ahead Workshop (JAWs) Yacine Majdoub University of Gabes, Rinad Hamid University of Calgary, Canada, Eya Ben Charrada University of Gabes, Ahmad Abdellatif University of Calgary, Haifa Touati IReSCoMath Research Lab, Faculty of Sciences, University Of Gabes, Tunisia | ||
09:50 5mTalk | Magnifying Inefficiency: How LLMs Amplify Performance Anti-Patterns in Mobile Development Journal Ahead Workshop (JAWs) | ||
09:55 5mTalk | BRACE: Unified Benchmarking of Accuracy and Energy for Code Language Models Journal Ahead Workshop (JAWs) Mohammadjavad Mehditabar Dalhousie University, Saurabhsingh Rajput Dalhousie University, Antonio Mastropaolo William and Mary, USA, Tushar Sharma Dalhousie University Pre-print File Attached | ||
10:00 5mTalk | Learning Model Mutations From Faults in Deep Learning Journal Ahead Workshop (JAWs) Zaheed Ahmed Institute of Computer Science, University of Göttingen, Lower Saxony, Germany, Philip Makedonski Institute of Computer Science, University of Göttingen, Lower Saxony, Germany, Jens Grabowski Media Attached | ||
10:05 5mTalk | Artificial or Just Artful? Do LLMs Bend the Rules in Programming? Journal Ahead Workshop (JAWs) Oussama Ben Sghaier Queen's University, Kévin Delcourt Université de Montréal, Houari Sahraoui DIRO, Université de Montréal | ||
10:10 5mTalk | Towards Automated User Story Quality Assessment with LLMs: An Empirical Study on Syntactic and Pragmatic QUS Criteria Journal Ahead Workshop (JAWs) Izabella Silva Federal University of Campina Grande - ISE/VIRTUS, Emanuel Dantas Filho Federal University of Campina Grande - ISE/VIRTUS, Ademar Sousa Neto VIRTUS/UFCG, Mirko Perkusich VIRTUS, Danyllo Albuquerque VIRTUS/UFCG, Kyller Costa Gorgônio Federal University of Campina Grande, Angelo Percusich Federal University of Campina Grande - ISE/VIRTUS | ||
10:15 5mTalk | MARS: Few-Shot Android Malware Detection with RAG-Enhanced LLMs Journal Ahead Workshop (JAWs) Guangquan Xu School of Cybersecurity, Tianjin University, Minhong Dong School of Cybersecurity, Tianjin University, Qi Guo Tianjin University, Hongpeng Bai School of Cybersecurity, Tianjin University, Yao Zhang Tianjin University, Ruitao Feng Southern Cross University, Wenying He Hebei University of Technology, Yude Bai Tianjin University, Ji Zhang University of Southern Queensland | ||
10:20 5mTalk | A Closer Look at the Malicious Pre-Trained Models on Hugging Face Journal Ahead Workshop (JAWs) Junwei Zhang Zhejiang University, Xing Hu Zhejiang University, Xin Xia Zhejiang University, David Lo Singapore Management University, Shanping Li Zhejiang University | ||