ASE 2024 (series) / Student Research Competition /
Can Large Language Models Comprehend Code Stylometry?
Tue 29 Oct 2024 13:15 - 13:30 at Bondi - SRC Presentations
Code Authorship Attribution (CAA) has several applications such as copyright disputes, plagiarism detection and criminal prosecution. Existing studies mainly focused on CAA by proposing machine learning (ML) and Deep Learning (DL) based techniques. The main limitations of ML-based techniques are (a) manual feature engineering is required to train these models and (b) they are vulnerable to adversarial attack. In this study, we initially fine-tune five Large Language Models (LLMs) for CAA and evaluate their performance.Our results show that LLMs are robust and less vulnerable compared to existing techniques in CAA task.
Tue 29 OctDisplayed time zone: Pacific Time (US & Canada) change
Tue 29 Oct
Displayed time zone: Pacific Time (US & Canada) change
13:00 - 13:45 | |||
13:00 15mTalk | Calibration of Large Language Models for Code Summarization Student Research Competition Yuvraj Virk UC Davis | ||
13:15 15mTalk | Can Large Language Models Comprehend Code Stylometry? Student Research Competition | ||
13:30 15mTalk | Semi-Automated Verification of Interior Unsafe Code Encapsulation in Real-World Rust Systems Student Research Competition Zihao Rao Fudan University |