
Large language models (LLMs) excel at many software engineering tasks, yet progress in leveraging them for vulnerability discovery has stalled in recent years. To understand this phenomenon, we investigate LLMs through the lens of classic code metrics. Surprisingly, we find that a classifier trained solely on these metrics performs on par with state-of-the-art LLMs at vulnerability discovery. A root-cause analysis reveals a strong correlation and a causal effect between LLM predictions and code metrics: when the value of a metric is changed, the LLM's prediction tends to shift by a corresponding magnitude. This dependency suggests that LLMs operate at a level as shallow as code metrics, limiting their ability to grasp complex vulnerability patterns and to realize their full potential in vulnerability discovery.
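For readers who want to picture the metrics-only baseline, the following is a minimal sketch: a standard classifier trained on nothing but classic code metrics computed per function. The particular metrics (lines of code, cyclomatic complexity, nesting depth, fan-out), the random forest, and the synthetic labels are illustrative assumptions; the abstract does not name the metrics, the model, or the dataset used.

```python
"""Hedged sketch of a metrics-only vulnerability classifier.

All features and labels below are synthetic stand-ins; in the real
setting each row would hold classic code metrics for one function,
with a ground-truth vulnerable/not-vulnerable label.
"""
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n = 1000

# One row of assumed classic metrics per function:
# [lines of code, cyclomatic complexity, max nesting depth, fan-out].
X = np.column_stack([
    rng.integers(5, 300, n),   # lines of code
    rng.integers(1, 40, n),    # cyclomatic complexity
    rng.integers(0, 8, n),     # maximum nesting depth
    rng.integers(0, 20, n),    # number of callees (fan-out)
])

# Toy labels loosely tied to complexity, only to make the sketch runnable.
y = (X[:, 1] + 2 * X[:, 2] + rng.normal(0, 5, n) > 25).astype(int)

clf = RandomForestClassifier(n_estimators=200, random_state=0)
print("CV F1:", cross_val_score(clf, X, y, cv=5, scoring="f1").mean())
```

Here the random forest is merely a stand-in for any off-the-shelf tabular classifier; the essential point of the baseline is that the input features are shallow, hand-crafted metrics with no learned representation of the code.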