Understanding Formal Reasoning Failures in LLMs as Abstract Interpreters (LMPL 2025)

Who

Jacqueline Mitchell, Brian Hyeongseok Kim, Chenyu Zhou, Chao Wang

Track

LMPL 2025

Time Zone

The program is currently displayed in (GMT+08:00) Perth.

Use conference time zone: (GMT+08:00) PerthSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 15 Oct 2025 17:10 - 17:30 at Orchid East - LLMs for Program Analysis and Verification II Chair(s): Zhuo Zhang

Abstract

Large language models (LLMs) are increasingly used for program verification, and yet little is known about \emph{how} they reason about program semantics during this process. In this work, we focus on abstract interpretation based-reasoning for invariant generation and introduce two novel prompting strategies that aim to elicit such reasoning from LLMs. We evaluate these strategies across several state-of-the-art LLMs on 22 programs from the SV-COMP benchmark suite, widely used in software verification, and we analyze both the soundness of the generated invariants and the key thematic patterns in the models’ reasoning errors. This work aims to highlight new research opportunities at the intersection of large language models and program verification, both for applying LLMs to verification tasks and for advancing their reasoning capabilities in this application.

Jacqueline Mitchell

University of Southern California

Brian Hyeongseok Kim

University of Southern California

Chenyu Zhou

University of Southern California

United States

Chao Wang

University of Southern California

United States

Time Zone

The program is currently displayed in (GMT+08:00) Perth.

Use conference time zone: (GMT+08:00) PerthSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 15 Oct
Displayed time zone: Perth change

16:00 - 17:40	LLMs for Program Analysis and Verification IILMPL at Orchid East Chair(s): Zhuo Zhang Columbia University

16:00 15m Talk		Hallucination-Resilient LLM-Driven Sound and Tunable Static Analysis LMPL Guannan Wei Tufts University, Zhuo Zhang Columbia University, Caterina Urban Inria & ENS \| PSL
16:15 20m Talk		Toward Repository-Level Program Verification with Large Language Models LMPL Si Cheng Zhong University of Toronto, Xujie Si University of Toronto DOI Pre-print
16:35 15m Talk		Preguss: It Analyzes, It Specifies, It Verifies LMPL Zhongyi Wang Zhejiang University, China, Tengjie Lin Zhejiang University, Mingshuai Chen Zhejiang University, Mingqi Yang Zhejiang University, Haokun Li Peking University, Xiao Yi The Chinese University of Hong Kong, Shengchao Qin Xidian University, Jianwei Yin Zhejiang University
16:50 20m Talk		A Case Study on the Effectiveness of LLMs in Verification with Proof Assistants LMPL Barış Bayazıt University of Toronto, Yao Li Portland State University, Xujie Si University of Toronto DOI Pre-print
17:10 20m Talk		Understanding Formal Reasoning Failures in LLMs as Abstract Interpretersremote LMPL Jacqueline Mitchell University of Southern California, Brian Hyeongseok Kim University of Southern California, Chenyu Zhou University of Southern California, Chao Wang University of Southern California