Toward Repository-Level Program Verification with Large Language Models (LMPL 2025)

Sun 12 - Sat 18 October 2025, Singapore

Who

Si Cheng Zhong, Xujie Si

Track

LMPL 2025

When

Wed 15 Oct 2025, 16:15 - 16:35 (GMT+08:00), at Orchid East. Session: LLMs for Program Analysis and Verification II. Chair(s): Zhuo Zhang

Abstract

Recent advances in large language models (LLMs) show great promise for code and proof generation. However, scaling automated formal verification to real-world projects requires resolving cross-module dependencies and global context, challenges that existing LLM-based methods overlook because they focus on isolated, function-level verification tasks. To systematically explore and address the challenges of verifying entire software repositories, we introduce RVBench, the first verification benchmark explicitly designed for repository-level evaluation, constructed from four diverse and complex open-source Verus projects.

We further introduce RagVerus, an extensible framework that combines retrieval-augmented generation with context-aware prompting to automate proof synthesis for multi-module repositories. RagVerus triples proof pass rates on existing benchmarks under constrained model inference budgets and achieves a 27% relative improvement on the more challenging RVBench, demonstrating a scalable and sample-efficient verification solution.
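
The general idea of retrieval-augmented prompting over a repository can be illustrated with a minimal sketch. This is not RagVerus's implementation: the token-count cosine similarity is a crude stand-in for whatever retriever the framework uses, and every name here (tokenize, retrieve, build_prompt, REPO, TARGET, the module paths, and the snippet contents) is hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase identifier-like tokens (assumption: a
    bag-of-words stand-in for a real embedding model)."""
    return re.findall(r"[a-z_][a-z0-9_]*", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(target, repo_items, k=2):
    """Return names of the k repository items most similar to the target,
    dropping items with zero similarity."""
    query = Counter(tokenize(target))
    scored = [
        (cosine(query, Counter(tokenize(src))), name)
        for name, src in repo_items.items()
    ]
    # Highest score first; ties broken by name for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in scored[:k] if score > 0]


def build_prompt(target, repo_items, k=2):
    """Assemble a prompt: retrieved cross-module context, then the target."""
    names = retrieve(target, repo_items, k)
    context = "\n\n".join(f"// from {n}\n{repo_items[n]}" for n in names)
    return (
        "Relevant repository context:\n"
        f"{context}\n\n"
        "Add proof annotations so this function verifies:\n"
        f"{target}\n"
    )


# Hypothetical repository items, written loosely in the style of Verus code.
REPO = {
    "math::lemmas": "proof fn lemma_sum_bounded(xs: Seq<u64>) "
                    "ensures sum(xs) <= xs.len() * u64::MAX",
    "math::spec": "spec fn sum(xs: Seq<u64>) -> int",
    "net::parser": "fn parse_header(buf: &[u8]) -> Option<Header>",
}
TARGET = "fn total(xs: &Vec<u64>) -> u64 ensures result == sum(xs@)"

if __name__ == "__main__":
    print(build_prompt(TARGET, REPO, k=2))
```

In this toy setting the prompt for the target function carries the specification and lemma it depends on from other modules while leaving out the unrelated parser code, which is the kind of cross-module context that function-level prompting would miss.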

Link to Preprint

https://arxiv.org/pdf/2509.25197

DOI

https://doi.org/10.1145/3759425.3763382

Si Cheng Zhong

University of Toronto

Xujie Si