FSE 2026
Sun 5 - Thu 9 July 2026 Montreal, Canada

This program is tentative and subject to change.

Wed 8 Jul 2026 14:20 - 14:40 at MB 2.430 - Software Tests

A central challenge in software testing is deciding which parts of a program’s state to check as evidence of correct behavior to reveal regressions. These checks are embodied as test oracles, typically as assertions in test cases. State observation strategies play a decisive role in shaping how effectively regressions can be revealed. Such strategies range from exhaustive memory snapshots to selective attribute checks via getter methods and nullability checks. These strategies are deeply embedded in both research and practice: academic work has explored heuristic- and serialization-based oracles, while industry has widely adopted snapshot testing. Despite their importance, the effects of different state-capture choices remain poorly understood from a scientific perspective. In this work, we present an experimental framework for systematically analyzing these design choices along the dimensions of observation scope, extraction approach, and extraction depth. Using this framework, we conduct an empirical study across twelve real-world projects, measuring how state-capture strategies influence regression-revealing capability and the richness of oracle information. Our findings reveal that human-written assertions often under-utilize available program state, achieving well below the fault-revealing potential of systematic observation strategies. Simple design choices, such as exposing unchecked intermediate return values, carefully selecting getters, and deepening state extraction, can yield measurable improvements (avg. 35.7%) in regression detection without needing to observe an overwhelmingly large amount of program-state data. Additionally, we highlight the challenges of observability, assertion desirability, and the trade-offs of capturing richer program states. Such insights show how small design choices can yield major differences in regression detection and potentially offer concrete directions for both tool builders and practitioners.

This program is tentative and subject to change.

Wed 8 Jul

Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:30
14:00
20m
Talk
IntentTester: Intent-Driven Multi-Agent Framework for Cross-Library Test Migration
Research Papers
Yi Gao Zhejiang University, Ziyuan Zhang Zhejiang University, Xing Hu Zhejiang University, Xiaohu Yang Zhejiang University, Xin Xia Zhejiang University
14:20
20m
Talk
Revealing Regressions: A Comparative Study of State-Capture Strategies in Validating Program Behavior
Research Papers
Hang Du University of California at Irvine, Vijay Krishna Palepu Microsoft, James Jones University of California at Irvine
14:40
20m
Talk
TestTailor: Generating High-Coverage Tests via Path-Proximal Tests with LLMs
Research Papers
Xiaoxuan Zhou Northeastern University, Yiling Lou University of Illinois at Urbana-Champaign, Jinhao Dong Peking University, Dan Hao Peking University
15:00
20m
Talk
Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair
Industry Papers
Sam Cheng Google, Michele Tufano Google, José Pablo Cambronero Google, USA, Renyao Wei Google, Sherry Shi Google, Grant Uy Google, Patrick Rondon Google, Franjo Ivančić Google
15:20
10m
Talk
Energy-Aware Test Prioritization for High-Performance Computing: A Multi-Objective Approach
Ideas, Visions and Reflections
Ninad Anklesaria Oregon State University, Manish Motwani Oregon State University