1–2–3–Go! Policy Synthesis for Parameterized Markov Decision Processes via Decision-Tree Learning and Generalization
Despite advances in probabilistic model checking, the scalability of verification methods remains limited. In particular, the state space often becomes extremely large when parameterized Markov decision processes (MDPs) are instantiated with even moderate parameter values. Synthesizing policies for such huge MDPs is beyond the reach of available tools; we propose a learning-based approach to obtain reasonable policies for them.
The idea is to generalize optimal policies obtained by model checking small instances to larger ones via decision-tree learning. Consequently, our method bypasses the need for explicit state-space exploration of large models, providing a practical solution to the state-space explosion problem. We demonstrate the efficacy of our approach through extensive experiments on relevant models from the quantitative verification benchmark set. The results indicate that our policies perform well even when the model size is orders of magnitude beyond the reach of state-of-the-art analysis tools.
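To make the generalization step concrete, below is a minimal sketch in Python using scikit-learn's DecisionTreeClassifier. The state encoding, the variable names (N, counter, flag), the actions, and the training data are all hypothetical illustrations, not taken from the paper; in the actual pipeline, the training data would come from optimal policies computed by a probabilistic model checker on small instances.

```python
# A minimal sketch of the generalization step, not the authors' implementation.
# It assumes state-action pairs have already been extracted from optimal
# policies of small model-checked instances, with each state encoded as a
# vector of integer-valued variables.
from sklearn.tree import DecisionTreeClassifier

# Hypothetical training data: states of small instances (parameter N = 2, 3)
# paired with the optimal action chosen by the model checker in that state.
# A state here is the tuple (N, counter, flag); actions are labeled by name.
train_states = [
    (2, 0, 0), (2, 1, 0), (2, 1, 1), (2, 2, 1),
    (3, 0, 0), (3, 1, 0), (3, 2, 0), (3, 2, 1), (3, 3, 1),
]
train_actions = [
    "inc", "inc", "reset", "reset",
    "inc", "inc", "inc", "reset", "reset",
]

# Fit a small decision tree; its branching predicates over state variables
# are what let the learned policy generalize beyond the training instances.
tree = DecisionTreeClassifier(max_depth=3).fit(train_states, train_actions)

# The tree now acts as a policy for a much larger instance (e.g., N = 100):
# given any concrete state, it returns an action without ever constructing
# the large state space.
print(tree.predict([(100, 57, 0)]))
```

The key design point this illustrates is that the tree maps state variables to actions symbolically, so evaluating the policy on a large instance costs only a walk down the tree per state.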
This program is tentative and subject to change.
Tue 21 Jan (displayed time zone: Mountain Time, US & Canada)
09:00 - 10:30

09:00 (60m) Talk: Keynote: Outcome Logic: A Foundational Framework for Concurrent and Probabilistic Program Analysis
VMCAI 2025
Alexandra Silva (Cornell University)

10:00 (30m) Talk: 1–2–3–Go! Policy Synthesis for Parameterized Markov Decision Processes via Decision-Tree Learning and Generalization
VMCAI 2025
Muqsit Azeem (Technical University of Munich), Debraj Chakraborty (Masaryk University), Sudeep Kanav (LMU Munich), Jan Kretinsky (Masaryk University, Czech Republic), Mohammadsadegh Mohagheghi (Masaryk University), Stefanie Mohr (Technical University of Munich), Maximilian Weininger (Institute of Science and Technology Austria)