Omega-Regular Objectives in Model-Free Reinforcement Learning (TACAS 2019)

Sat 6 - Thu 11 April 2019 Prague, Czech Republic

Who

Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

Track

TACAS 2019

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 9 Apr 2019 14:00 - 14:30 at JUPITER - Machine Learning Chair(s): Bernhard Steffen

Abstract

We provide the first solution for model-free reinforcement learning of omega-regular objectives for Markov decision processes (MDPs). We present a constructive reduction from the almost-sure satisfaction of omega-regular objectives to an almost-sure reachability problem and extend this technique to learning how to control an unknown model so that the chance of satisfying the objective is maximized. A key feature of our technique is the compilation of omega-regular properties into limit-deterministic Buechi automata instead of the traditional Rabin automata; this choice sidesteps difficulties that have marred previous proposals. Our approach allows us to apply model-free, off-the-shelf reinforcement learning algorithms to compute optimal strategies from the observations of the MDP. We present an experimental evaluation of our technique on benchmark learning problems.

Link to Publication

https://link.springer.com/chapter/10.1007/978-3-030-17462-0_27

Ernst Moritz Hahn

Queen's University Belfast

United Kingdom

Mateo Perez

Sven Schewe

University of Liverpool

Fabio Somenzi

Ashutosh Trivedi

Dominik Wojtczak