Safety Monitoring of Deep Reinforcement Learning Agents (ICSE 2024 - Posters)

Who

Amirhossein Zolfagharian, Manel Abdellatif, Lionel Briand, Ramesh S

Track

ICSE 2024 Posters

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

The GMT offsets shown reflect the offsets at the moment of the conference.

When

Wed 17 Apr 2024 10:30 - 11:00 at Open Space - Posters 1

Abstract

Deep reinforcement learning (DRL) algorithms are increasingly used in safety-critical systems, where ensuring the safety of DRL agents is a critical concern. Relying solely on testing is not sufficient to ensure safety, because testing offers no guarantees. Building safety monitors is one way to alleviate this challenge. This paper proposes SMARLA, a machine learning-based safety monitoring approach designed for DRL agents. For practical reasons, SMARLA is designed to be black-box (it does not require access to the internals of the agent), and it leverages state abstraction to reduce the state space and thus facilitate learning safety violation prediction models from the agent’s states. We validated SMARLA on two well-known RL case studies. Empirical analysis reveals that SMARLA achieves accurate violation prediction with a low false positive rate and can predict safety violations at an early stage, approximately halfway through the agent’s execution, before violations occur.
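
The general idea of a black-box monitor built on state abstraction can be illustrated with a minimal sketch. This is not SMARLA's implementation: the abstract does not specify the abstraction method or the prediction model, so the sketch swaps in a fixed-grid discretization and a simple per-cell violation frequency as stand-ins. The names abstract_state, ViolationMonitor, cell, and threshold are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict
from typing import Dict, List, Optional, Sequence, Tuple

State = Tuple[float, ...]
AbstractState = Tuple[int, ...]
Episode = Tuple[List[State], bool]  # (observed states, ended in violation?)


def abstract_state(state: Sequence[float], cell: float = 0.5) -> AbstractState:
    """Map a concrete state to a grid cell (assumed abstraction, not SMARLA's)."""
    return tuple(math.floor(x / cell) for x in state)


class ViolationMonitor:
    """Toy black-box monitor: it observes states only, never agent internals."""

    def __init__(self, cell: float = 0.5, threshold: float = 0.8) -> None:
        self.cell = cell
        self.threshold = threshold
        self.seen: Dict[AbstractState, int] = defaultdict(int)
        self.unsafe: Dict[AbstractState, int] = defaultdict(int)

    def fit(self, episodes: List[Episode]) -> None:
        """Count, per abstract state, the episodes that visited it and violated."""
        for states, violated in episodes:
            # Each abstract state is counted once per episode.
            for a in {abstract_state(s, self.cell) for s in states}:
                self.seen[a] += 1
                if violated:
                    self.unsafe[a] += 1

    def risk(self, state: Sequence[float]) -> float:
        """Estimated violation probability for the cell containing this state."""
        a = abstract_state(state, self.cell)
        visits = self.seen.get(a, 0)
        if visits == 0:
            return 0.0  # unseen cell: no evidence either way
        return self.unsafe.get(a, 0) / visits

    def first_alarm(self, states: List[State]) -> Optional[int]:
        """Return the first step whose risk reaches the threshold, else None."""
        for step, s in enumerate(states):
            if self.risk(s) >= self.threshold:
                return step
        return None
```

In use, such a monitor would be fitted on logged episodes labeled by outcome and then queried step by step while the agent runs, raising an alarm before the episode ends. The abstraction trades precision for a smaller state space, which is what makes the per-cell statistics learnable from a limited number of episodes.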

Amirhossein Zolfagharian

University of Ottawa - School of Electrical Engineering & Computer Science (EECS)

Canada

Manel Abdellatif

Software and Information Technology Engineering Department, École de Technologie Supérieure

Canada

Lionel Briand

University of Ottawa, Canada; Lero centre, University of Limerick, Ireland

Canada

Ramesh S

Session Program

Wed 17 Apr
Displayed time zone: Lisbon

10:30 - 11:00	Posters 1 (Posters) at Open Space

10:30 30m Poster		KareCoder: A New Knowledge-Enriched Code Generation System Posters Tao Huang Shandong Normal University, Zhihong Sun Shandong Normal University, Zhi Jin Peking University, Ge Li Peking University, Chen Lyu Shandong Normal University
10:30 30m Poster		An Empirical Study on Cross-language Clone Bugs Posters Honghao Chen Shanghai Jiao Tong University, Ye Tang Shanghai Jiao Tong University, Hao Zhong Shanghai Jiao Tong University
10:30 30m Poster		Poster: Kotlin Assimilating the Android Ecosystem - An Appraisal of Diffusion and Impact on Maintainability Posters Riccardo Coppola Politecnico di Torino, Tommaso Fulcini Politecnico di Torino, Marco Torchiano Politecnico di Torino
10:30 30m Poster		Prompt-Enhanced Software Vulnerability Detection Using ChatGPT Posters Chenyuan Zhang Xiamen University, Hao Liu Xiamen University, Jiutian Zeng Alibaba, Kejing Yang Alibaba, Yuhong Li Alibaba, Hui Li Xiamen University
10:30 30m Poster		Applying Transformer Models for Automatic Build Errors Classification of Java-Based Open Source Projects Posters Jonathan Lee National Taiwan University, Mason Li National Taiwan University, Kuo-Hsun Hsu Department of Computer Science, National Taichung University of Education
10:30 30m Poster		A First Look at the General Data Protection Regulation (GDPR) in Open-Source Software Posters Lucas Franke Virginia Tech, Huayu Liang Virginia Tech, Aaron Brantly Virginia Tech, James C. Davis Purdue University, Chris Brown Virginia Tech
10:30 30m Poster		Interpretable Software Maintenance and Support Effort Prediction Using Machine Learning Posters Susmita Haldar Fanshawe College, Luiz Fernando Capretz Western University
10:30 30m Poster		Endogeneity, Instruments, and Two-Stage Models Posters Lorenz Graf-Vlachy University of Stuttgart, Stefan Wagner Technical University of Munich
10:30 30m Poster		ParSE: Efficient Detection of Smart Contract Vulnerabilities via Parallel and Simplified Symbolic Execution Posters Long He Yantai University, Xiangfu Zhao Yantai University, Yichen Wang Yantai University
10:30 30m Poster		Safety Monitoring of Deep Reinforcement Learning Agents Posters Amirhossein Zolfagharian University of Ottawa - School of Electrical Engineering & Computer Science (EECS), Manel Abdellatif Software and Information Technology Engineering Department, École de Technologie Supérieure, Lionel Briand University of Ottawa, Canada; Lero centre, University of Limerick, Ireland, Ramesh S
10:30 30m Poster		An Actionable Framework for Understanding and Improving Talent Retention as a Competitive Advantage in IT Organizations Posters Luiz Alexandre Costa UNIRIO, Edson Dias Federal University of Pará, Danilo Ribeiro Zup Innovation, Awdren Fontão Federal University of Mato Grosso do Sul (UFMS), Gustavo Pinto Federal University of Pará (UFPA) and Zup Innovation, Rodrigo Santos UNIRIO - Universidade Federal do Estado do Rio de Janeiro, Alexander Serebrenik Eindhoven University of Technology
10:30 30m Poster		Obfuscation-Resilient Software Plagiarism Detection with JPlag Posters Timur Sağlam Karlsruhe Institute of Technology (KIT), Sebastian Hahner Karlsruhe Institute of Technology (KIT), Larissa Schmid Karlsruhe Institute of Technology, Erik Burger Karlsruhe Institute of Technology (KIT)
10:30 30m Poster		Micro-scale Concolic Testing Framework for Automated Test Data Generation Based on Path Coverage Posters Fangqing Liu, Han Huang South China University of Technology, Yi Xiang South China University of Technology
10:30 30m Poster		What do you assume? A Theory of Security-Related Assumptions Posters Sophie Corallo Karlsruhe Institute of Technology (KIT), Thomas Weber, Lars König Karlsruhe Institute of Technology, Kathrin Leonie Schmidt Karlsruhe Institute of Technology, Frederik Reiche Karlsruhe Institute of Technology, Anne Koziolek Karlsruhe Institute of Technology