Prevalence of Code Smells in Reinforcement Learning Projects (CAIN 2023 - Papers) - CAIN 2023

Mon 15 - Sat 20 May 2023 Melbourne, Australia

co-located with ICSE 2023

Who

Nicolás Cardozo, Ivana Dusparic, Christian Cabrera

Track

CAIN 2023 Papers

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

When

Mon 15 May 2023 18:05 - 18:20 at Virtual - Zoom for CAIN - Data & Model Optimization Chair(s): Justus Bogner
Sat 20 May 2023 13:46 - 13:54 at Meeting Room 105 - Realizing the Promise of AI: Challenges and Visions Chair(s): Ipek Ozkaya

Abstract

RL is being increasingly used to learn and adapt application behavior in many domains, including large-scale and safety critical systems, as for example, autonomous driving. With the advent of plug-n-play RL libraries, its applicability has further increased, enabling integration of RL algorithms by non-experts. We note, however, that the majority of such code is not developed by professional programmers, which as a consequence, may lead to poor program quality yielding bugs, suboptimal performance, maintainability, and evolution problems for RL-based projects. In this paper we begin the exploration of this hypothesis, specific to code utilizing RL, analyzing different projects found in the wild, to assess their quality from a software engineering perspective. Our study includes 24 popular RL-based Python projects, analyzed with standard software engineering metrics. Our results, aligned with similar analyses for ML code in general, show that popular and widely reused RL repositories contain many code smells (3.95% of the code base on average), significantly affecting the projects’ maintainability. The most common code smells detected are long method and long method chain, highlighting problems in the definition and interaction of agents. Detected code smells suggest problems in responsibility separation, and the appropriateness of current abstractions for the definition of RL algorithms.

Link to Preprint

https://arxiv.org/pdf/2303.10236.pdf

Nicolás Cardozo

Universidad de los Andes

Colombia

Ivana Dusparic

Trinity College Dublin, Ireland

Christian Cabrera

Department of Computer Science and Technology, Univesity of Cambridge

[CAIN'23] Prevalence of Code Smells in Reinforcement Learning Projects

Time Zone

The program is currently displayed in (GMT+10:00) Hobart.

Use conference time zone: (GMT+10:00) HobartSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Session Program

Mon 15 May
Displayed time zone: Hobart change

	17:15 - 18:45	Data & Model OptimizationPapers / Posters / Industrial Talks at Virtual - Zoom for CAIN Chair(s): Justus Bogner University of Stuttgart Click here to Join us over zoom Click here to watch the session recording on Youtube

	17:15 15m Short-paper		Automatically Resolving Data Source Dependency Hell in Large Scale Data Science Projects Papers Laurent Boué Microsoft, Pratap Kunireddy Microsoft, Pavle Subotic Microsoft Azure Pre-print
	17:30 15m Short-paper		Dataflow graphs as complete causal graphs Papers Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Siyuan Guo Max Planck Institute for Intelligent Systems, Bernhard Schölkopf MPI Tuebingen, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge Pre-print
	17:45 20m Long-paper		Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AIDistinguished paper Award Candidate Papers Tim Yarally Delft University of Technology, Luís Cruz Delft University of Technology, Daniel Feitosa University of Groningen, June Sallou Delft University of Technology, Arie van Deursen Delft University of Technology Pre-print
	18:05 15m Short-paper		Prevalence of Code Smells in Reinforcement Learning Projects Papers Nicolás Cardozo Universidad de los Andes, Ivana Dusparic Trinity College Dublin, Ireland, Christian Cabrera Department of Computer Science and Technology, Univesity of Cambridge Pre-print Media Attached
	18:20 20m Long-paper		Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges Papers Hans-Martin Heyn University of Gothenburg & Chalmers University of Technology, Khan Mohammad Habibullah University of Gothenburg, Eric Knauss Chalmers \| University of Gothenburg, Jennifer Horkoff Chalmers and the University of Gothenburg, Markus Borg CodeScene, Alessia Knauss Zenseact AB, Polly Jing Li Kognic AB Pre-print

Sat 20 May
Displayed time zone: Hobart change

	13:30 - 15:00	Realizing the Promise of AI: Challenges and Visions Papers at Meeting Room 105 Chair(s): Ipek Ozkaya Carnegie Mellon University

	13:30 8m Long-paper		A Meta-Summary of Challenges in Building Products with ML Components -- Collecting Experiences from 4758+ PractitionersDistinguished paper Award Candidate Papers Nadia Nahar Carnegie Mellon University, Haoran Zhang Carnegie Mellon University, USA, Grace Lewis Carnegie Mellon Software Engineering Institute, Shurui Zhou University of Toronto, Canada, Christian Kästner Carnegie Mellon University Pre-print File Attached
	13:38 8m Short-paper		Dataflow graphs as complete causal graphs Papers Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Siyuan Guo Max Planck Institute for Intelligent Systems, Bernhard Schölkopf MPI Tuebingen, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge Pre-print
	13:46 8m Short-paper		Prevalence of Code Smells in Reinforcement Learning Projects Papers Nicolás Cardozo Universidad de los Andes, Ivana Dusparic Trinity College Dublin, Ireland, Christian Cabrera Department of Computer Science and Technology, Univesity of Cambridge Pre-print Media Attached
	13:54 8m Short-paper		Towards Code Generation from BDD Test Case Specifications: A vision Papers Leon Chemnitz TU Darmstadt, David Reichenbach TU Darmstadt, Germany, Hani Aldebes TU Darmstadt, Mariam Naveed TU Darmstadt, Krishna Narasimhan TU Darmstadt, Mira Mezini TU Darmstadt Pre-print
	14:02 8m Long-paper		Towards Concrete and Connected AI Risk Assessment (C2AIRA): A Systematic Mapping Study Papers Boming Xia CSIRO's Data61 & University of New South Wales, Qinghua Lu CSIRO’s Data61, Harsha Perera CSIRO's Data61 & University of New South Wales, Liming Zhu The University of New South Wales, Zhenchang Xing , Yue Liu CSIRO's Data61 & University of New South Wales, Jon Whittle CSIRO's Data61 and Monash University Pre-print
	14:10 50m Panel		Panel Discussion - Onsite Papers

:

:

:

: