CAIN 2023
Mon 15 - Sat 20 May 2023 Melbourne, Australia
co-located with ICSE 2023

RL is being increasingly used to learn and adapt application behavior in many domains, including large-scale and safety critical systems, as for example, autonomous driving. With the advent of plug-n-play RL libraries, its applicability has further increased, enabling integration of RL algorithms by non-experts. We note, however, that the majority of such code is not developed by professional programmers, which as a consequence, may lead to poor program quality yielding bugs, suboptimal performance, maintainability, and evolution problems for RL-based projects. In this paper we begin the exploration of this hypothesis, specific to code utilizing RL, analyzing different projects found in the wild, to assess their quality from a software engineering perspective. Our study includes 24 popular RL-based Python projects, analyzed with standard software engineering metrics. Our results, aligned with similar analyses for ML code in general, show that popular and widely reused RL repositories contain many code smells (3.95% of the code base on average), significantly affecting the projects’ maintainability. The most common code smells detected are long method and long method chain, highlighting problems in the definition and interaction of agents. Detected code smells suggest problems in responsibility separation, and the appropriateness of current abstractions for the definition of RL algorithms.

Mon 15 May

Displayed time zone: Hobart change

17:15 - 18:45
Data & Model OptimizationPapers / Posters / Industrial Talks at Virtual - Zoom for CAIN
Chair(s): Justus Bogner University of Stuttgart

Click here to Join us over zoom

Click here to watch the session recording on Youtube

17:15
15m
Short-paper
Automatically Resolving Data Source Dependency Hell in Large Scale Data Science Projects
Papers
Laurent Boué Microsoft, Pratap Kunireddy Microsoft, Pavle Subotic Microsoft Azure
Pre-print
17:30
15m
Short-paper
Dataflow graphs as complete causal graphs
Papers
Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Siyuan Guo Max Planck Institute for Intelligent Systems, Bernhard Schölkopf MPI Tuebingen, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge
Pre-print
17:45
20m
Long-paper
Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AIDistinguished paper Award Candidate
Papers
Tim Yarally Delft University of Technology, Luís Cruz Delft University of Technology, Daniel Feitosa University of Groningen, June Sallou Delft University of Technology, Arie van Deursen Delft University of Technology
Pre-print
18:05
15m
Short-paper
Prevalence of Code Smells in Reinforcement Learning Projects
Papers
Nicolás Cardozo Universidad de los Andes, Ivana Dusparic Trinity College Dublin, Ireland, Christian Cabrera Department of Computer Science and Technology, Univesity of Cambridge
Pre-print Media Attached
18:20
20m
Long-paper
Automotive Perception Software Development: An Empirical Investigation into Data, Annotation, and Ecosystem Challenges
Papers
Hans-Martin Heyn University of Gothenburg & Chalmers University of Technology, Khan Mohammad Habibullah University of Gothenburg, Eric Knauss Chalmers | University of Gothenburg, Jennifer Horkoff Chalmers and the University of Gothenburg, Markus Borg CodeScene, Alessia Knauss Zenseact AB, Polly Jing Li Kognic AB
Pre-print

Sat 20 May

Displayed time zone: Hobart change

13:30 - 15:00
Realizing the Promise of AI: Challenges and Visions Papers at Meeting Room 105
Chair(s): Ipek Ozkaya Carnegie Mellon University
13:30
8m
Long-paper
A Meta-Summary of Challenges in Building Products with ML Components -- Collecting Experiences from 4758+ PractitionersDistinguished paper Award Candidate
Papers
Nadia Nahar Carnegie Mellon University, Haoran Zhang Carnegie Mellon University, USA, Grace Lewis Carnegie Mellon Software Engineering Institute, Shurui Zhou University of Toronto, Canada, Christian Kästner Carnegie Mellon University
Pre-print File Attached
13:38
8m
Short-paper
Dataflow graphs as complete causal graphs
Papers
Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Siyuan Guo Max Planck Institute for Intelligent Systems, Bernhard Schölkopf MPI Tuebingen, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge
Pre-print
13:46
8m
Short-paper
Prevalence of Code Smells in Reinforcement Learning Projects
Papers
Nicolás Cardozo Universidad de los Andes, Ivana Dusparic Trinity College Dublin, Ireland, Christian Cabrera Department of Computer Science and Technology, Univesity of Cambridge
Pre-print Media Attached
13:54
8m
Short-paper
Towards Code Generation from BDD Test Case Specifications: A vision
Papers
Leon Chemnitz TU Darmstadt, David Reichenbach TU Darmstadt, Germany, Hani Aldebes TU Darmstadt, Mariam Naveed TU Darmstadt, Krishna Narasimhan TU Darmstadt, Mira Mezini TU Darmstadt
Pre-print
14:02
8m
Long-paper
Towards Concrete and Connected AI Risk Assessment (C2AIRA): A Systematic Mapping Study
Papers
Boming Xia CSIRO's Data61 & University of New South Wales, Qinghua Lu CSIRO’s Data61, Harsha Perera CSIRO's Data61 & University of New South Wales, Liming Zhu The University of New South Wales, Zhenchang Xing , Yue Liu CSIRO's Data61 & University of New South Wales, Jon Whittle CSIRO's Data61 and Monash University
Pre-print
14:10
50m
Panel
Panel Discussion - Onsite
Papers