CAIN 2024
Sun 14 - Mon 15 April 2024 Lisbon, Portugal
co-located with ICSE 2024

This year, CAIN will include a Doctoral Symposium, providing students with the opportunity to receive rich feedback on their PhD work related to the CAIN themes of Software Engineering applied to AI and Data Science. Students may submit single-author abstracts of early-stage ideas or late-stage work for review by the Doctoral Symposium Committee.

Dates
Tracks
You're viewing the program in a time zone which is different from your device's time zone change time zone

Sun 14 Apr

Displayed time zone: Lisbon change

09:00 - 10:30
Opening and KeynoteResearch and Experience Papers at Pequeno Auditório
Chair(s): Grace Lewis Carnegie Mellon Software Engineering Institute
09:00
30m
Day opening
Opening
Research and Experience Papers

09:30
60m
Keynote
Keynote by Pedro Bizarro - To have great machine learning models in production in harsh environments, first focus on the harsh environments
Research and Experience Papers

11:00 - 12:30
Architecting, Designing, Managing, and Modeling AI-Enabled SystemsIndustry Talks / Research and Experience Papers at Pequeno Auditório
Chair(s): Nicolás Cardozo Universidad de los Andes
11:00
10m
Talk
A Taxonomy of Foundation Model based Systems through the Lens of Software Architecture
Research and Experience Papers
Qinghua Lu Data61, CSIRO, Liming Zhu CSIRO’s Data61, Xiwei (Sherry) Xu Data61, CSIRO, Yue Liu CSIRO's Data61 & University of New South Wales, Zhenchang Xing CSIRO's Data61, Jon Whittle CSIRO's Data61 and Monash University
11:10
15m
Talk
Investigating the Impact of Solid Design Principles on Machine Learning Code UnderstandingDistinguished paper Award Candidate
Research and Experience Papers
Raphael Cabral Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Marcos Kalinowski Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Maria Teresa Baldassarre Department of Computer Science, University of Bari , Hugo Villamizar Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Tatiana Escovedo Pontifical Catholic University of Rio de Janeiro, Helio Côrtes Vieira Lopes PUC-Rio
Pre-print
11:25
10m
Industry talk
KnowING Intelligent Document Classification: A Deep Dive into Microservices and Efficient Models at ING
Industry Talks
A: Andrew Rutherfoord CWI; University of Groningen, A: Gert Vermeer , Andrea Capiluppi Brunel University
11:35
15m
Talk
An Exploratory Study of V-Model in Building ML-Enabled Software: A Systems Engineering PerspectiveDistinguished paper Award Candidate
Research and Experience Papers
Jie JW Wu University of British Columbia (UBC)
11:50
10m
Industry talk
Engineering Challenges in Industrial AI
Industry Talks
12:00
10m
Talk
Approach for Argumenting Safety on Basis of an Operational Design Domain
Research and Experience Papers
Gereon Weiss Fraunhofer IKS, Marc Zeller Siemens AG, Hannes Schoenhaar Siemens Corporate Technology, Christian Drabek Fraunhofer Institute for Cognitive Systems IKS, Andreas Kreutz Fraunhofer Institute for Cognitive Systems IKS
12:10
15m
Talk
The Impact of Knowledge Distillation on the Performance and Energy Consumption of NLP Models
Research and Experience Papers
Ye Yuan Vrije Universiteit Amsterdam, Jiacheng Shi Vrije Universiteit Amsterdam, Zongyao Zhang Vrije Universiteit Amsterdam, Kaiwei Chen Vrije Universiteit Amsterdam, Eloise Zhang Vrije Universiteit Amsterdam, Vincenzo Stoico Vrije Universiteit Amsterdam, Ivano Malavolta Vrije Universiteit Amsterdam
14:00 - 15:30
Data Engineering and Management for AI-Enabled SystemsResearch and Experience Papers / Industry Talks at Pequeno Auditório
Chair(s): Marc Zeller Siemens AG
14:00
15m
Talk
What About the Data? A Mapping Study on Data Engineering for AI Systems
Research and Experience Papers
Petra Heck Fontys University of Applied Sciences
Pre-print
14:15
15m
Talk
Unmasking Data Secrets: An Empirical Investigation into Data Smells and Their Impact on Data Quality
Research and Experience Papers
Gilberto Recupito University of Salerno, Raimondo Rapacciuolo University of Salerno, Dario Di Nucci University of Salerno, Fabio Palomba University of Salerno
14:30
15m
Talk
An Exploratory Study of Dataset and Model Management in Open Source Machine Learning ApplicationsDistinguished paper Award Candidate
Research and Experience Papers
Tajkia Rahman Toma University of Alberta, Cor-Paul Bezemer University of Alberta
14:45
10m
Talk
DVC in Open Source AI-development: The Action and the Reaction
Research and Experience Papers
Lorena Barreto Simedo Pacheco Concordia University, Musfiqur Rahman Concordia University, Fazle Rabbi Concordia University, Pouya Fathollahzadeh Queen’s University, Ahmad Abdellatif University of Calgary, Emad Shihab Concordia University, Tse-Hsun (Peter) Chen Concordia University, Jinqiu Yang Concordia University, Ying Zou Queen's University, Kingston, Ontario
14:55
10m
Industry talk
Structuring the world of unstructured text data – Balancing business requirements, training data availability, and model performance.
Industry Talks
15:05
10m
Industry talk
Invited: Artificial Intelligence Projects, a quest between meaningful use cases, data, and unfulfilled desires.
Industry Talks
A: Andreas Jedlitschka Fraunhofer IESE
15:15
15m
Live Q&A
Data : Q&A Session
Research and Experience Papers

16:00 - 18:00
Generative AI EngineeringIndustry Talks / Research and Experience Papers at Pequeno Auditório
Chair(s): Ipek Ozkaya Carnegie Mellon University
16:00
15m
Talk
Developer Experiences with a Contextualized AI Coding Assistant: Usability, Expectations, and Outcomes
Research and Experience Papers
Gustavo Pinto Federal University of Pará (UFPA) and Zup Innovation, Cleidson de Souza Federal University of Pará, Brazil, Thayssa Rocha Zup Innovation & UFPA, Igor Steinmacher Northern Arizona University, Alberto de Souza Zup Innovation, Edward Monteiro StackSpot
16:15
10m
Talk
Privacy and Copyright Protection in Generative AI: A Lifecycle Perspective
Research and Experience Papers
Dawen (David) Zhang CSIRO's Data61, Boming Xia CSIRO's Data61 & University of New South Wales, Yue Liu CSIRO's Data61 & University of New South Wales, Xiwei (Sherry) Xu Data61, CSIRO, Thong Hoang CSIRO's Data61, Zhenchang Xing CSIRO's Data61, Mark Staples CSIRO, Australia, Qinghua Lu Data61, CSIRO, Liming Zhu CSIRO’s Data61
16:25
10m
Industry talk
Innovating Translation: Lessons Learned from BWX Generative Language Engine
Industry Talks
Vanilson Burégio Bureau Works, A: Iverson Pereira Bureau Works, Henrique Cabral
16:35
15m
Talk
Towards a Responsible AI Metrics Catalogue: A Collection of Metrics for AI AccountabilityDistinguished paper Award Candidate
Research and Experience Papers
Boming Xia CSIRO's Data61 & University of New South Wales, Qinghua Lu Data61, CSIRO, Liming Zhu CSIRO’s Data61, Sung Une (Sunny) Lee CSIRO's Data61, Yue Liu CSIRO's Data61 & University of New South Wales, Zhenchang Xing CSIRO's Data61
Pre-print
16:50
10m
Live Q&A
GenAI : Q&A
Research and Experience Papers

17:00
60m
Panel
Industry Panel
Industry Talks

Mon 15 Apr

Displayed time zone: Lisbon change

09:00 - 10:30
Keynote and PostersPosters / Research and Experience Papers at Pequeno Auditório
Chair(s): Jan Bosch Chalmers University of Technology, Henry Muccini University of L'Aquila, Italy
09:00
3m
Talk
A Domain Specific Language for Specification of Risk-oriented Object Detection Requirements
Posters
Junji Hashimoto GREE, Inc., Nobukazu Yoshioka Waseda University
09:03
3m
Talk
AI Security Continuum: Concept and Challenges
Posters
Hironori Washizaki Waseda University, Nobukazu Yoshioka Waseda University
09:06
3m
Talk
A Roadmap for Enriching Jupyter Notebooks Documentation with Kaggle Data
Posters
Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University
09:09
3m
Talk
Automating Patch Set Generation from Code Reviews Using Large Language Models
Posters
Md Tajmilur Rahman Gannon University
09:12
3m
Talk
Data Selection Driven by Item Difficulty: On Investigating Data Efficient Practice for Hyperparameter Search
Posters
Gustavo Rodrigues dos Reis NAVER LABS Europe/LIG - UGA, Adrian Mos NAVER LABS Europe, Mario Cortes Cornax LIG - UGA, Cyril Labbé LIG - UGA
09:15
3m
Talk
Beyond Syntax: Unleashing the Power of Computational Notebooks Code Metrics in Documentation Generation
Posters
Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University
09:18
3m
Talk
Can causality accelerate experimentation in software systems?
Posters
Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Han-Bo Li Department of Computer Science and Technology, University of Cambridge, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge
09:21
3m
Talk
Custom Developer GPT for Ethical AI Solutions
Posters
Lauren Olson Vrije Universiteit Amsterdam
Pre-print
09:24
3m
Talk
Evaluation of The Generality of Multi-view Modeling Framework for ML Systems
Posters
Jati H. Husen Waseda University, Japan, Jomphon Runpakprakun Waseda University, Japan, Sun Chang Waseda University, Japan, Hironori Washizaki Waseda University, Hnin Thandar Tun Waseda University, Japan, Nobukazu Yoshioka Waseda University, Japan, Yoshiaki Fukazawa Waseda University
09:27
3m
Talk
Prompt Smells: An Omen for Undesirable Generative AI Outputs
Posters
Krishna Ronanki University Of Gothenburg, Beatriz Cabrero-Daniel University of Gothenburg, Christian Berger Chalmers University of Technology, Sweden
09:30
3m
Talk
Taxonomy of Generative AI Applications for Risk Assessment
Posters
Hiroshi Tanaka Fujitsu Limited, Tokyo, Japan, Masaru Ide Fujitsu Limited, Jun Yajima Fujitsu Limited, Sachiko Onodera Fujitsu Limited, Kazuki Munakata Fujitsu Limited, Tokyo, Japan, Nobukazu Yoshioka Waseda University, Japan
09:35
55m
Keynote
Keynote by Christian Kästner - From Models to Systems: On the Role of Software Engineering for Machine Learning
Research and Experience Papers
Christian Kästner Carnegie Mellon University
11:00 - 12:30
Doctoral Symposium and Energy-Aware AI EngineeringDoctoral Symposium / Research and Experience Papers at Pequeno Auditório
Chair(s): Justus Bogner Vrije Universiteit Amsterdam, Silverio Martínez-Fernández UPC-BarcelonaTech
11:00
6m
Talk
Software Design Decisions for Greener Machine Learning-based Systems
Doctoral Symposium
Santiago del Rey Universitat Politècnica de Catalunya (UPC)
11:06
6m
Talk
Energy-Efficient Development of ML-Enabled Systems: A Data-Centric Approach
Doctoral Symposium
11:12
6m
Talk
Optimizing Data Analytics Workflows through User-driven Experimentation
Doctoral Symposium
Keerthiga Rajenthiram Vrije Universiteit Amsterdam
11:18
6m
Talk
Component-based Approach to Software Engineering of Machine Learning-enabled Systems
Doctoral Symposium
Vladislav Indykov Chalmers | University of Gothenburg
11:24
6m
Talk
Threat Modeling of ML-intensive Systems: Research Proposal
Doctoral Symposium
Felix Viktor Jedrzejewski Blekinge Institute of Technology
11:30
6m
Talk
Continuous Quality Assurance ML Pipelines under the AI Act
Doctoral Symposium
Matthias Wagner Lund University
11:36
10m
Talk
Green Runner: A tool for efficient deep learning component selection
Research and Experience Papers
Jai Kannan Applied Artificial Intelligence Institute, Deakin University, Scott Barnett Applied Artificial Intelligence Institute, Deakin University, Anj Simmons , Taylan Selvi Applied Artificial Intelligence Institute, Deakin University, Luís Cruz Delft University of Technology
11:46
15m
Talk
Engineering Carbon Emission-aware Machine Learning Pipelines
Research and Experience Papers
Erik Johannes Husom SINTEF Digital, Sagar Sen , Arda Goknil SINTEF Digital
12:01
10m
Talk
Identifying architectural design decisions for achieving green ML serving
Research and Experience Papers
Francisco Durán Universitat Politècnica De Catalunya - Barcelona Tech, Silverio Martínez-Fernández UPC-BarcelonaTech, Matias Martinez Universitat Politècnica de Catalunya (UPC), Patricia Lago Vrije Universiteit Amsterdam
Pre-print
12:11
10m
Talk
Green AI: a Preliminary Empirical Study on Energy Consumption in DL Models Across Different Runtime Infrastructures
Research and Experience Papers
Negar Alizadeh Universiteit Utrecht, Fernando Castor University of Twente and Federal University of Pernambuco
12:21
9m
Live Q&A
Energy: Q&A Session
Research and Experience Papers

14:00 - 15:30
14:00
15m
Talk
A Combinatorial Testing Approach to Hyperparameter OptimizationDistinguished paper Award Candidate
Research and Experience Papers
Krishna Khadka The University of Texas at Arlington, Jaganmohan Chandrasekaran Virginia Tech, Jeff Yu Lei University of Texas at Arlington, Raghu Kacker National Institute of Standards and Technology, D. Richard Kuhn National Institute of Standards and Technology
14:15
15m
Talk
Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs
Research and Experience Papers
Ziyu Li University of Sheffield, Donghwan Shin University of Sheffield
14:30
10m
Talk
LLMs for Test Input Generation for Semantic Applications
Research and Experience Papers
Zafaryab Rasool Applied Artificial Intelligence Institute, Deakin University, Scott Barnett Applied Artificial Intelligence Institute, Deakin University, David Willie Applied Artificial Intelligence Institute, Deakin University, Stefanus Kurniawan Deakin University, Sherwin Balugo Applied Artificial Intelligence Institute, Deakin University, Srikanth Thudumu Deakin University, Mohamed Abdelrazek Deakin University, Australia
14:40
10m
Talk
(Why) Is My Prompt Getting Worse? Rethinking Regression Testing for Evolving LLM APIs
Research and Experience Papers
MA Wanqin The Hong Kong University of Science and Technology, Chenyang Yang Carnegie Mellon University, Christian Kästner Carnegie Mellon University
14:50
10m
Talk
Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models
Research and Experience Papers
Ali Nouri Volvo cars & Chalmers University of Technology, Beatriz Cabrero-Daniel University of Gothenburg, Fredrik Torner Volvo cars, Hakan Sivencrona Zenseact AB, Christian Berger Chalmers University of Technology, Sweden
15:00
10m
Talk
ML-On-Rails: Safeguarding Machine Learning Models in Software Systems – A Case Study
Research and Experience Papers
Hala Abdelkader Applied Artificial Intelligence Institute, Deakin University, Mohamed Abdelrazek Deakin University, Australia, Scott Barnett Applied Artificial Intelligence Institute, Deakin University, Jean-Guy Schneider Monash University, Priya Rani RMIT University, Rajesh Vasa Deakin University, Australia
15:10
20m
Live Q&A
Test - Q&A Session
Research and Experience Papers

14:00 - 15:30
Doctoral Symposium -1Doctoral Symposium at Vianna da Motta
14:00
90m
Other
Doctoral Symposium - 1
Doctoral Symposium

16:00 - 18:00
System QualitiesResearch and Experience Papers / Industry Talks at Pequeno Auditório
Chair(s): Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge
16:00
10m
Talk
Modeling Resilience of Collaborative AI Systems
Research and Experience Papers
Diaeddin Rimawi Free University of Bozen-Bolzano, Antonio Liotta Free University of Bozen-Bolzano, Marco Todescato Fraunhofer Italia, Barbara Russo
16:10
10m
Talk
Seven Failure Points When Engineering a Retrieval Augmented Generation System
Research and Experience Papers
Scott Barnett Applied Artificial Intelligence Institute, Deakin University, Stefanus Kurniawan Deakin University, Srikanth Thudumu Deakin University, Zach Brannelly Deakin University, Mohamed Abdelrazek Deakin University, Australia
16:20
15m
Talk
POLARIS: A framework to guide the development of Trustworthy AI systems
Research and Experience Papers
Maria Teresa Baldassarre Department of Computer Science, University of Bari , Domenico Gigante SER&Practices and University of Bari, Marcos Kalinowski Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Azzurra Ragone University of Bari
16:35
15m
Talk
Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory
Research and Experience Papers
A: Saeid Tizpaz-Niari University of Texas at El Paso, A: Sriram Sankaranarayanan University of Colorado, Boulder
16:50
15m
Talk
Is Your Anomaly Detector Ready for Change? Adapting AIOps Solutions to the Real World
Research and Experience Papers
Lorena Poenaru-Olaru TU Delft, Natalia Karpova TU Delft, Luís Cruz Delft University of Technology, Jan S. Rellermeyer Leibniz University Hannover, Arie van Deursen Delft University of Technology
17:05
15m
Talk
Novel Contract-based Runtime Explainability Framework for End-to-End Ensemble Machine Learning Serving
Research and Experience Papers
Minh-Tri Nguyen Aalto University, Hong-Linh Truong Aalto University, Tram Truong-Huu Singapore Institute of Technology
17:20
10m
Industry talk
Trustworthy AI: Industry-Guided Tooling of the Methods
Industry Talks
Zakaria Chihani CEA, LIST, France
17:30
15m
Live Q&A
System Qualities: Q&A Session
Research and Experience Papers

17:45
15m
Day closing
Closing
Research and Experience Papers
Jan Bosch Chalmers University of Technology
16:00 - 18:00
Doctoral Symposium -2Doctoral Symposium at Vianna da Motta
16:00
2h
Other
Doctoral Symposium - 2
Doctoral Symposium

Submissions

All submissions should be accompanied by an endorsement letter from their advisor including the assessment of the current status of the research and an expected date for the completion of the dissertation.

For the early PhD category, the submissions should be 2 pages long, with one additional page permitted for references only. The submissions should clearly state:

  • the problem to be solved in the student’s research (justify why this problem is important and make clear that previous research and related work has not yet solved that problem),
  • the research hypothesis or claim,
  • the expected contributions of the research,
  • the plan for evaluating the contribution and presenting credible evidence of the results to the community.

For the late PhD category, the submissions should be 4 pages long, with one additional page permitted for references only. The submissions should include the bulleted items and:

  • a description of the results achieved so far, and
  • the planned timeline for completion

All authors should use the official “ACM Primary Article Template”, as can be obtained from the ACM Proceedings Template page. LaTeX users should use the sigconf option, as well as the review (to produce line numbers for easy reference by the reviewers) option. To that end, the following LaTeX code can be placed at the start of the LaTeX document:

\documentclass[sigconf,review]{acmart}
\acmConference[CAIN 2024]{3rd International Conference on AI Engineering — Software Engineering for AI}{April 2024}{Lisbon, Portugal}

All students with accepted submissions will receive feedback via a Zoom session in advance of CAIN. In addition, students with early-stage abstracts will be invited to present their work at CAIN via a poster (following poster formatting described above), while late-stage work will be presented through a talk in the main CAIN conference. The submissions may, at the review committee discretion, be published in the CAIN’24 proceedings.

Authors of all accepted submissions are required to register for the CAIN 2024 conference.

Questions? Use the CAIN Doctoral Symposium contact form.