CAIN 2024
Sun 14 - Mon 15 April 2024 Lisbon, Portugal
co-located with ICSE 2024
Mon 15 Apr 2024 09:06 - 09:09 at Pequeno Auditório - Keynote and Posters Chair(s): Jan Bosch, Henry Muccini

With recent advances in AI and data science, the use of computational notebooks such as Jupyter notebooks has increased. Consequently, various AI-based tools have been developed to document notebooks automatically. One of the main challenges in training AI models is the lack of appropriate datasets. In this paper, we outline a roadmap for developing a valuable dataset of markdown and code cell pairs centered on functions in Jupyter notebooks. The roadmap encompasses four high-level steps: structural filtering, structural processing, conceptual filtering, and conceptual processing. Our proposed roadmap leads to a quality dataset for training AI models on Jupyter notebooks.
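The abstract does not describe how the four steps are implemented, so the following is only an illustrative sketch of what "markdown and code cell pairs centered on functions" could look like in practice. It assumes the notebook is already loaded as an nbformat v4 style dictionary, pairs each markdown cell with the code cell directly after it, and keeps the pair only if the code cell defines a function. The helper names (`cell_text`, `defines_function`, `extract_pairs`) and the adjacency rule are assumptions made for this example, not the authors' method.

```python
import ast


def cell_text(cell):
    """Return a cell's source as one string (it may be a str or a list of str)."""
    source = cell.get("source", "")
    return "".join(source) if isinstance(source, list) else source


def defines_function(code):
    """True if the code parses as Python and contains a function definition."""
    try:
        tree = ast.parse(code)
    except SyntaxError:
        # Cells with IPython magics or broken code are skipped in this sketch.
        return False
    return any(
        isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        for node in ast.walk(tree)
    )


def extract_pairs(notebook):
    """Pair each markdown cell with the code cell directly after it,
    keeping only pairs whose code cell defines a function."""
    cells = notebook.get("cells", [])
    pairs = []
    for prev, curr in zip(cells, cells[1:]):
        if prev.get("cell_type") != "markdown" or curr.get("cell_type") != "code":
            continue
        markdown, code = cell_text(prev).strip(), cell_text(curr)
        if markdown and defines_function(code):
            pairs.append((markdown, code))
    return pairs


# Hypothetical in-memory notebook, used only to exercise the sketch.
example_notebook = {
    "cells": [
        {"cell_type": "markdown", "source": ["# Helpers\n", "Normalize a column."]},
        {
            "cell_type": "code",
            "source": [
                "def normalize(xs):\n",
                "    m = max(xs)\n",
                "    return [x / m for x in xs]\n",
            ],
        },
        {"cell_type": "markdown", "source": "Plot the result."},
        {"cell_type": "code", "source": "%matplotlib inline\nprint('hi')"},
    ]
}

pairs = extract_pairs(example_notebook)
```

For a notebook stored on disk, the same dictionary can be obtained with `json.load` on the `.ipynb` file. A real pipeline would likely need further filtering (for example of non-English or empty documentation), which this sketch leaves out.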

Mon 15 Apr

Displayed time zone: Lisbon

09:00 - 10:30
Keynote and Posters (Posters / Research and Experience Papers) at Pequeno Auditório
Chair(s): Jan Bosch Chalmers University of Technology, Henry Muccini University of L'Aquila, Italy
09:00
3m
Talk
A Domain Specific Language for Specification of Risk-oriented Object Detection Requirements
Posters
Junji Hashimoto GREE, Inc., Nobukazu Yoshioka Waseda University
09:03
3m
Talk
AI Security Continuum: Concept and Challenges
Posters
Hironori Washizaki Waseda University, Nobukazu Yoshioka Waseda University
09:06
3m
Talk
A Roadmap for Enriching Jupyter Notebooks Documentation with Kaggle Data
Posters
Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University
09:09
3m
Talk
Automating Patch Set Generation from Code Reviews Using Large Language Models
Posters
Md Tajmilur Rahman Gannon University
09:12
3m
Talk
Data Selection Driven by Item Difficulty: On Investigating Data Efficient Practice for Hyperparameter Search
Posters
Gustavo Rodrigues dos Reis NAVER LABS Europe/LIG - UGA, Adrian Mos NAVER LABS Europe, Mario Cortes Cornax LIG - UGA, Cyril Labbé LIG - UGA
09:15
3m
Talk
Beyond Syntax: Unleashing the Power of Computational Notebooks Code Metrics in Documentation Generation
Posters
Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University
09:18
3m
Talk
Can causality accelerate experimentation in software systems?
Posters
Andrei Paleyes Department of Computer Science and Technology, University of Cambridge, Han-Bo Li Department of Computer Science and Technology, University of Cambridge, Neil D. Lawrence Department of Computer Science and Technology, University of Cambridge
09:21
3m
Talk
Custom Developer GPT for Ethical AI Solutions
Posters
Lauren Olson Vrije Universiteit Amsterdam
Pre-print
09:24
3m
Talk
Evaluation of The Generality of Multi-view Modeling Framework for ML Systems
Posters
Jati H. Husen Waseda University, Japan, Jomphon Runpakprakun Waseda University, Japan, Sun Chang Waseda University, Japan, Hironori Washizaki Waseda University, Hnin Thandar Tun Waseda University, Japan, Nobukazu Yoshioka Waseda University, Japan, Yoshiaki Fukazawa Waseda University
09:27
3m
Talk
Prompt Smells: An Omen for Undesirable Generative AI Outputs
Posters
Krishna Ronanki University of Gothenburg, Beatriz Cabrero-Daniel University of Gothenburg, Christian Berger Chalmers University of Technology, Sweden
09:30
3m
Talk
Taxonomy of Generative AI Applications for Risk Assessment
Posters
Hiroshi Tanaka Fujitsu Limited, Tokyo, Japan, Masaru Ide Fujitsu Limited, Jun Yajima Fujitsu Limited, Sachiko Onodera Fujitsu Limited, Kazuki Munakata Fujitsu Limited, Tokyo, Japan, Nobukazu Yoshioka Waseda University, Japan
09:35
55m
Keynote
Keynote by Christian Kästner - From Models to Systems: On the Role of Software Engineering for Machine Learning
Research and Experience Papers
Christian Kästner Carnegie Mellon University