ICSE 2024 (series) / CAIN 2024 (series) / Posters /
A Roadmap for Enriching Jupyter Notebooks Documentation with Kaggle Data
Mon 15 Apr 2024 09:06 - 09:09 at Pequeno Auditório - Keynote and Posters Chair(s): Jan Bosch, Henry Muccini
With the advancement of AI and data science in recent years, the use of computational notebooks, like Jupyter notebooks, has also increased. As such, various AI-Based automated tools have been also developed to automatically document notebooks. One of the main challenges in training AI models is the lack of appropriate datasets. In this paper, we outline a roadmap for developing a valuable dataset of markdown and code cell pairs centered on functions in Jupyter notebooks. The roadmap encompasses four high-level steps: \emph{structural filtering}, \emph{structural processing}, \emph{conceptual filtering}, and \emph{conceptual processing}. Our proposed roadmap leads to providing a quality dataset for training AI models on Jupyter notebooks.
Mon 15 AprDisplayed time zone: Lisbon change
Mon 15 Apr
Displayed time zone: Lisbon change
09:00 - 10:30 | Keynote and PostersPosters / Research and Experience Papers at Pequeno Auditório Chair(s): Jan Bosch Chalmers University of Technology, Henry Muccini University of L'Aquila, Italy | ||
09:00 3mTalk | A Domain Specific Language for Specification of Risk-oriented Object Detection Requirements Posters | ||
09:03 3mTalk | AI Security Continuum: Concept and Challenges Posters | ||
09:06 3mTalk | A Roadmap for Enriching Jupyter Notebooks Documentation with Kaggle Data Posters Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University | ||
09:09 3mTalk | Automating Patch Set Generation from Code Reviews Using Large Language Models Posters Md Tajmilur Rahman Gannon University | ||
09:12 3mTalk | Data Selection Driven by Item Difficulty: On Investigating Data Efficient Practice for Hyperparameter Search Posters Gustavo Rodrigues dos Reis NAVER LABS Europe/LIG - UGA, Adrian Mos NAVER LABS Europe, Mario Cortes Cornax LIG - UGA, Cyril Labbé LIG - UGA | ||
09:15 3mTalk | Beyond Syntax: Unleashing the Power of Computational Notebooks Code Metrics in Documentation Generation Posters Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Ashkan Khademian Department of Computer Engineering of Sharif University of Technology, Sepehr Kianian Department of Computer Engineering of Sharif University of Technology, Alireza Asadi Department of Computer Engineering of Sharif University of Technology, Hamed Jahantigh Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University | ||
09:18 3mTalk | Can causality accelerate experimentation in software systems? Posters Andrei Paleyes Department of Computer Science and Technology, Univesity of Cambridge, Han-Bo Li Department of Computer Science and Technology, University of Cambridge, Neil D. Lawrence Department of Computer Science and Technology, Univesity of Cambridge | ||
09:21 3mTalk | Custom Developer GPT for Ethical AI Solutions Posters Lauren Olson Vrije Universiteit Amsterdam Pre-print | ||
09:24 3mTalk | Evaluation of The Generality of Multi-view Modeling Framework for ML Systems Posters Jati H. Husen Waseda University, Japan, Jomphon Runpakprakun Waseda University, Japan, Sun Chang Waseda University, Japan, Hironori Washizaki Waseda University, Hnin Thandar Tun Waseda University, Japan, Nobukazu Yoshioka Waseda University, Japan, Yoshiaki Fukazawa Waseda University | ||
09:27 3mTalk | Prompt Smells: An Omen for Undesirable Generative AI Outputs Posters Krishna Ronanki University Of Gothenburg, Beatriz Cabrero-Daniel University of Gothenburg, Christian Berger Chalmers University of Technology, Sweden | ||
09:30 3mTalk | Taxonomy of Generative AI Applications for Risk Assessment Posters Hiroshi Tanaka Fujitsu Limited, Tokyo, Japan, Masaru Ide Fujitsu Limited, Jun Yajima Fujitsu Limited, Sachiko Onodera Fujitsu Limited, Kazuki Munakata Fujitsu Limited, Tokyo, Japan, Nobukazu Yoshioka Waseda University, Japan | ||
09:35 55mKeynote | Keynote by Christian Kästner - From Models to Systems: On the Role of Software Engineering for Machine Learning Research and Experience Papers Christian Kästner Carnegie Mellon University |