ASE 2023
Mon 11 - Fri 15 September 2023 Kirchberg, Luxembourg
Tue 12 Sep 2023 16:06 - 16:18 at Room C - Testing AI Systems 3 Chair(s): Mike Papadakis

Computational notebooks have become the go-to way for solving data-science problems. While they are designed to combine code and documentation, prior work shows that documentation is largely ignored by the developers because of the manual effort. Automated documentation generation can help, but existing techniques fail to capture algorithmic details and developers often end up editing the generated text to provide more explanation and sub-steps. This paper proposes a novel machine-learning pipeline, Cell2Doc, for code cell documentation in Python data science notebooks. Our approach works by identifying different logical contexts within a code cell, generating documentation for them separately, and finally combining them to arrive at the documentation for the entire code cell. Cell2Doc takes advantage of the capabilities of existing pre-trained language models and improves their efficiency for code cell documentation. We also provide a new benchmark dataset for this task, along with a data- preprocessing pipeline that can be used to create new datasets. We also investigate an appropriate input representation for this task. Our automated evaluation suggests that our best input representation improves the pre-trained model’s performance by 2.5x on average. Further, Cell2Doc achieves 1.33x improvement during human evaluation in terms of correctness, informativeness, and readability against the corresponding standalone pre-trained model.

Tue 12 Sep

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

15:30 - 17:00
Testing AI Systems 3Journal-first Papers / Research Papers / Industry Showcase (Papers) / NIER Track at Room C
Chair(s): Mike Papadakis University of Luxembourg, Luxembourg
15:30
12m
Research paper
Causality-Aided Trade-off Analysis for Machine Learning Fairness
Research Papers
Zhenlan Ji The Hong Kong University of Science and Technology, Pingchuan Ma HKUST, Shuai Wang Hong Kong University of Science and Technology, Yanhui Li Nanjing University
Pre-print
15:42
12m
Talk
Towards Self-Adaptive Machine Learning-Enabled Systems Through QoS-Aware Model Switching
NIER Track
Shubham Kulkarni IIIT Hyderabad, Arya Marda IIIT Hyderabad, Karthik Vaidhyanathan IIIT Hyderabad
Pre-print
15:54
12m
Talk
Challenges of Accurate and Efficient AutoML
Industry Showcase (Papers)
Swarnava Dey TCS Research, Avik Ghose TCS Research, Soumik Das Tata Consultancy Services Ltd.
File Attached
16:06
12m
Talk
Cell2Doc: ML Pipeline for Generating Documentation in Computational Notebooks
Research Papers
Tamal Mondal IIT Hyderabad, Scott Barnett Deakin University, Akash Lal Microsoft Research, Jyothi Vedurada IIT Hyderabad
Pre-print Media Attached
16:18
12m
Talk
Evaluating Pre-Trained Models for User Feedback Analysis in Software Engineering: A Study on Classification of App-Reviews
Journal-first Papers
Mohammad Abdul Hadi University of British Columbia, Fatemeh Hendijani Fard University of British Columbia
File Attached
16:30
12m
Talk
Evolve the Model universe of a System UniverseRecorded talk
NIER Track
Tao Yue Beihang University, Shaukat Ali Simula Research Laboratory and Oslo Metropolitan University
Media Attached