A Static Analysis Framework for Data Science Notebooks
Fri 13 May 2022 12:05 - 12:10 at ICSE room 2-even hours - Program Analysis 5 Chair(s): Karim Ali
Notebooks provide an interactive environment for programmers to develop code, analyse data and inject interleaved visualisations in a single environment. Despite their flexibility, a major pitfall that data scientists encounter is unexpected behaviour caused by the unique out-of-order execution model of notebooks. As a result data scientists face various challenges ranging from notebook correctness, reproducibility and cleaning. In this paper, we propose a framework that performs static analysis on notebooks, incorporating their unique execution semantics. Compared to previous work, our framework is general in the sense that it accommodate for a wide range of analyses, useful for various notebook use cases. We have instantiated our framework on a diverse set of analyses and have evaluated them on 2211 real world notebooks. Our evaluation demonstrates that the vast majority of notebooks can be analysed in less than a second, within the time frame required for interactive notebook clients.
Thu 12 MayDisplayed time zone: Eastern Time (US & Canada) change
05:00 - 06:00 | Program Analysis 1SEIP - Software Engineering in Practice / Journal-First Papers / Technical Track / NIER - New Ideas and Emerging Results at ICSE room 4-odd hours Chair(s): Shahar Maoz Tel Aviv University, Israel | ||
05:00 5mTalk | Pluto: Exposing Vulnerabilities in Inter-Contract Scenarios Journal-First Papers Fuchen Ma Tsinghua University, Zhenyang Xu University of Waterloo, Meng Ren Tsinghua University, Zijing Yin Tsinghua University, Yuanliang Chen Tsinghua University, Yu Jiang Tsinghua University Pre-print Media Attached | ||
05:05 5mTalk | Toward the Analysis of Graph Neural Network NIER - New Ideas and Emerging Results Thanh-Dat Nguyen University of Melbourne, Le-Cong Thanh Hanoi University of Science and Technology, ThanhVu Nguyen George Mason University, Xuan-Bach D. Le Singapore Management University, Singapore, Quyet Thang Huynh Hanoi University of Science and Technology Pre-print Media Attached | ||
05:10 5mTalk | A Static Analysis Framework for Data Science Notebooks SEIP - Software Engineering in Practice Pre-print Media Attached | ||
05:15 5mTalk | Learning Probabilistic Models for Static Analysis AlarmsBest Artifact Award Technical Track DOI Pre-print Media Attached | ||
05:20 5mTalk | Characterizing and Detecting Bugs in WeChat Mini-Programs Technical Track Tao Wang , Qingxin Xu Institute of Software, Chinese Academy of Sciences, China, Xiaoning Chang Institute of Software, Chinese Academy of Sciences, Wensheng Dou Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Jiaxin Zhu Institute of Software at Chinese Academy of Sciences, China, Jinhui Xie Tencent Inc., Yuetang Deng Tencent, Jianbo Yang Tencent Inc., Jiaheng Yang Tencent Inc., Jun Wei Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Tao Huang Institute of Software Chinese Academy of Sciences Pre-print Media Attached | ||
05:25 5mTalk | Static Inference Meets Deep Learning: A Hybrid Type Inference Approach for PythonNominated for Distinguished Paper Technical Track Yun Peng The Chinese University of Hong Kong, Cuiyun Gao Harbin Institute of Technology, Zongjie Li The Hong Kong University of Science and Technology, Bowei Gao Harbin Institute of Technology, Shenzhen, David Lo Singapore Management University, Qirun Zhang Georgia Institute of Technology, USA, Michael Lyu The Chinese University of Hong Kong DOI Pre-print Media Attached |
Fri 13 MayDisplayed time zone: Eastern Time (US & Canada) change
12:00 - 13:00 | Program Analysis 5SEIP - Software Engineering in Practice / Technical Track / Journal-First Papers at ICSE room 2-even hours Chair(s): Karim Ali University of Alberta | ||
12:00 5mTalk | Guided Feature Identification and Removal for Resource-constrained Firmware Journal-First Papers Ryan Williams Northeastern University, Tongwei Ren Worcester Polytechnic Institute, Lorenzo De Carli Worcester Polytechnic Institute, Long Lu Northeastern University, n.n., Gillian Smith Worcester Polytechnic Institute Link to publication Pre-print Media Attached | ||
12:05 5mTalk | A Static Analysis Framework for Data Science Notebooks SEIP - Software Engineering in Practice Pre-print Media Attached | ||
12:10 5mTalk | PUS: A Fast and Highly Efficient Solver for Inclusion-based Pointer AnalysisDistinguished Paper Award Technical Track Peiming Liu Texas A&M University, Yanze Li University of British Columbia, Bradley Swain Texas A&M University, Jeff Huang Texas A&M University Pre-print Media Attached | ||
12:15 5mTalk | Characterizing and Detecting Bugs in WeChat Mini-Programs Technical Track Tao Wang , Qingxin Xu Institute of Software, Chinese Academy of Sciences, China, Xiaoning Chang Institute of Software, Chinese Academy of Sciences, Wensheng Dou Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Jiaxin Zhu Institute of Software at Chinese Academy of Sciences, China, Jinhui Xie Tencent Inc., Yuetang Deng Tencent, Jianbo Yang Tencent Inc., Jiaheng Yang Tencent Inc., Jun Wei Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Tao Huang Institute of Software Chinese Academy of Sciences Pre-print Media Attached | ||
12:20 5mTalk | Fast and Precise Application Code Analysis using a Partial Library Technical Track Akshay Utture University of California, Los Angeles (UCLA), Jens Palsberg University of California at Los Angeles DOI Pre-print Media Attached | ||
12:25 5mTalk | Inference and Test Generation Using Program Invariants in Chemical Reaction Networks Technical Track Michael C. Gerten Iowa State University, Alexis L. Marsh Iowa State University, James I. Lathrop Iowa State University, Myra Cohen Iowa State University, Andrew S. Miner Iowa State University, Titus H. Klinge Drake University DOI Pre-print Media Attached |