Towards Understanding Fine-Grained Programming Mistakes and Fixing Patterns in Data Science
Programming is an essential activity in data science (DS). Compared with conventional programmers, DS programmers often use different environments (e.g., Jupyter Notebook, R Markdown) instead of traditional IDEs. Thus, it’s crucial to understand what kinds of mistakes they make and how they debug and fix these errors in order to provide effective tool support to improve their productivity. Previous studies have analyzed DS code from public code-sharing platforms such as GitHub and Kaggle. However, they only accounted for code changes committed to the version history, omitting many programming mistakes that are resolved before code commits. To bridge the gap, we present an in-depth analysis of the fine-grained logs of a DS competition, which include 390 Jupyter Notebooks written by 67 participants over six weeks. In addition, we conducted semi-structured interviews with 10 DS programmers from different domains to understand the reasons behind their programming mistakes. In this work, we identified several unique programming mistakes and fix patterns that were not reported before, highlighting several future opportunities for designing new tool support for DS programming.
Mon 23 Jun (displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna)
14:00 - 15:30 | Bugs | Research Papers / Industry Papers / Ideas, Visions and Reflections | Pirsenteret 150 | Chair(s): Ying Zou (Queen's University, Kingston, Ontario)
14:00 | 20m | Talk | Dissecting Real-World Cross-Language Bugs | Research Papers
14:20 | 20m | Talk | Towards Understanding Fine-Grained Programming Mistakes and Fixing Patterns in Data Science | Research Papers | Weihao Chen (Purdue University), Jia Lin Cheoh (Purdue University), Manthan Keim (Purdue University), Sabine Brunswicker (Purdue University), Tianyi Zhang (Purdue University)
14:40 | 20m | Talk | Error Delayed is Not Error Handled: Understanding and Fixing Propagated Error-Handling Bugs | Research Papers | Haoran Liu, Shanshan Li, Zhouyang Jia, Yuanliang Zhang, Linxiao Bai, Si Zheng, Xiaoguang Mao, Liao Xiangke (all National University of Defense Technology)
15:00 | 10m | Talk | “Silent Is Not Actually Silent”: An Investigation of Toxicity on Bug Report Discussion | Ideas, Visions and Reflections | Mia Mohammad Imran (Missouri University of Science and Technology), Jaydeb Sarker (University of Nebraska at Omaha)
15:10 | 20m | Talk | Behind the Hot Fix: Demystifying Hot Fixing Industrial Practices at Zühlke and Beyond | Industry Papers | Carol Hanna (University College London), David Elliman (Zuhlke Engineering Ltd), Wolfgang Emmerich (Zuhlke Engineering Ltd), Federica Sarro (University College London), Justyna Petke (University College London)
This room is located in the Pirsenteret (The Pier Center) convention center, outside the Clarion Hotel. The center is just behind the hotel, towards the fjord.
You should be able to go through the emergency exit at Clarion, just beside the Cosmos 3 wing, which will bring you close to Pirsenteret.
The entrance to the center is from here:
https://maps.app.goo.gl/dU3qH6kAimXGBNHe7
Once inside, go straight ahead and follow the signage to the room, which is known as room 150 inside the center.