SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics
UI automation is an useful technique for UI testing, bug reproduction and robotic process automation. Recording the user actions with an application assists rapid development of UI automation scripts, but existing recording techniques are intrusive, rely on OS or GUI framework accessibility support or assume specific app implementations. Reversing engineering user actions from screencasts is non-intrusive, but a key reverse-engineering step is currently missing - recognize human-understandable structured user actions ([command] [widget] [location]) from action screencasts. To fill the gap, we propose a deep learning based computer vision model which can recognize 11 commands and 11 widgets, and generate location phrases from action screencasts, through joint learning and multi-task learning. We label a large dataset with 7260 video-action pairs, which record the user interactions with Word, Zoom, Firefox, Photoshop and Window 10 Settings. Through extensive experiments, we confirm the effectiveness and generality of our model, and demonstrate the usefulness of a screencast-to-action-script tool built upon our model for bug reproduction.
Fri 19 MayDisplayed time zone: Hobart change
11:00 - 12:30 | Reverse engineeringTechnical Track / Journal-First Papers / SEIP - Software Engineering in Practice at Meeting Room 104 Chair(s): Wei Le Iowa State University | ||
11:00 15mTalk | SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics Technical Track Dehai Zhao Australian National University, Australia, Zhenchang Xing , Xin Xia Huawei, Deheng Ye Tencent AI Lab, Xiwei (Sherry) Xu CSIRO’s Data61, Liming Zhu CSIRO’s Data61 | ||
11:15 15mTalk | AidUI: Toward Automated Recognition of Dark Patterns in User Interfaces Technical Track S M Hasan Mansur George Mason University, Sabiha Salma George Mason University, Damilola Awofisayo Duke University, Kevin Moran George Mason University | ||
11:30 15mTalk | Carving UI Tests to Generate API Tests and API Specification Technical Track Rahulkrishna Yandrapally University of British Columbia, Canada, Saurabh Sinha IBM Research, Rachel Tzoref-Brill IBM Research, Ali Mesbah University of British Columbia (UBC) Pre-print | ||
11:45 15mTalk | CFG2VEC: Hierarchical Graph Neural Network for Cross-Architectural Software Reverse Engineering SEIP - Software Engineering in Practice Shih-Yuan Yu UCI, Yonatan Achamyeleh UCI, Chonghan Wang UCI, Anton Kocheturov Siemens Technology, Patrick Eisen Siemens Technology, Mohammad Al Faruque UCI | ||
12:00 15mTalk | Ex pede Herculem: Augmenting Activity Transition Graph for Apps via Graph Convolution Network Technical Track Zhe Liu Institute of Software, Chinese Academy of Sciences, Chunyang Chen Monash University, Junjie Wang Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Yuhui Su Institute of Software, Chinese Academy of Sciences, Yuekai Huang Institute of Software, Chinese Academy of Sciences, Jun Hu Institute of Software, Chinese Academy of Sciences, Qing Wang Institute of Software at Chinese Academy of Sciences; University of Chinese Academy of Sciences | ||
12:15 7mTalk | VID2XML: Automatic Extraction of a Complete XML Data from Mobile Programming Screencasts Journal-First Papers Mohammad D. Alahmadi Department of Software Engineering, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia. |