Write a Blog >>
ASE 2020
Mon 21 - Fri 25 September 2020 Melbourne, Australia
Wed 23 Sep 2020 00:00 - 00:20 at Platypus - Incidents and Bug Reports Chair(s): Yepang Liu

[experience paper] Although tremendous efforts have been devoted to the quality assurance of online service systems, in reality, these systems still come across many incidents (i.e., unplanned interruptions and outages), which can decrease user satisfaction or cause economic loss. To better understand the characteristics of incidents and improve the incident management process, we perform the first large-scale empirical analysis of incidents collected from 18 real-world online service systems in a multinational technology company M. Surprisingly, we find that although a large number of incidents could occur over a short period of time, many of them actually do not matter, i.e., engineers will not fix them with a high priority after manually diagnosing their root cause. We call these incidents incidental incidents. Our qualitative and quantitative analyses show that incidental incidents are significant in terms of both number and cost. Therefore, it is important to prioritize incidents by identifying incidental incidents in advance to optimize incident management efforts. In particular, we propose an approach, called DeepIP (Deep learning based Incident Prioritization), to prioritizing incidents based on a large amount of historical incident data. More specifically, we design an attention-based CNN (Convolutional Neural Network) model to learn a prediction model to identify incidental incidents. We then prioritize all incidents by ranking the predicted probabilities of incidents being incidental. We evaluate the performance of DeepIP using real-world incident data. The experimental results show that DeepIP effectively prioritizes incidents by identifying incidental incidents and significantly outperforms all the compared approaches. For example, the AUC of DeepIP achieves 0.808, while that of the best compared approach is only 0.624 onaverage. Also, we share our experience and lessons learned from practice.

Wed 23 Sep
Times are displayed in time zone: (UTC) Coordinated Universal Time change

00:00 - 01:00: Incidents and Bug ReportsResearch Papers at Platypus
Chair(s): Yepang LiuSouthern University of Science and Technology
00:00 - 00:20
Talk
Research Papers
Junjie ChenTianjin University, China, Shu ZhangMicrosoft Research, Beijing, Xiaoting HeMicrosoft, Qingwei LinMicrosoft Research, China, Hongyu ZhangUniversity of Newcastle, Australia, Dan HaoPeking University, China, Yu KangMicrosoft Research, China, Feng GaoMicrosoft, China, Zhangwei XuMicrosoft, China, Yingnong DangMicrosoft, USA, Dongmei ZhangMicrosoft Research, China
00:20 - 00:40
Talk
Research Papers
Songqiang ChenSchool of Computer Science, Wuhan University, Xiaoyuan XieSchool of Computer Science, Wuhan University, China, Bangguo YinSchool of Computer Science, Wuhan University, Yuanxiang JiSchool of Computer Science, Wuhan University, Lin ChenNanjing University, Baowen XuState Key Laboratory for Novel Software Technology, Nanjing University
00:40 - 01:00
Talk
Research Papers
Zhe LiuLaboratory for Internet Software Technologies, Institute of Software Chinese Academy of Sciences, University of Chinese Academy of Sciences, Chunyang ChenMonash University, Australia, Junjie WangInstitute of Software, Chinese Academy of Sciences, Yuekai HuangInstitute of Software, Chinese Academy of Sciences, Jun HuInstitute of Software, Chinese Academy of Sciences, Qing WangInstitute of Software, Chinese Academy of Sciences