FIXME: Enhance Software Reliability with Hybrid Approaches in CloudSEIP
Fri 28 May 2021 03:45 - 04:05 at Blended Sessions Room 1 - 3.3.1. Monitoring Cloud-Based Services
With the promise of reliability in cloud, more enterprises are migrating to cloud. The process of continuous integration/deployment (CICD) in cloud connects developers who need to deliver value faster and more transparently with site reliability engineers (SREs) who need to manage applications reliably. SREs feed back development issues to developers, and developers commit fixes and trigger CICD to redeploy. The release cycle is more continuous than ever, thus the code to production is faster and more automated. To provide this higher level agility, the cloud platforms become more complex in the face of flexibility with deeper layers of virtualization. However, reliability does not come for free with all these complexities. Software engineers and SREs need to deal with wider information spectrum from virtualized layers. Therefore, providing correlated information with true positive evidences is critical to identify the root cause of issues quickly in order to reduce mean time to recover (MTTR), performance metrics for SREs. Similarity, knowledge, or statistics driven approaches have been effective, but with increasing data volume and types, an individual approach is limited to correlate semantic relations of different data sources. In this paper, we introduce FIXME to enhance software reliability with hybrid diagnosis approaches for enterprises. Our evaluation results show using hybrid diagnosis approach is about 17% better in precision. The results are helpful for both practitioners and researchers to develop hybrid diagnosis in the highly dynamic cloud environment.
Thu 27 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
15:05 - 16:05 | 3.3.1. Monitoring Cloud-Based ServicesTechnical Track / SEIP - Software Engineering in Practice at Blended Sessions Room 1 +12h Chair(s): Andrea Zisman The Open University | ||
15:05 20mPaper | Fast Outage Analysis of Large-scale Production Clouds with Service Correlation MiningTechnical Track Technical Track Yaohui Wang Fudan University, Guozheng Li Peking University, Zijian Wang Fudan University, Yu Kang Microsoft Research, Beijing, China, Yangfan Zhou Fudan University, Hongyu Zhang The University of Newcastle, Feng Gao Microsoft Azure, Jeffrey Sun Microsoft Azure, Li Yang Microsoft Azure, Pochian Lee Microsoft Azure, Zhangwei Xu Microsoft Azure, Pu Zhao Microsoft Research, Beijing, China, Bo Qiao Microsoft Research, Beijing, China, Liqun Li Microsoft Research, Beijing, China, Xu Zhang Microsoft Research, Beijing, China, Qingwei Lin Microsoft Research, Beijing, China Pre-print Media Attached | ||
15:25 20mPaper | Neural Knowledge Extraction From Cloud Service IncidentsSEIP SEIP - Software Engineering in Practice Manish Shetty Microsoft Research, India, Chetan Bansal Microsoft Research, Sumit Kumar Microsoft, Nikitha Rao Microsoft Research, Nachiappan Nagappan Microsoft Research, Thomas Zimmermann Microsoft Research Link to publication DOI Pre-print Media Attached | ||
15:45 20mPaper | FIXME: Enhance Software Reliability with Hybrid Approaches in CloudSEIP SEIP - Software Engineering in Practice Jinho Hwang IBM Research, Larisa Shwartz IBM, Qing Wang Institute of Software, Chinese Academy of Sciences, Raghav Batta IBM, Harshit Kumar IBM, Michael Nidd IBM Pre-print Media Attached |
Fri 28 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
03:05 - 04:05 | 3.3.1. Monitoring Cloud-Based ServicesTechnical Track / SEIP - Software Engineering in Practice at Blended Sessions Room 1 | ||
03:05 20mPaper | Fast Outage Analysis of Large-scale Production Clouds with Service Correlation MiningTechnical Track Technical Track Yaohui Wang Fudan University, Guozheng Li Peking University, Zijian Wang Fudan University, Yu Kang Microsoft Research, Beijing, China, Yangfan Zhou Fudan University, Hongyu Zhang The University of Newcastle, Feng Gao Microsoft Azure, Jeffrey Sun Microsoft Azure, Li Yang Microsoft Azure, Pochian Lee Microsoft Azure, Zhangwei Xu Microsoft Azure, Pu Zhao Microsoft Research, Beijing, China, Bo Qiao Microsoft Research, Beijing, China, Liqun Li Microsoft Research, Beijing, China, Xu Zhang Microsoft Research, Beijing, China, Qingwei Lin Microsoft Research, Beijing, China Pre-print Media Attached | ||
03:25 20mPaper | Neural Knowledge Extraction From Cloud Service IncidentsSEIP SEIP - Software Engineering in Practice Manish Shetty Microsoft Research, India, Chetan Bansal Microsoft Research, Sumit Kumar Microsoft, Nikitha Rao Microsoft Research, Nachiappan Nagappan Microsoft Research, Thomas Zimmermann Microsoft Research Link to publication DOI Pre-print Media Attached | ||
03:45 20mPaper | FIXME: Enhance Software Reliability with Hybrid Approaches in CloudSEIP SEIP - Software Engineering in Practice Jinho Hwang IBM Research, Larisa Shwartz IBM, Qing Wang Institute of Software, Chinese Academy of Sciences, Raghav Batta IBM, Harshit Kumar IBM, Michael Nidd IBM Pre-print Media Attached |