Empirical Characterization of User Reports About Cloud Failures (ACSOS 2021 - Main Track)

Who

Sacheendra Talluri, Leon Overweel, Laurens Versluis, Animesh Trivedi, Alexandru Iosup

Track

ACSOS 2021 Main Track

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 29 Sep 2021 12:35 - 12:50 at AUDITORIUM 1 - Resource Management in Data Centers and Cloud Computing I Chair(s): Vana Kalogeraki, Samuel Kounev

Abstract

Cloud services are important for healthcare, banking, communication, and other purposes. Inevitably, such services fail, harming the processes and disturbing the people that depend on them. With the rapid increase in the use of cloud services, especially in 2020 during the COVID-19 period, more failures are expected to occur in cloud services. Understanding failure in cloud services is challenging, but important to help preventing them.

Much work has studied failure logs and reports provided by infrastructure operators. However, there is a paucity of information about how users perceive the failures of cloud services. In this work, we collect user-reported failures and characterize them empirically. We collect failures reported by users to the trusted aggregator Outage Report for 12 cloud services over 16 months spread across 2019 and 2020. We show evidence that user-reported failures not only capture major failures also self-reported by cloud operators, but also provide information about additional failures. We count and analyze time patterns in these reports. We further derive failures from sets of reports and characterize their duration and interarrival time. We make 9~main observations about how users perceive failure in cloud services. We find over 10x differences in request failure rates across microservice structures when using user reported traces compared to using constant a failure distribution. Overall, our study provides the first long-term characterization of user-reported cloud failures.

Sacheendra Talluri

Vrije Universiteit Amsterdam, Netherlands

Leon Overweel

Dexter Energy

Laurens Versluis

Vrije Universiteit Amsterdam

Animesh Trivedi

Vrije Universiteit Amsterdam

Alexandru Iosup

Vrije Universiteit Amsterdam

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 29 Sep
Displayed time zone: Eastern Time (US & Canada) change

11:45 - 12:50	Resource Management in Data Centers and Cloud Computing IMain Track at AUDITORIUM 1 Chair(s): Vana Kalogeraki Athens University of Economics and Business, Samuel Kounev University of Würzburg, Germany

11:45 25m Paper		FaaSRank: Learning to Schedule Functions in Serverless Platforms Main Track Hanfei Yu University of Washington, Tacoma, Athirai Irissappane University of Washington, Tacoma, Hao Wang Louisiana State University, USA, Wes Loyd University of Washington, Tacoma
12:10 25m Paper		Many Models at the Edge: Characterizing and Improving Deep Inference via Model-Level Caching Main Track Samuel Odgen Worcester Polytechnic Institute, Guin Gilman Worcester Polytechnic Institute, Robert Walls Worcester Polytechnic Institute, Tian Guo Worcester Polytechnic Institute
12:35 15m Short-paper		Empirical Characterization of User Reports About Cloud Failures Main Track Sacheendra Talluri Vrije Universiteit Amsterdam, Netherlands, Leon Overweel Dexter Energy, Laurens Versluis Vrije Universiteit Amsterdam, Animesh Trivedi Vrije Universiteit Amsterdam, Alexandru Iosup Vrije Universiteit Amsterdam