mPart: Miss-Ratio Curve Guided Partitioning in Key-Value Stores (ISMM 2018)

Who

Daniel Byrne, Nilufer Onder, Zhenlin Wang

Track

ISMM 2018

When

Mon 18 Jun 2018 16:00 - 16:30 (GMT-04:00, Eastern Time) at Discovery AB
Session: Analyzing the Cache and Scheduling
Chair(s): Michael D. Bond

Abstract

Web applications employ key-value stores to cache the data that is most commonly accessed.
The cache improves a web application's performance by serving its requests from memory, avoiding
fetches from the backend database. Since memory space is limited, maximizing
memory utilization is key to delivering the best possible performance. This has led to
the use of multi-tenant systems, allowing applications to share cache space. In addition, application
data access patterns change over time, so the system should be adaptive in its memory allocation.

In this work, we address both multi-tenancy (where a single cache is used for multiple applications)
and dynamic workloads (changing access patterns) using a model that relates the cache size to the
application miss ratio, known as a miss ratio curve. Intuitively, the larger the cache, the less likely the
system will need to fetch the data from the database.
Our efficient, online construction of the miss ratio curve allows us
to determine a near-optimal memory allocation given the available system memory, while
adapting to changing data access patterns.
We show that our model outperforms an existing
state-of-the-art sharing model, Memshare, in terms of overall cache hit ratio
and does so at a lower time cost.
We show that for a typical system, overall hit ratio is consistently 1 percentage point greater and
99.9th percentile latency is reduced by as much as 2.9% under standard web application
workloads containing millions of requests.
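
As a rough illustration of the idea in the abstract, the sketch below builds a miss ratio curve for each tenant and then splits a shared cache between tenants. It is not the mPart algorithm: the abstract does not describe the construction, so this uses an exact, offline LRU stack-distance computation and a simple greedy allocation as stand-ins. The function names (lru_miss_ratio_curve, partition), the measurement of cache size in items rather than bytes, and the example traces are all assumptions made for illustration.

```python
def lru_miss_ratio_curve(trace, max_size):
    """Return a list mrc where mrc[c] is the LRU miss ratio with room for c items.

    Exact but slow (a linear scan per access); an online system would
    need an approximation. Assumes a non-empty trace.
    """
    stack = []                      # keys ordered from least to most recently used
    hist = [0] * (max_size + 1)     # hist[d] = accesses with stack distance d
    for key in trace:
        if key in stack:
            # Stack distance: 1 means the key was the most recently used one.
            d = len(stack) - stack.index(key)
            stack.remove(key)
            if d <= max_size:
                hist[d] += 1
        stack.append(key)

    n = len(trace)
    mrc, hits = [1.0], 0            # a cache of size 0 misses on every access
    for c in range(1, max_size + 1):
        hits += hist[c]             # an access hits if its distance is <= c
        mrc.append(1 - hits / n)
    return mrc


def partition(traces, total):
    """Greedily assign `total` cache slots across tenants.

    Assumed policy: each slot goes to the tenant whose curve predicts the
    largest drop in misses. This is only optimal when the curves are convex.
    """
    mrcs = {name: lru_miss_ratio_curve(t, total) for name, t in traces.items()}
    alloc = {name: 0 for name in traces}

    def gain(name):
        c = alloc[name]
        # Misses avoided by growing this tenant's share from c to c + 1 slots.
        return (mrcs[name][c] - mrcs[name][c + 1]) * len(traces[name])

    for _ in range(total):
        best = max(alloc, key=gain)
        alloc[best] += 1
    return alloc


# Hypothetical tenants: A reuses its keys heavily, B mostly does not.
traces = {"A": list("xxxyxy"), "B": list("ppqr")}
allocation = partition(traces, 3)
```

Re-running the curve construction over a recent window of requests is one way such an allocation could follow changing access patterns. The greedy step can miss gains when a curve has a cliff (for example, a cyclic scan that only hits once the whole cycle fits), which is one reason a real system would need a more careful allocator.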

Daniel Byrne

Michigan Technological University, USA

Nilufer Onder

Michigan Technological University, USA

Zhenlin Wang

Michigan Technological University, USA