Enabling Pipeline Parallelism in Heterogeneous Managed Runtime Environments via Batch Processing (VEE 2022 - Research Papers)

Who

Florin Blanaru, Athanasios Stratikopoulos, Juan Fumero, Christos Kotselidis

Track

VEE 2022 Research Papers

Time Zone

The program is currently displayed in (GMT+01:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+01:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 1 Mar 2022 15:30 - 15:50 at Online - Session-2: Runtime Virtualization Chair(s): Mingyu Wu

Abstract

During the last decade, managed runtime systems have been constantly evolving to become capable of exploiting underlying hardware accelerators, such as GPUs and FPGAs. Regardless of the programming language and their corresponding runtime systems, the majority of the work has been focusing on the compiler front trying to tackle the challenging task of how to enable just-in-time compilation and execution of arbitrary code segments on various accelerators. Besides this challenging task, another important aspect that defines both functional correctness and performance of managed runtime systems is that of automatic memory management. Although automatic memory management improves productivity by abstracting away memory allocation and maintenance, it hinders the capability of using specific memory regions, such as pinned memory, in order to perform data transfer times between the CPU and hardware accelerators.

In this paper, we introduce and evaluate a series of memory optimizations specifically tailored for heterogeneous managed runtime systems. In particular, we propose: (i) transparent and automatic “parallel batch processing” for overlapping data transfers and computation between the host and hardware accelerators in order to enable pipeline parallelism, and (ii) “off-heap pinned memory” in combination with parallel batch processing in order to increase the performance of data transfers without posing any on-heap overheads. These two techniques have been implemented in the context of the state-of-the-art open-source TornadoVM and their combination can lead up to 2.5x end-to-end performance speedup against sequential batch processing.

Link to Preprint

https://www.research.manchester.ac.uk/portal/en/publications/enabling-pipeline-parallelism-in-heterogeneous-managed-runtime-environments-via-batch-processing(690bb1ce-badc-45a9-89c5-05a5cfcad45f).html

DOI

https://doi.org/10.1145/3516807.3516821

Florin Blanaru

The University of Manchester

Athanasios Stratikopoulos

The University of Manchester

United Kingdom

Juan Fumero

University of Manchester, UK

United Kingdom

Christos Kotselidis

KTM Innovation / The University of Manchester

United Kingdom

Time Zone

The program is currently displayed in (GMT+01:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+01:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 1 Mar
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

15:30 - 16:30	Session-2: Runtime VirtualizationResearch Papers at Online Chair(s): Mingyu Wu Shanghai Jiao Tong University

15:30 20m Talk		Enabling Pipeline Parallelism in Heterogeneous Managed Runtime Environments via Batch Processing Research Papers Florin Blanaru The University of Manchester, Athanasios Stratikopoulos The University of Manchester, Juan Fumero University of Manchester, UK, Christos Kotselidis KTM Innovation / The University of Manchester DOI Pre-print
15:50 20m Talk		Transparent and Lightweight Object Placement for Managed Workloads atop Hybrid Memories Research Papers Zhe Li Shanghai Jiao Tong University, Mingyu Wu Shanghai Jiao Tong University
16:10 20m Talk		Capability Boehm: Challenges and Opportunities for Garbage Collection with Capability Hardware Research Papers Dejice Jacob University of Glasgow, UK, Jeremy Singer University of Glasgow Link to publication DOI Pre-print

Information for Participants

Tue 1 Mar 2022 15:30 - 16:30 at Online - Session-2: Runtime Virtualization Chair(s): Mingyu Wu

Info for session

The Zoom room for Session 2 is at https://rochester.zoom.us/j/95639573724?pwd=Q3Fscitpd3VIcnVTaEMwRTFUS2hRdz09.