Write a Blog >>
VEE 2017
Sat 8 - Sun 9 April 2017 Xi'an, China
Sat 8 Apr 2017 14:30 - 15:00 at Zhu Que Room - GPUs, FPGAs, etc Chair(s): David Chisnall

Computer systems are increasingly featuring powerful parallel devices with the advent of many-core CPUs and GPUs. This offers the opportunity to solve computationally-intensive problems at a fraction of the time traditional CPUs need. However, exploiting heterogeneous hardware requires the use of low-level programming language approaches such as OpenCL, which is incredibly challenging, even for advanced programmers.

On the application side, interpreted dynamic languages are increasingly becoming popular in many domains due to their simplicity, expressiveness and flexibility. However, this creates a wide gap between the high-level abstractions offered to programmers and the low-level hardware-specific interface. Currently, programmers must rely on high performance libraries or they are forced to write parts of their application in a low-level language like OpenCL. Ideally, non-expert programmers should be able to exploit heterogeneous hardware directly from their interpreted dynamic languages.

In this paper, we present a technique to transparently and automatically offload computations from interpreted dynamic languages to heterogeneous devices. Using just-in-time compilation, we automatically generate OpenCL code at runtime which is specialized to the actual observed data types using profiling information. We demonstrate our technique using \emph{R}, which is a popular interpreted dynamic language predominately used in big data analytic. Our experimental results show the execution on a GPU yields speedups of over 150x compared to the sequential FastR implementation and the obtained performance is competitive with manually written GPU code. We also show that when taking into account start-up time, large speedups are achievable, even when the applications run for as little as a few seconds.

Sat 8 Apr

Displayed time zone: Azores change

14:00 - 15:30
GPUs, FPGAs, etcSession 2 at Zhu Que Room
Chair(s): David Chisnall University of Cambridge
14:00
30m
Talk
Catalyst: GPU-assisted rapid memory deduplication in virtualization environments
Session 2
Anshuj Garg Indian Institute of Technology, Bombay, Debadatta Mishra Indian Institute of Technology, Bombay, Purushottam Kulkarni Indian Institute of Technology, Bombay
File Attached
14:30
30m
Talk
Just-In-Time GPU Compilation for Interpreted Languages with Partial Evaluation
Session 2
Juan Fumero The University of Edinburgh, Michel Steuwer The University of Edinburgh, Lukas Stadler Oracle Labs, Austria, Christophe Dubach University of Edinburgh
Link to publication
15:00
30m
Talk
Heterogeneous Managed Runtime Systems: A Computer Vision Case Study
Session 2
Christos Kotselidis The University of Manchester, James Clarkson The University of Manchester, Andrey Rodchenko The University of Manchester, Andrew Nisbet The University of Manchester, John Mawer The University of Manchester, Mikel Luján
File Attached