Generating Fast Sparse Matrix Vector Multiplication From a High Level Generic Functional IR
Usage of high-level intermediate representations promises the generation of fast code from a high-level description, improving the productivity of developers while achieving the performance traditionally only reached with low-level programming approaches.
High-level IRs come in two flavors: 1) domain-specific IRs designed to express only for a specific application area; or 2) generic high-level IRs that can be used to generate high-performance code across many domains. Developing generic IRs is more challenging but offers the advantage of reusing a common compiler infrastructure various applications.
In this paper, we extend a generic high-level IR to enable efficient computation with sparse data structures. Crucially, we encode sparse representation using reusable dense building blocks already present in the high-level IR. We use a form of dependent types to model sparse matrices in CSR format by expressing the relationship between multiple dense arrays explicitly separately storing the length of rows, the column indices, and the non-zero values of the matrix.
We demonstrate that we achieve high-performance compared to spare low-level library code using our extended generic high-level code generator. On an Nvidia GPU, we outperform the highly tuned Nvidia cuSparse implementation of \gls{spmv} multiplication across 28 sparse matrices of varying sparsity on average by 1.7\times1.7×.
Sat 22 FebDisplayed time zone: Pacific Time (US & Canada) change
13:00 - 14:30 | Session 2 Techniques for Specific DomainsMain Conference Chair(s): Dongyoon Lee Stony Brook University | ||
13:00 22mResearch paper | Generating Fast Sparse Matrix Vector Multiplication From a High Level Generic Functional IR Main Conference Federico Pizzuti University of Edinburgh, Michel Steuwer University of Glasgow, Christophe Dubach University of Edinburgh | ||
13:22 22mResearch paper | A Study of Event Frequency Profiling with Differential Privacy Main Conference Hailong Zhang Ohio State University, Yu Hao , Sufian Latif Ohio State University, USA, Raef Bassily Ohio State University, USA, Atanas Rountev Ohio State University | ||
13:45 22mResearch paper | Improving Database Query Performance with Automatic Fusion Main Conference Hanfeng Chen McGill University, Canada, Alexander Krolik McGill University, Canada, Bettina Kemme McGill University, Canada, Clark Verbrugge McGill University, Canada, Laurie Hendren McGill University, Canada | ||
14:07 22mResearch paper | Robust Quantization of Deep Neural Networks Main Conference Youngseok Kim Hanyang University, Korea, Junyeol Lee Hanyang University, Korea, Younghoon Kim Hanyang University, Korea, Jiwon Seo Hanyang University |