It’s about Time - Temporal Abstractions for Asynchronous GPU Tensor Computations
The rise of asynchronous execution and specialized concurrent thread groups has reshaped GPU programming, but it also introduces complex timing and coordination challenges. Developers must carefully manage data readiness, concurrency, and hardware-specific tensor units that current low-level primitives make error-prone and hardware-dependent.
We present Async Graphene, a set of high-level abstractions for asynchronous and concurrent GPU programming. It builds on three pillars: (1) Async Tensor Types, which encode temporal relationships in the type system to prevent synchronization errors; (2) Concurrency Primitives, which structure complex control flow through specialization and pipelining; and (3) Tensor-Unit Orchestration, which simplifies low-level tensor hardware management.
On critical workloads such as GEMM and FlashAttention, Async Graphene delivers performance competitive with hand-optimized frameworks while significantly reducing development complexity. By making temporal behavior explicit, it allows compilers to address asynchrony and concurrency challenges without obscuring performance-critical details.
Sat 31 JanDisplayed time zone: Hobart change
11:00 - 12:45 | |||
11:00 26mTalk | GraalMHC: ML-Based Method-Hotness Classification for Binary-Size Reduction in Optimizing Compilers Main Conference Milan Cugurovic Oracle and University of Belgrade, Aleksandar Prokopec Oracle Labs, Boris Spasojevic Oracle Labs, Zurich, Switzerland, Vojin Jovanovic Oracle Labs, Milena Vujosevic Janicic University of Belgrade and Oracle | ||
11:26 26mTalk | It’s about Time - Temporal Abstractions for Asynchronous GPU Tensor Computations Main Conference | ||
11:52 26mTalk | Optimizing Sparse Tensor Compilation for Sparse Output Main Conference Shideh Hashemian University of Edinburgh, Michael F. P. O'Boyle University of Edinburgh, Amir Shaikhha University of Edinburgh | ||
12:18 26mTalk | RIFS: Run-time Invariant Function Specialization Main Conference Saba Jamilan University of California, Santa Cruz, Snehasish Kumar Google LLC, Heiner Litz UC Santa Cruz | ||