CC 2026
Sat 31 January - Sun 1 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
Sat 31 Jan 2026 11:26 - 11:52 at Coogee - Optimizations Chair(s): Martin Kong

The rise of asynchronous execution and specialized concurrent thread groups has reshaped GPU programming, but it also introduces complex timing and coordination challenges. Developers must carefully manage data readiness, concurrency, and hardware-specific tensor units that current low-level primitives make error-prone and hardware-dependent.

We present Async Graphene, a set of high-level abstractions for asynchronous and concurrent GPU programming. It builds on three pillars: (1) Async Tensor Types, which encode temporal relationships in the type system to prevent synchronization errors; (2) Concurrency Primitives, which structure complex control flow through specialization and pipelining; and (3) Tensor-Unit Orchestration, which simplifies low-level tensor hardware management.

On critical workloads such as GEMM and FlashAttention, Async Graphene delivers performance competitive with hand-optimized frameworks while significantly reducing development complexity. By making temporal behavior explicit, it allows compilers to address asynchrony and concurrency challenges without obscuring performance-critical details.

Sat 31 Jan

Displayed time zone: Hobart change

11:00 - 12:45
OptimizationsMain Conference at Coogee
Chair(s): Martin Kong The Ohio State University
11:00
26m
Talk
GraalMHC: ML-Based Method-Hotness Classification for Binary-Size Reduction in Optimizing Compilers
Main Conference
Milan Cugurovic Oracle and University of Belgrade, Aleksandar Prokopec Oracle Labs, Boris Spasojevic Oracle Labs, Zurich, Switzerland, Vojin Jovanovic Oracle Labs, Milena Vujosevic Janicic University of Belgrade and Oracle
11:26
26m
Talk
It’s about Time - Temporal Abstractions for Asynchronous GPU Tensor Computations
Main Conference
11:52
26m
Talk
Optimizing Sparse Tensor Compilation for Sparse Output
Main Conference
Shideh Hashemian University of Edinburgh, Michael F. P. O'Boyle University of Edinburgh, Amir Shaikhha University of Edinburgh
12:18
26m
Talk
RIFS: Run-time Invariant Function Specialization
Main Conference
Saba Jamilan University of California, Santa Cruz, Snehasish Kumar Google LLC, Heiner Litz UC Santa Cruz