CC 2026
Sat 31 January - Sun 1 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
Sun 1 Feb 2026 09:05 - 09:25 at Coogee - Panel + Tools Chair(s): Martin Kong

The proliferation of Python DSLs for developing kernels has democratized GPU programming. While kernel development is now Python-native, performance analysis and optimization still rely on external tools and fragmented workflows.

We introduce Tachyon, a Python profiling toolkit that bridges this gap by bringing performance analysis for GPU kernels directly into the Python ecosystem. Tachyon is framework-agnostic and works seamlessly with any Python framework through simple decorators and context managers that mark code regions for analysis. The tool automatically handles performance measurement, data collection, and result aggregation across multiple configurations.

Key innovations include: (1) Python-native interface enabling direct GPU kernel analysis within existing Python workflows, (2) configuration-driven execution model that correlates measured metrics with input parameters, (3) access to thousands of core metrics and enabling custom derived and relative metric computations through parameter correlation, (4) built-in thermal management to prevent GPU throttling during batch analysis, and (5) structured customizable visualization with automatic plot generation based on parameter roles.

With Tachyon, the entire kernel development process—from implementation to iterative optimization and analysis—is now truly Python-native, providing detailed insights without requiring external profilers or disrupting workflows.

Sun 1 Feb

Displayed time zone: Hobart change

08:45 - 10:30
Panel + ToolsMain Conference at Coogee
Chair(s): Martin Kong Brookhaven National Laboratory
08:45
20m
Talk
Inside VOLT: Designing of an Open-Source GPU Compiler (Tool)
Main Conference
Shinnung Jeong Georgia Institute of Technology, Chihyo Ahn Georgia Tech, Huanzhi Pu Georgia Institute of Technology, Jisheng Zhao Georgia Institute of Technology, Hyesoon Kim Georgia Institute of Technology, Blaise Tine University of California, Los Angeles
09:05
20m
Talk
Nsight Python: A Python-First Profiling Toolkit for Seamless GPU Kernel Analysis (Tool)
Main Conference
09:30
60m
Panel
Panel: The role of compilers in the era of AI chips and programming frameworks
Main Conference
P: Ayal Zaks Mobileye, P: Albert Cohen Google DeepMind, P: Nicholas Smith Tenstorrent, P: Uday Bondhugula Indian Institute of Science