Nsight Python: A Python-First Profiling Toolkit for Seamless GPU Kernel Analysis (Tool)
The proliferation of Python DSLs for developing kernels has democratized GPU programming. While kernel development is now Python-native, performance analysis and optimization still rely on external tools and fragmented workflows.
We introduce Nsight Python, a Python profiling toolkit that bridges this gap by bringing performance analysis for GPU kernels directly into the Python ecosystem. Nsight Python is framework-agnostic: simple decorators and context managers mark code regions for analysis in any Python framework. The tool automatically handles performance measurement, data collection, and result aggregation across multiple configurations.
Key innovations include: (1) a Python-native interface that enables direct GPU kernel analysis within existing Python workflows, (2) a configuration-driven execution model that correlates measured metrics with input parameters, (3) access to thousands of core metrics, together with custom derived and relative metrics computed through parameter correlation, (4) built-in thermal management to prevent GPU throttling during batch analysis, and (5) structured, customizable visualization with automatic plot generation based on parameter roles.
With Nsight Python, the entire kernel development process, from implementation to iterative optimization and analysis, is Python-native, providing detailed insights without requiring external profilers or disrupting workflows.
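The workflow described above (regions marked with decorators and context managers, execution driven by a list of configurations, and relative metrics derived by correlating measurements with input parameters) might look roughly like the sketch below. It uses only the Python standard library and times CPU stand-ins with a wall clock. The names `annotate`, `sweep`, and `relative_to` are hypothetical and chosen for illustration; they are not taken from the toolkit's actual API, and no GPU metrics are collected.

```python
import time
from contextlib import contextmanager
from functools import wraps

# Hypothetical names throughout: this is NOT the toolkit's API, only a
# standard-library sketch of the decorator / context-manager pattern.
_active_runs = []  # stack of dicts, one per run, collecting region timings


@contextmanager
def annotate(name):
    """Mark a code region and record its wall-clock duration in the active run."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        if _active_runs:
            _active_runs[-1][name] = elapsed


def sweep(configs, runs=3):
    """Call the decorated function `runs` times per configuration and return
    one record per (configuration, region) holding the mean duration."""
    def decorator(fn):
        @wraps(fn)
        def wrapper():
            records = []
            for cfg in configs:
                totals = {}
                for _ in range(runs):
                    _active_runs.append({})
                    try:
                        fn(**cfg)
                    finally:
                        timings = _active_runs.pop()
                    for region, seconds in timings.items():
                        totals[region] = totals.get(region, 0.0) + seconds
                for region, total in totals.items():
                    records.append({**cfg, "region": region, "seconds": total / runs})
            return records
        return wrapper
    return decorator


def relative_to(records, baseline_region):
    """Derived metric: each region's mean time divided by the baseline
    region's mean time for the same configuration."""
    def config_key(rec):
        return tuple(sorted((k, v) for k, v in rec.items()
                            if k not in ("region", "seconds")))
    baseline = {config_key(r): r["seconds"]
                for r in records if r["region"] == baseline_region}
    return [{**r, "relative": r["seconds"] / baseline[config_key(r)]}
            for r in records]


@sweep(configs=[{"n": 1_000}, {"n": 10_000}])
def workload(n):
    # CPU stand-ins for two variants of the same kernel.
    with annotate("loop"):
        total = 0
        for i in range(n):
            total += i * i
    with annotate("builtin"):
        total = sum(i * i for i in range(n))


records = relative_to(workload(), baseline_region="loop")
for rec in records:
    print(rec)
```

Each record keeps the input parameter (`n`) next to the measurement, which is what allows the relative metric to be computed per configuration. A real GPU profiler would replace the wall-clock timer with hardware metrics and would also need to manage warm-up and thermal state, which this sketch leaves out.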
Sun 1 Feb (displayed time zone: Hobart)
08:45 - 10:30
08:45 | 20m | Talk | Inside VOLT: Designing of an Open-Source GPU Compiler (Tool) | Main Conference | Shinnung Jeong Georgia Institute of Technology, Chihyo Ahn Georgia Tech, Huanzhi Pu Georgia Institute of Technology, Jisheng Zhao Georgia Institute of Technology, Hyesoon Kim Georgia Institute of Technology, Blaise Tine University of California, Los Angeles
09:05 | 20m | Talk | Nsight Python: A Python-First Profiling Toolkit for Seamless GPU Kernel Analysis (Tool) | Main Conference
09:30 | 60m | Panel | Panel: The role of compilers in the era of AI chips and programming frameworks | Main Conference | P: Ayal Zaks Mobileye, P: Albert Cohen Google DeepMind, P: Nicholas Smith Tenstorrent, P: Uday Bondhugula Indian Institute of Science