One-Shot Tuner for Deep Learning Compilers (CC 2022 - CC Research Papers)

Who

Jaehun Ryu, Eunhyeok Park, Hyojin Sung

Track

CC 2022 CC Research Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 6 Apr 2022 10:20 - 10:35 at CC Virtual Room - Session 3: Compilers and Machine Learning Chair(s): Ayal Zaks

Abstract

Auto-tuning DL compilers are gaining ground as an optimizing back-end for DL frameworks. While existing work can generate deep learning models that exceed the performance of hand-tuned libraries, they still suffer from prohibitively long auto-tuning time due to repeated hardware measurements in large search spaces. In this paper, we take a neural-predictor inspired approach to reduce the auto-tuning overhead and show that a performance model can be trained prior to compilation to accurately identify high-performing tensor operation codes and eliminate repeated search and hardware measurements. To generate a sample-efficient training dataset, we extend input representation to include task-specific information and guide data sampling methods to focus on learning high-performing codes. We evaluated the resulting predictor model, One-Shot Tuner, against AutoTVM and other prior work, and the results show that One-Shot Tuner speeds up compilation by 2.81 to 67.7x compared to prior work while providing comparable or improved inference time for CNN and Transformer models.

DOI

https://doi.org/10.1145/3497776.3517774

Jaehun Ryu

POSTECH

Eunhyeok Park

POSTECH

Hyojin Sung

POSTECH

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 6 Apr
Displayed time zone: Eastern Time (US & Canada) change

10:20 - 11:20	Session 3: Compilers and Machine LearningCC Research Papers at CC Virtual Room Chair(s): Ayal Zaks Intel Corporation and Technion, Israel

10:20 15m Paper		One-Shot Tuner for Deep Learning Compilers CC Research Papers Jaehun Ryu POSTECH, Eunhyeok Park POSTECH, Hyojin Sung POSTECH DOI
10:35 15m Paper		Training of Deep Learning Pipelines on Memory-Constrained GPUs via Segmented Fused-Tiled Execution CC Research Papers Yufan Xu University of Utah, Saurabh Raje , Atanas Rountev Ohio State University, Gerald Sabin RNET Technologies, Aravind Sukumaran-Rajam Washington State University, Ponnuswamy Sadayappan University of Utah DOI
10:50 15m Paper		MLIR-Based Code Generation for GPU Tensor Cores CC Research Papers Navdeep Katel Indian Institute of Science, PolyMage Labs, Vivek Khandelwal Indian Institute of Science, Uday Bondhugula Indian Institute of Science, PolyMage Labs DOI
11:05 15m Paper		Automating Reinforcement Learning Architecture Design for Code Optimization CC Research Papers HuantingWang , Zhanyong Tang Northwest University, Cheng Zhang Northwest University, Jiaqi Zhao Northwest University, Chris Cummins Facebook, Hugh Leather Facebook, Zheng Wang University of Leeds, UK DOI