Write a Blog >>
LCTES 2019
Sat 22 - Fri 28 June 2019 Phoenix, Arizona, United States
co-located with PLDI 2019
Sun 23 Jun 2019 16:30 - 16:35 at 105A - Session 4: Benchmarking and In-progress Works Chair(s): Hyunok Oh

Modern GPUs are the most successful accelerators as they provide outstanding performance gain by using CUDA or OpenCL programming models. For maximum performance, programmers typically try to maximize the number of thread blocks of target programs, and GPUs also generally attempt to allocate the maximum number of thread blocks to their GPU cores. However, many recent studies have pointed out that simply allocating the maximum number of thread blocks to GPU cores does not always guarantee the best performance. Thus, identifying proper number of thread blocks per GPU core is a major challenge. Despite these studies, most existing architectural techniques cannot be directly applied to current GPU hardware. Furthermore, the optimal number of thread blocks can vary significantly depending on the target hardware and application characteristics. To solve these problems, this study proposes a just-in-time thread block number adjustment system using CUDA binary modification upon an LLVM compiler framework, referred to as the CTA Limiter, in order to dynamically maximize GPU performance on real GPUs without reprograming. The framework gradually reduces the number of concurrent thread blocks of target CUDA workloads using extra shared memory allocation, and compares the execution time with the previous version to automatically identify the optimal number of thread blocks. The results showed meaningful performance improvements, averaging at 30%, 40%, and 44%, in GTX 960, GTX 1050, and GTX 1080 Ti, respectively.

Sun 23 Jun

16:00 - 16:45: LCTES 2019 - Session 4: Benchmarking and In-progress Works at 105A
Chair(s): Hyunok OhHanyang Univ
LCTES-2019-papers16:00 - 16:15
Kyle DaruwallaUniversity of Wisconsin – Madison, Heng ZhuoUniversity of Wisconsin - Madison, Carly SchulzUniversity of Wisconsin - Madison, Mikko H. Lipasti
LCTES-2019-papers16:15 - 16:20
Greg StittUniversity of Florida, David CampbellUniversity of Florida
LCTES-2019-papers16:20 - 16:25
Andrea MaioliPolitecnico di Milano, Italy, Luca MottolaPolitecnico di Milano, Italy and RI.Se SICS, Sweden, Muhammad Hamad AlizaiLUMS, Pakistan, Junaid Haroon SiddiquiLahore University of Management Sciences
LCTES-2019-papers16:25 - 16:30
LCTES-2019-papers16:30 - 16:35
Yongseung YuHanyang University, Seokwon KangHanyang University, Yongjun ParkHanyang University
LCTES-2019-papers16:35 - 16:40
Mostafa MohammedVirginia Tech, Haipeng CaiWashington State University Pullman, Na MengVirginia Tech