CGO 2021
Sat 27 February - Wed 3 March 2021

Research Papers

Loop Parallelization using Dynamic Commutativity Analysis Christos Vasiladiotis (University of Edinburgh), Roberto Castañeda Lozano (University of Edinburgh), Murray Cole (University of Edinburgh), Björn Franke (University of Edinburgh)

StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems Johannes de Fine Licht (ETH Zurich), Andreas Kuster (ETH Zurich), Tiziano De Matteis (ETH Zurich), Tal Ben-Nun (ETH Zurich), Dominic Hofer (ETH Zurich), Torsten Hoefler (ETH Zurich)

MLIR: Scaling Compiler Infrastructure for Domain Specific Computation Chris Lattner (MLIR / LLVM), Mehdi Amini (Google), Uday Bondhugula (Indian Institute of Science), Albert Cohen (Google), Andy Davis (Google), Jacques Pienaar (Google), River Riddle (Google), Tatiana Shpeisman (Google), Nicolas Vasilache (Google), Oleksandr Zinenko (Google)

UNIT: Unifying Tensorized Instruction Compilation Jian Weng (UCLA), Animesh Jain (Amazon Web Services), Jie Wang (University of California, Los Angeles), Leyuan Wang (Amazon), Yida Wang (Amazon), Tony Nowatzki (UCLA)

Variable-sized Blocks for Locality-aware SpMV Naveen Namashivayam (HPE), Sanyam Mehta (HPE), Pen-Chung Yew (University of Minnesota-Twin Cities)

YaskSite – Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures Christie Louis Alappat (Friedrich Alexander University, Erlangen-Nuremberg), Johannes Seiferth (University of Bayreuth), Georg Hager (Friedrich Alexander University, Erlangen-Nuremberg), Matthias Korch (University of Bayreuth), Thomas Rauber (University of Bayreuth), Gerhard Wellein (Friedrich Alexander University, Erlangen-Nuremberg)

Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUs Guangli Li (Institute of Computing Technology, Chinese Academy of Sciences), Jingling Xue (UNSW Sydney), Lei Liu (Institute of Computing Technology, Chinese Academy of Sciences), Xueying Wang (Institute of Computing Technology, Chinese Academy of Sciences), Xiu Ma (Jilin University), Xiao Dong (Institute of Computing Technology, Chinese Academy of Sciences), Jiansong Li (Institute of Computing Technology, Chinese Academy of Sciences), Xiaobing Feng (Institute of Computing Technology, Chinese Academy of Sciences)

Thread-aware Area-efficient High-level Synthesis Compiler for Embedded Devices Changsu Kim (POSTECH), Shinnung Jeong (Yonsei University), Sungjun Cho (POSTECH), Yongwoo Lee (Yonsei University), William Song (Yonsei University), Youngsok Kim (Yonsei University), Hanjun Kim (Yonsei University)

Compiling Graph Applications for GPUs with GraphIt Ajay Brahmakshatriya (Massachusetts Institute of Technology), Yunming Zhang (Massachusetts Institute of Technology), Changwan Hong (Massahusetts Institute of Technology), Shoaib Kamil (Adobe), Julian Shun (Massahusetts Institute of Technology), Saman Amarasinghe (Massachusetts Institute of Technology)

r3d3: Optimized Query Compilation on GPUs Alexander Krolik (McGill University), Clark Verbrugge (McGill University), Laurie Hendren (McGill University)

AnghaBench: a Suite with One Million Compilable C Benchmarks for Code-Size Reduction Anderson Faustino da Silva (UEM), Bruno Conde Kind (UFMG), José Wesley de Souza Magalhães (UFMG), Jerônimo Nunes Rocha (UFMG), Breno Guimaraes (UFMG), Fernando Pereira (Federal University of Minas Gerais)

Memory-Safe Elimination of Side Channels Luigi Soares (Federal University of Minas Gerais), Fernando Pereira (Federal University of Minas Gerais)

Seamless Compiler Integration of Variable Precision Floating-Point Arithmetic Tiago Jost (Univ. Grenoble Alpes CEA, LIST, Grenoble, France), Yves Durand (Univ. Grenoble Alpes CEA, LIST, Grenoble, France), Christian Fabre (Univ. Grenoble Alpes CEA, LIST, Grenoble, France), Albert Cohen (Google, Paris, France), Frédéric Pétrot (Univ. Grenoble Alpes, CNRS, Grenoble INP, TIMA, Grenoble, France)

An Interval Compiler for Sound Floating Point Computations Joao Rivera (ETH Zurich), Franz Franchetti (Carnegie Mellon University), Markus Püschel (ETH Zurich)

Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary Translation Ziyi Zhao (Nankai University), Zhang Jiang (Nankai University), Xiaoli Gong (Nankai University), Ying Chen (Nankai University), Wenwen Wang (University of Georgia), Pen-Chung Yew (University of Minnesota at Twin Cities)

Cinnamon: A Domain-Specific Language for Binary Profiling and Monitoring Mahwish Arif (University of Cambridge), Ruoyu Zhou (University of Cambridge), Hsi-Ming Ho (Sheffield Hallam University), Timothy M. Jones (University of Cambridge)

Progressive Raising in Multi-level IR Lorenzo Chelini (TU Eindhoven), Andi Drebes (Inria and École Normale Supérieure, Paris, France), Oleksandr Zinenko (Google), Albert Cohen (Google), Henk Corporaal (TU Eindhoven), Tobias Grosser (University of Edinburgh), Nicolas Vasilache (Google)

Optimizing Image Processing Pipelines with a Domain-Extensible Compiler Thomas Koehler (University of Glasgow), Michel Steuwer (The University of Edinburgh)

Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics Instrumentation David Pankratz (University of Alberta), Tyler Nowicki (Huawei Technologies Canada), Ahmed Eltantawy (Huawei Technologies Canada), J. Nelson Amaral (University of Alberta)

Object Versioning for Flow-Sensitive Pointer Analysis Mohamad Barbar (University of Technology Sydney), Yulei Sui (University of Technology Sydney), Shiping Chen (CSIRO, Data61)

GPA: A GPU Performance Advisor Based on Instruction Sampling Keren Zhou (Rice University), Xiaozhu Meng (Rice University), Ryuichi Sai (Rice University), John Mellor-Crummey (Rice University)

HHVM Jump-Start: Boosting both Warmup and Steady-State Performance at Scale Guilherme Ottoni (Facebook), Bin Liu (Facebook)

Efficient Execution of Graph Algorithms on CPU with SIMD Extensions Ruohuang Zheng (University of Rochester), Sreepathi Pai (University of Rochester)

Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum Circuits Ji Liu (North Carolina State University), Luciano Bello (IBM T. J. Watson Research Center), Huiyang Zhou (North Carolina State University)

Fine-grained Pipeline Parallelization for Network Function Programs Seungbin Song (Yonsei University), Heelim Choi (Yonsei University), Hanjun Kim (Yonsei University)

Tools and Practical Experience Papers

An Experience with Code-size Optimization for Production iOS Mobile Applications Milind Chabbi (Uber Technologies), Jin Lin (Uber Technologies), Raj Barik (Uber Technologies)

C-for-Metal: High Performance SIMD Programming on Intel GPUs Guei-Yuan Lueh (Intel Corporation), Kaiyu Chen (Intel Corporation), Gang Chen (Intel Corporation), Joel Fuentes (Intel Corporation), Wei-Yu Chen (Intel Corporation), Fangwen Fu (Intel Corporation), Hong Jiang (Intel Corporation), Hongzheng Li (Intel Corporation), Daniel Rhee (Intel Corporation)

ELFies: Executable Region Checkpoints for Performance Analysis and Simulation Harish Patil (Intel), Alexander Isaev (Intel), Wim Heirman (Intel), Alen Sabu (National University of Singapore), Ali Hajiabadi (National University of Singapore), Trevor E. Carlson (National University of Singapore)

GoBench: a Benchmark Suite of Real-World Go Concurrency Bugs Ting Yuan (Institute of Computing Technology), Guangwei Li (Institute of Computing Technology), Jie Lu (Institute of Computing Technology), Chen Liu (Institute of Computing Technology), Lian Li (Institute of Computing Technology), Jingling Xue (University of New South Wales)

Scaling up the IFDS Algorithm with Efficient Disk-based Computing Haofeng Li (Institute of Computing Technology, Chinese Academy of Sciences), Haining Meng (Institute of Computing Technology, Chinese Academy of Sciences), Hengjie Zheng (Institute of Computing Technology, Chinese Academy of Sciences), Liqing Cao (Institute of Computing Technology, Chinese Academy of Sciences), Jie Lu (Institute of Computing Technology, Chinese Academy of Sciences), Lian Li (Institute of Computing Technology, Chinese Academy of Sciences), Lin Gao (TianqiSoft Inc.)

BuildIt: A type based multistage programming framework for code generation in C++ Ajay Brahmakshatriya (Massachusetts Institute of Technology), Saman Amarasinghe (Massachusetts Institute of Technology)