Research Papers
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs Mhd Ghaith Olabi (American University of Beirut), Juan Gómez Luna (ETH Zurich), Onur Mutlu (ETH Zurich), Wen-mei Hwu (UIUC, NVIDIA), Izzat El Hajj (American University of Beirut)
Aggregate Update Problem for Multi-Clocked Dataflow Languages Hannes Kallwies (Institute for Software Engineering and Programming Languages, University of Lübeck), Martin Leucker (Institute for Software Engineering and Programming Languages, University of Lübeck), Daniel Thoma (Institute for Software Engineering and Programming Languages, University of Lübeck), Torben Scheffel (Institute for Software Engineering and Programming Languages, University of Lübeck), Malte Schmitz (Institute for Software Engineering and Programming Languages, University of Lübeck)
SRTuner: Effective Compiler Optimization Customization By Exposing Synergistic Relations Sunghyun Park (University of Michigan), Seyyed Salar Latifi Oskouei (University of Michigan), Yongjun Park (Hanyang University), Armand Behroozi (University of Michigan), Byungsoo Jeon (Carnegie Mellon University), Scott Mahlke (University of Michigan/Nvidia Research)
PMD: Throughput Characterization for Any Architecture Nicolas Derumigny (INRIA), Fabian Gruber (Université Grenoble Alpes), Theophile Bastian (INRIA), Christophe Guillon (STMicroelectronics), Guillaume Iooss (INRIA), Louis-Noel Pouchet (Colorado State University), Fabrice Rastello (INRIA)
Lambda the Ultimate SSA! Siddharth Bhat (IIIT Hyderabad), Tobias Grosser (University of Edinburgh), Anurudh Peduri (IIIT Hyderabad, India)
Instruction Rolling for Code Size Reduction Rodrigo C. O. Rocha (The University of Edinburgh), Pavlos Petoumenos (University of Manchester), Björn Franke (University of Edinburgh), Pramod Bhatotia (TU Munich), Michael O'Boyle (University of Edinburgh)
An Open-Source MLIR-based Compiler for Fast Sum-Product Network Inference on CPUs and GPUs Lukas Sommer (Embedded Systems and Applications Group, TU Darmstadt), Cristian Axenie (Intelligent Cloud Technologies Laboratory, Huawei Munich Research Center), Andreas Koch (Embedded Systems and Applications Group, TU Darmstadt)
Sound, Precise, and Fast Abstract Interpretation with Tristate Numbers Harishankar Vishwanathan (Rutgers University), Matan Shachnai (Rutgers University), Srinivas Narayana (Rutgers University), Santosh Nagarakatte (Rutgers University)
A Compiler for Sound Floating-Point Computations using Affine Arithmetic Joao Rivera (ETH), Franz Franchetti (CMU), Markus Püschel (ETH)
Efficient Execution of OpenMP on GPUs Joseph Huber (Oak Ridge National Laboratory), Melanie Cornelius (Illinois Institute of Technology), Giorgis Georgakoudis (Lawrence Livermore National Laboratory), Shilei Tian (Stony Brook University), Jose Monsalve Diaz (Argonne National Labs), Kuter Dinel (Düzce University), Barbara Chapman (Stony Brook University), Johannes Doerfert (Argonne National Labs)
M3V: Multi-Modal Multi-View Context Embedding for Repair Operator Prediction Xuezheng Xu (University of New South Wales), Xudong Wang (University of New South Wales), Jingling Xue (University of New South Wales)
Recovering Container Class Types in Binaries Xudong Wang (University of New South Wales), Xuezheng Xu (University of New South Wales), Qingan Li (Computer School, Wuhan University), Jingling Xue (University of New South Wales), YUAN Mengting (Wuhan University)
Automatic Horizontal Fusion for GPU Kernels Ao Li (Carnegie Mellon University), Bojian Zheng (University of Toronto), Gennady Pekhimenko (University of Toronto / Vector Institute), Fan Long (University of Toronto)
Improving Debug Headers Using BlackBox Equivalence Checking Vaibhav Kiran Kurhe (Indian Institute of Technology Delhi), Pratik Karia (Indian Institute of Technology Delhi), Shubhani (Indian Institute of Technology Delhi), Abhishek Rose (Indian Institute of Technology Delhi), Sorav Bansal (Indian Institute of Technology Delhi)
HECATE: Performance-aware Scale Optimization for Homomorphic Encryption Compiler Yongwoo Lee (Yonsei University), Seonyoung Heo (ETH Zurich), Seonyoung Cheon (Yonsei University), Shinnung Jeong (Yonsei University), Changsu Kim (Seoul National University), Eunkyung Kim (Samsung SDS), Dongyoon Lee (Stony Brook University), Hanjun Kim (Yonsei University)
Solving PBQP-based Register Allocation using Deep Reinforcement Learning Minsu Kim (Seoul National University), Jeong-Keun Park (Seoul National University), Soo-Mook Moon (Seoul National University)
DARM: Control-Flow Melding for SIMT Thread Divergence Reduction Charitha Saumya (Purdue University), Kirshanthan Sundararajah (Purdue University), Milind Kulkarni (Purdue University)
Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation Zifan Nan (North Carolina State University), Xipeng Shen (North Carolina State University), Hui Guan (University of Massachusetts Amherst)
Optimizing GPU Deep Learning Operators With Polyhedral Scheduling Constraint Injection Cedric Bastoul (Huawei), Zhen Zhang (Huawei Paris Research Center), Harenome Razanajato (Huawei), Javier de Juan (Huawei), Nelson Lossing (Huawei), Adilla Susungi (Huawei), Etienne Filhol (Huawei), Baptiste Jarry (Huawei)
F3M: Fast Focused Function Merging Sean Stirling (Codeplay), Rodrigo C. O. Rocha (University of Edinburgh), Hugh Leather (Facebook), Kim Hazelwood (Facebook), Michael O'Boyle (University of Edinburgh), Pavlos Petoumenos (University of Manchester)
Unified Compilation for Lossless Compression and Sparse Computing Daniel Donenfeld (MIT), Stephen Chou (MIT), Saman Amarasinghe (MIT)
Comprehensive Accelerator-Dataflow Co-Design Optimization for Convolutional Neural Networks Miheer Vaidya (University of Utah), Aravind Sukumaran-Rajam (Washington State University), Atanas Rountev (Ohio State University), Ponnuswamy Sadayappan (University of Utah)
Gadgets Splicing: Dynamic Binary Transformation for Precise Rewriting Linan Tian (University of Chinese Academy of Sciences), Yangyang Shi (University of Chinese Academy of Sciences), Liwei Chen (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences), Yanqi Yang (University of Chinese Academy of Sciences), Gang Shi (Institute of Information Engineering, Chinese Academy of Sciences, School of Cyber Security, University of Chinese Academy of Sciences)
Tools and Practical Experience Papers
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research Chris Cummins (Facebook), Bram Wasti (Facebook), Jiadong Guo (Facebook), Brandon Cui (Facebook), Jason Ansel (Facebook), Sahir Gomez (Facebook), Somya Jain (Facebook), Jia Liu (Facebook), Olivier Teytaud (Facebook), Benoit Steiner (Facebook), Yuandong Tian (Facebook), Hugh Leather (Facebook)
NOELLE Offers Empowering LLVM Extensions Angelo Matni (Northwestern University), Enrico Armenio Deiana (Northwestern University), Yian Su (Northwestern University), Lukas Gross (Northwestern University), Souradip Ghosh (Northwestern University), Sotiris Apostolakis (Princeton University), Ziyang Xu (Princeton University), Zujun Tan (Princeton University), Ishita Chaturvedi (Princeton University), Brian Homerding (Northwestern University), Tommy McMichen (Northwestern University), David I. August (Princeton University), Simone Campanoni (Northwestern University)
Distill: Domain-Specific Compilation for Cognitive Models Jan Vesely (Yale University), Raghavendra Pradyumna Pothukuchi (Yale University), Ketaki Joshi (Yale University), Samyak Gupta (Princeton University), Jonathan D. Cohen (Princeton University), Abhishek Bhattacharjee (Yale University)
GraphIt to CUDA compiler in 2021 LOC: A case for high-performance DSL implementation via staging with BuilDSL Ajay Brahmakshatriya (Massachusetts Institute of Technology), Saman Amarasinghe (MIT)