Sat 2 AprDisplayed time zone: Eastern Time (US & Canada) change
| 09:00 - 16:00 | Science, Art, Magic: Using and Developing The Graal CompilerWorkshops and Tutorials | ||
| 09:005m Talk | Welcome Workshops and Tutorials | ||
| 09:0555m Talk | Keynote: Static Java: The GraalVM Native Image Programming Model Workshops and Tutorials | ||
| 10:0030m Talk | Faster Native Image development build times with Quick Build mode Workshops and Tutorials | ||
| 10:3030m Talk | Improving GraalVM Reflection File Generation Workshops and Tutorials | ||
| 11:0030m Break | Break Workshops and Tutorials | ||
| 11:3030m Talk | Truffle Interpreter Performance without the Holy Graal Workshops and Tutorials | ||
| 12:0030m Talk | TruffleString: highly optimized cross-language string implementation. Workshops and Tutorials | ||
| 12:3030m Talk | State of AArch64 on GraalVM Workshops and Tutorials | ||
| 13:0030m Talk | Call-Target Agnostic Keyword Arguments Workshops and Tutorials | ||
| 13:3030m Talk | Tuning autovectorization in Graal Workshops and Tutorials | ||
| 14:0030m Break | Break Workshops and Tutorials | ||
| 14:3075m Talk | Lightning Talks Workshops and Tutorials | ||
| 15:4515m Talk | Closing remarks & survey Workshops and Tutorials | ||
| 10:00 - 13:00 | DL4CompilersWorkshops and Tutorials | ||
| 10:0060m Talk | Session I -  Learning over Programs by Chris Cummins Workshops and Tutorials | ||
| 11:0060m Talk | Session II - Challenges & Research Directions by Sandya Mannarswamy Workshops and Tutorials | ||
| 12:0060m Talk | Session III - Applications of DL to Compilers by Dibyendu Das Workshops and Tutorials | ||
| 13:30 - 17:00 | Autotuning & Reinforcement Learning for Compilers with CompilerGymWorkshops and Tutorials | ||
| 13:3030m Talk | Getting Started Workshops and Tutorials | ||
| 14:0030m Talk | Running CompilerGym on your Own Programs Workshops and Tutorials | ||
| 14:3030m Talk | CompilerGym Explorer Workshops and Tutorials | ||
| 15:0030m Talk | Autotuning Workshops and Tutorials | ||
| 15:3030m Talk | Reinforcement Learning Workshops and Tutorials | ||
| 16:0060m Talk | Extensions Workshops and Tutorials | ||
Sun 3 AprDisplayed time zone: Eastern Time (US & Canada) change
| 09:00 - 13:00 | IMOP: a Self-Stabilizing Source-to-Source Compiler Framework for OpenMP CWorkshops and Tutorials | ||
| 09:0015m Talk | Introduction to IMOP Workshops and Tutorials | ||
| 09:1540m Talk | Fundamental Representations (AST, CFG, and CG) Workshops and Tutorials | ||
| 09:5530m Talk | Scopes, Symbols, Types, and Environments Workshops and Tutorials | ||
| 10:255m Talk | Break Workshops and Tutorials | ||
| 10:3040m Talk | Code Construction and Transformations Workshops and Tutorials | ||
| 11:1040m Talk | Data-flow Analyses Workshops and Tutorials | ||
| 11:5010m Talk | Break Workshops and Tutorials | ||
| 12:0030m Talk | Concurrency Representations Workshops and Tutorials | ||
| 12:3020m Talk | Self-stabilization, and Z3-integration Workshops and Tutorials | ||
| 12:5010m Talk | Discussions and Q&A Workshops and Tutorials | ||
Mon 4 AprDisplayed time zone: Eastern Time (US & Canada) change
| 08:45 - 09:00 | OpeningMain Conference | ||
| 09:00 - 10:00 | Keynote (PPoPP)Main Conference Many Real-World Challenges for Effective Programming of Heterogeneous Systems Heterogeneous Systems offer tremendous opportunities through hardware innovation, but this leaves a lot unanswered in regards to ‘how will we program them.’ SYCL is a Khronos standard to extend C++ for Heterogeneous Programming, and is instructive to review in terms of the practical problems inherent in extending programming for heterogeneous systems. James will discuss SYCL in order to expose key challenges, and discuss real unsolved problems that stand in the way of ‘standard parallelism’ solving this in C++ and many other programming languages. Speaker: James Reinders (Intel) James Reinders is an engineer at Intel focused on enabling parallel programming in a heterogeneous world. James has helped create ten technical books related to parallel programming; his latest book is about SYCL (free download: https://www.apress.com/book/9781484255735). He has had the great fortune to help make key contributions to two of the world’s fastest computers (#1 on Top500 list) as well as many other supercomputers, and software developer tools. | ||
| 10:00 - 10:20 | BreakMain Conference | ||
| 10:20 - 11:20 | |||
| 10:2015m Talk | A Compiler Framework for Optimizing Dynamic Parallelism on GPUs Main Conference Mhd Ghaith Olabi American University of Beirut, Juan Gómez Luna ETH Zurich, Onur Mutlu ETH Zurich, Wen-mei Hwu University of Illinois at Urbana-Champaign, Izzat El Hajj American University of BeirutLink to publication | ||
| 10:3515m Talk | Automatic Horizontal Fusion for GPU Kernels Main Conference Ao Li Carnegie Mellon University, Bojian Zheng University of Toronto, Gennady Pekhimenko University of Toronto / Vector Institute, Fan Long University of Toronto, CanadaLink to publication | ||
| 10:5015m Talk | DARM: Control-Flow Melding for SIMT Thread Divergence Reduction Main Conference Charitha Saumya Purdue University, Kirshanthan Sundararajah Purdue University, Milind Kulkarni Purdue UniversityLink to publication | ||
| 11:0515m Talk | Efficient Execution of OpenMP on GPUs Main Conference Joseph Huber Oak Ridge National Laboratory, Melanie Cornelius Illinois Institute of Technology, Giorgis Georgakoudis Lawrence Livermore National Laboratory, Shilei Tian Stony Brook University, JoseM Monsalve Diaz Argonne National Laboratory, Kuter Dinel Düzce University, Barbara Chapman Stony Brook University, Johannes Doerfert Argonne National LaboratoryLink to publication | ||
| 11:20 - 11:40 | BreakMain Conference | ||
| 12:25 - 12:50 | BreakMain Conference | ||
| 12:50 - 13:35 | Session #3: Domain-Specific CompilationMain Conference Chair(s): Tobias Grosser University of Edinburgh | ||
| 12:5015m Talk | GraphIt to CUDA compiler in 2021 LOC: A case for high-performance DSL implementation via staging with BuilDSL Main Conference Ajay Brahmakshatriya Massachusetts Institute of Technology, Saman Amarasinghe Massachusetts Institute of TechnologyLink to publication | ||
| 13:0515m Talk | A Compiler for Sound Floating-Point Computations using Affine Arithmetic Main ConferenceLink to publication | ||
| 13:2015m Talk | Aggregate Update Problem for Multi-Clocked Dataflow Languages Main Conference Hannes Kallwies University of Lübeck, Martin Leucker University of Lübeck, Daniel Thorma University of Lübeck, Torben Scheffel  University of Lübeck, Malte Schmitz University of LübeckLink to publication | ||
| 13:35 - 13:45 | BreakMain Conference | ||
| 13:45 - 14:45 | Business MeetingMain Conference | ||
Tue 5 AprDisplayed time zone: Eastern Time (US & Canada) change
| 09:00 - 10:00 | Keynote (CGO)Main Conference Compiler 2.0 When I was a graduate student a long time ago, I used to have intense conversations and learned a lot from my peers in other areas of computer science as the program structure, systems, and algorithms used in my compiler were very similar to and inspired by many of the work done by my peers. For example, a Natural Language Recognition System that was developed by my peers, with a single sequential program with multiple passes connected through IRs that systematically transformed an audio stream into text, was structurally similar to the SUIF compiler I was developing. In the intervening 30 years, the information revolution brought us unprecedented advances in algorithms (e.g., machine learning and solvers), systems (e.g., multicores and cloud computing), and program structure (e.g., serverless and low-code frameworks). Thus, a modern NLP system such as Apple’s Siri or Amazon’s Alexa, a thin client on an edge device interfacing to a massively-parallel, cloud-based, centrally-trained Deep Neural Network, has little resemblance to its predecessors. However, the SUIF compiler is still eerily similar to a state-of-the-art modern compiler such as LLVM or MLIR. What happened with compiler construction technology? At worst, as a community, we have been Luddites to the information revolution even though our technology has been critical to it. At best, we have been unable to transfer our research innovations (e.g., polyhedral method or program synthesis) into production compilers. In this talk I hope to inspire the compiler community to radically rethink how to build next generation compilers by giving a few possible examples of using 21st century program structures, algorithms and systems in constructing a compiler. Speaker: Saman Amarasinghe (MIT) Saman Amarasinghe is a Professor in the Department of Electrical Engineering and Computer Science at Massachusetts Institute of Technology and a member of its Computer Science and Artificial Intelligence Laboratory (CSAIL) where he leads the Commit compiler group. Under Saman’s guidance, the Commit group developed the StreamIt, PetaBricks, Halide, Simit, MILK, Cimple, TACO, GraphIt, BioStream, CoLa and Seq programming languages and compilers, DynamoRIO, Helium, Tiramisu, Codon, StreamJIT and BuildIt compiler/runtime frameworks, Superword Level Parallelism (SLP), goSLP, VeGen and SuperVectorizer for vectorization, Ithemal machine learning based performance predictor, Program Shepherding to protect programs against external attacks, the OpenTuner extendable autotuner, and the Kendo deterministic execution system. He was the co-leader of the Raw architecture project. Saman was a co-founder of Determina, Lanka Internet Services, Venti Technologies, and DataCebo Corporations. Saman received his BS in Electrical Engineering and Computer Science from Cornell University in 1988, and his MSEE and Ph.D. from Stanford University in 1990 and 1997, respectively. He is an ACM Fellow. | ||
| 10:00 - 10:20 | BreakMain Conference | ||
| 11:20 - 11:40 | BreakMain Conference | ||
| 11:40 - 12:10 | Session #5: Natural-Language TechniquesMain Conference Chair(s): Weng-Fai Wong National University of Singapore | ||
| 11:4015m Talk | M3V: Multi-Modal Multi-View Context Embedding for Repair Operator Prediction Main ConferenceLink to publication | ||
| 11:5515m Talk | Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation Main Conference Zifan Nan North Carolina State University, Xipeng Shen North Carolina State University; Facebook, Hui Guan University of Massachusetts, AmherstLink to publication | ||
| 12:10 - 12:50 | BreakMain Conference | ||
| 12:50 - 13:35 | |||
| 12:5015m Talk | Recovering Container Class Types in C++ Binaries Main Conference Xudong Wang UNSW Sydney, Xuezheng Xu UNSW Sydney, Qingan Li Wuhan University, China, Jingling Xue UNSW Sydney, Yuan Mengting Wuhan UniversityLink to publication | ||
| 13:0515m Talk | Automatic Generation of Debug Headers through BlackBox Equivalence Checking Main Conference Vaibhav Kiran Kurhe Indian Institute of Technology Delhi, Pratik Karia (Indian Institute of Technology Delhi, Shubhani Gupta Indian Institute of Technology Delhi, Abhishek Rose IIT Delhi, Sorav Bansal IIT Delhi and CompilerAI LabsLink to publication | ||
| 13:2015m Talk | Gadgets Splicing: Dynamic Binary Transformation for Precise Rewriting Main Conference Linan Tian Chinese Academy of Sciences, Yangyang Shi Chinese Academy of Sciences, Liwei Chen Chinese Academy of Sciences, Yanqi Yang Chinese Academy of Sciences, Gang Shi Chinese Academy of SciencesLink to publication | ||
Wed 6 AprDisplayed time zone: Eastern Time (US & Canada) change
| 09:00 - 10:00 | Keynote (HPCA)Main Conference Integration, Specialization and Approximation: the “ISA” of Post-Moore Servers Datacenters are growing at unprecedented speeds building a foundation for global IT services, cost-effective containerized apps and novel paradigms including microservices and serverless computing. At the same time, we are entering a new era in computing where scalability no longer comes from higher density in silicon fabrication processes. Now, more than ever server designers are in search of new avenues to bridge the gap between higher demands for scalability and the diminishing returns in server density. In this talk, I will go over the basic anatomy of system hardware and software in a modern server blade which is primarily derived from the CPU-centric desktop PC of the 80s. I will then present opportunities for a clean slate design of servers based on integration, specialization and approximation as three pillars to enable server scalability in the post-Moore era. Speaker: Babak Falsafi (EcoCloud, EPFL) Babak is a Professor and the founding director of EcoCloud at EPFL. His contributions to computer systems include the first NUMA multiprocessors built by Sun Microsystems (WildFire/WildCat), memory streaming integrated in IBM BlueGene (temporal) and ARM cores (spatial), and performance evaluation methodologies in use by AMD, HP and Google PerfKit. He has shown that memory consistency models are neither necessary nor sufficient to achieve high performance in servers. These results led to fence speculation in modern CPUs. His work on workload-optimized server processors laid the foundation for the first generation of Cavium ARM server CPUs, ThunderX. He is a recipient of an Alfred P. Sloan Research Fellowship, and a fellow of ACM and IEEE. | ||
| 10:00 - 10:20 | BreakMain Conference | ||
| 11:20 - 11:35 | AwardsMain Conference Chair(s): Fabrice Rastello Inria, France, Sebastian Hack Saarland University, Germany, Tatiana Shpeisman Google | ||
| 11:35 - 12:00 | BreakMain Conference | ||
| 12:00 - 13:00 | Session #8: IR, Encryption and CompressionMain Conference Chair(s): Michel Steuwer University of Edinburgh | ||
| 12:0015m Talk | Lambda the Ultimate SSA: Optimizing Functional Programs in SSA Main ConferenceLink to publication | ||
| 12:1515m Talk | NOELLE Offers Empowering LLVM Extensions Main Conference Angelo Matni Northwestern University, Enrico Armenio Deiana Northwestern University, Yian Su Northwestern University, Lukas Gross Northwestern University, Souradip Ghosh Northwestern University, Sotiris Apostolakis Northwestern University, Ziyang Xu Princeton University, Zujun Tan Princeton University, Ishita Chaturvedi Princeton University, Brian Homerding Northwestern University, Tommy McMichen Northwestern University, David I. August Princeton University, Simone Campanoni Northwestern UniversityLink to publication | ||
| 12:3015m Talk | HECATE: Performance-aware Scale Optimization for Homomorphic Encryption Compiler Main Conference Yongwoo Lee Yonsei University, Seonyoung Heo ETH Zurich, Seonyoung Cheon Yonsei University, Changsu Kim Seoul National University, Eunkyung Kim Samsung SDS, Dongyoon Lee Stony Brook University, Hanjun Kim Yonsei UniversityLink to publication | ||
| 12:4515m Talk | Unified Compilation for Lossless Compression and Sparse Computing Main Conference Daniel Donenfeld Massachusetts Institute of Technology, Stephen Chou Massachusetts Institute of Technology, Saman Amarasinghe Massachusetts Institute of TechnologyLink to publication | ||
| 13:00 - 13:10 | |||