PPoPP 2021
Sat 27 February - Wed 3 March 2021

PPoPP is the premier forum for leading work on all aspects of parallel programming, including theoretical foundations, techniques, languages, compilers, runtime systems, tools, and practical experience. In the context of the symposium, “parallel programming” encompasses work on concurrent and parallel systems (multicore, multi-threaded, heterogeneous, clustered, and distributed systems; grids; datacenters; clouds; and large scale machines). Given the rise of parallel architectures in the consumer market (desktops, laptops, and mobile devices) and data centers, PPoPP is particularly interested in work that addresses new parallel workloads and issues that arise out of extreme-scale applications or cloud platforms, as well as techniques and tools that improve the productivity of parallel programming or work towards improved synergy with such emerging architectures.

Dates

Mon 1 Mar
Times are displayed in Eastern Time (US & Canada).

08:45 - 09:00
09:00 - 10:00
Keynote 1 (Main Conference)
Chair(s): Erez Petrank (Technion)
09:00
60m
Keynote
Atomicity without Trust
Main Conference
Maurice Herlihy (Brown University)
10:00 - 11:00
Session 1. Concurrency (Main Conference)
Chair(s): Konstantinos (Kostis) Sagonas (Uppsala University and National Technical University of Athens)
10:00
15m
Talk
Efficient Algorithms for Persistent Transactional Memory
Main Conference
Pedro Ramalhete (Cisco Systems), Andreia Correia (University of Neuchâtel, Switzerland), Pascal Felber (University of Neuchâtel)
Link to publication
10:15
15m
Talk
Investigating the Semantics of Futures in Transactional Memory Systems
Main Conference
Jingna Zeng (KTH Royal Institute of Technology, Sweden / IST, ULisboa), Shady Issa (INESC-ID), Paolo Romano (INESC-ID / IST, ULisboa), Luis Rodrigues (INESC-ID, IST, ULisboa), Seif Haridi (KTH Royal Institute of Technology, Sweden)
Link to publication
10:30
15m
Talk
Constant-Time Snapshots with Applications to Concurrent Data Structures
Main Conference
Yuanhao Wei (Carnegie Mellon University), Naama Ben-David (VMware Research), Guy E. Blelloch (Carnegie Mellon University), Panagiota Fatourou (FORTH ICS and University of Crete, Greece), Eric Ruppert (York University, Canada), Yihan Sun (University of California, Riverside)
Link to publication
10:45
15m
Talk
Reasoning About Recursive Tree Traversals
Main Conference
Yanjun Wang (Purdue University), Jinwei Liu (Beijing Jiaotong University), Dalin Zhang (Beijing Jiaotong University), Xiaokang Qiu (Purdue University)
Link to publication
11:00 - 11:10
11:10 - 12:10
Session 2. Compilers, Analysis, Synthesis (Main Conference)
Chair(s): Milind Chabbi (Uber Technologies)
11:10
15m
Talk
Synthesizing Optimal Collective Algorithms
Main Conference
Zixian Cai (Australian National University), Zhengyang Liu (University of Utah), Saeed Maleki (Microsoft Research), Madan Musuvathi (Microsoft Research), Todd Mytkowicz (Microsoft Research), Jacob Nelson (Microsoft Research), Olli Saarikivi (Microsoft Research, Redmond)
Link to publication
11:25
15m
Talk
Parallel Binary Code Analysis
Main Conference
Xiaozhu Meng (Rice University), Jonathon Anderson (Rice University), John Mellor-Crummey (Rice University), Mark W. Krentel (Rice University), Barton P. Miller (University of Wisconsin-Madison), Srđan Milaković (Rice University)
Link to publication
11:40
15m
Talk
Compiler Support for Near Data Computing
Main Conference
Mahmut Taylan Kandemir (Penn State University, USA), Jihyun Ryoo (Penn State University, USA), Xulong Tang (University of Pittsburgh, USA), Mustafa Karakoy (TUBITAK-BILGEM, Turkey)
Link to publication
11:55
15m
Talk
Scaling Implicit Parallelism via Dynamic Control Replication
Main Conference
Michael Bauer (NVIDIA), Wonchan Lee (NVIDIA), Elliott Slaughter (SLAC National Accelerator Laboratory), Zhihao Jia (Carnegie Mellon University), Mario Di Renzo (Sapienza University of Rome), Manolis Papadakis (NVIDIA), Galen Shipman (Los Alamos National Laboratory), Patrick McCormick (Los Alamos National Laboratory), Michael Garland (NVIDIA), Alex Aiken (Stanford University)
Link to publication
12:10 - 12:30
12:30 - 13:30
Session 3. Graphs (Main Conference)
Chair(s): Ang Li (Pacific Northwest National Laboratory)
12:30
15m
Talk
Understanding and Bridging the Gaps in Current GNN Performance Optimizations
Main Conference
Kezhao Huang (Tsinghua University), Jidong Zhai (Tsinghua University), Zhen Zheng (Alibaba Group), Youngmin Yi (University of Seoul), Xipeng Shen (North Carolina State University)
Link to publication
12:45
15m
Talk
A Fast Work-Efficient SSSP Algorithm for GPUs
Main Conference
Kai Wang (University of Texas at Austin), Donald Fussell (University of Texas at Austin), Calvin Lin (University of Texas at Austin)
Link to publication
13:00
15m
Talk
ShadowVM: Accelerating Data Plane for Data Analytics with Bare Metal CPUs and GPUs
Main Conference
Zhifang Li (East China Normal University), Mingcong Han (East China Normal University), Shangwei Wu (East China Normal University), Chuliang Weng (East China Normal University)
Link to publication
13:15
15m
Talk
BiPart: A Parallel and Deterministic Hypergraph Partitioner
Main Conference
Sepideh Maleki (The University of Texas at Austin), Udit Agarwal (UT Austin), Martin Burtscher (Texas State University), Keshav Pingali (The University of Texas at Austin)
Link to publication
13:30 - 14:30
Business Meeting (Main Conference)

Tue 2 Mar

09:00 - 10:00
09:00
60m
Keynote
Data Layout and Data Representation Optimizations to Reduce Data Movement
Main Conference
Mary Hall (University of Utah)
10:00 - 11:00
Session 4. Memory Management (Main Conference)
Chair(s): Naama Ben-David (VMware Research)
10:00
15m
Talk
NBR: Neutralization Based Reclamation
Main Conference
Ajay Singh (University of Waterloo), Trevor Brown (University of Waterloo), Ali Mashtizadeh (University of Waterloo)
Link to publication
10:15
15m
Talk
Efficiently Reclaiming Memory in Concurrent Search Data Structures While Bounding Wasted Memory
Main Conference
Daniel Solomon (Tel Aviv University), Adam Morrison (Tel Aviv University)
Link to publication
10:30
15m
Talk
OrcGC: Automatic Lock-Free Memory Reclamation
Main Conference
Andreia Correia (University of Neuchâtel, Switzerland), Pedro Ramalhete (Cisco Systems), Pascal Felber (University of Neuchâtel)
Link to publication
10:45
15m
Talk
Are Dynamic Memory Managers on GPUs Slow? A Survey and Benchmarks
Main Conference
Martin Winter (Graz University of Technology), Mathias Parger (Graz University of Technology), Daniel Mlakar (Graz University of Technology), Markus Steinberger (Graz University of Technology)
Link to publication
11:00 - 11:10
11:10 - 12:10
Session 5. Auto Tuning (Main Conference)
Chair(s): Saeed Maleki (Microsoft Research)
11:10
20m
Talk
GPTune: Multitask Learning for Autotuning Exascale Applications
Main Conference
Yang Liu, Wissam M. Sid-Lakhdar (Lawrence Berkeley National Laboratory), Osni Marques (Lawrence Berkeley National Laboratory), Xinran Zhu (Cornell University), Chang Meng (Emory University), James W. Demmel (UC Berkeley), Xiaoye S. Li (Lawrence Berkeley National Laboratory)
Link to publication
11:30
20m
Talk
I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
Main Conference
Xiaoyang Zhang (Institute of Computing Technology, Chinese Academy of Sciences), Junmin Xiao (Institute of Computing Technology, Chinese Academy of Sciences), Guangming Tan (Institute of Computing Technology, Chinese Academy of Sciences)
Link to publication
11:50
20m
Talk
ApproxTuner: A Compiler and Runtime System for Adaptive Approximations
Main Conference
Hashim Sharif (University of Illinois at Urbana-Champaign), Yifan Zhao (University of Illinois at Urbana-Champaign), Maria Kotsifakou (Runtime Verification, Inc.), Akash Kothari (University of Illinois at Urbana-Champaign), Ben Schreiber (University of Illinois at Urbana-Champaign), Elizabeth Wang (University of Illinois at Urbana-Champaign), Yasmin Sarita (Cornell University), Nathan Zhao (University of Illinois at Urbana-Champaign), Keyur Joshi (University of Illinois at Urbana-Champaign), Vikram S. Adve (University of Illinois at Urbana-Champaign), Sasa Misailovic (University of Illinois at Urbana-Champaign), Sarita Adve (University of Illinois at Urbana-Champaign)
Link to publication
12:10 - 12:30
12:30 - 13:18
Session 6. Posters 1 (Main Conference)
Chair(s): Adam Morrison (Tel Aviv University)
12:30
6m
Talk
POSTER: On Group Mutual Exclusion for Dynamic Systems
Main Conference
Shreyas Gokhale (The University of Texas at Dallas), Sahil Dhoked (The University of Texas at Dallas), Neeraj Mittal (The University of Texas at Dallas)
Link to publication
12:36
6m
Talk
POSTER: Bundled References: An Abstraction for Highly-Concurrent Linearizable Range Queries
Main Conference
Jacob Nelson (Lehigh University), Ahmed Hassan (Lehigh University), Roberto Palmieri (Lehigh University)
Link to publication
12:42
6m
Talk
POSTER: Verifying C11-Style Weak Memory Libraries
Main Conference
Sadegh Dalvandi (University of Surrey), Brijesh Dongol (University of Surrey)
Link to publication
12:48
6m
Talk
POSTER: A Lock-free Relaxed Concurrent Queue for Fast Work Distribution
Main Conference
Giorgos Kappes (University of Ioannina), Stergios V. Anastasiadis (University of Ioannina)
Link to publication
12:54
6m
Talk
POSTER: A more Pragmatic Implementation of the Lock-free, Ordered, Linked List
Main Conference
Jesper Träff (TU Wien, Austria), Manuel Pöter (TU Wien, Austria)
Link to publication
13:00
6m
Talk
POSTER: Extending MapReduce Framework with Locality Keys
Main Conference
Yifeng Cheng (Peking University, China), Bei Wang (Peking University, China), Xiaolin Wang (Peking University, China)
Link to publication
13:06
6m
Talk
POSTER: On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal LU Factorization
Main Conference
Grzegorz Kwasniewski (ETH Zurich), Tal Ben-Nun (Department of Computer Science, ETH Zurich), Alexandros Nikolaos Ziogas (ETH Zurich), Timo Schneider (ETH Zurich), Maciej Besta (ETH Zurich), Torsten Hoefler (ETH Zurich)
Link to publication
13:12
6m
Talk
POSTER: Asynchrony versus Bulk-Synchrony for a Generalized N-body Problem from Genomics
Main Conference
Marquita Ellis (University of California at Berkeley & Lawrence Berkeley National Lab), Aydın Buluç (University of California at Berkeley & Lawrence Berkeley National Lab), Katherine Yelick (University of California at Berkeley & Lawrence Berkeley National Lab)
Link to publication
13:18 - 13:30
13:30 - 14:30
Session 7. Posters 2 (Main Conference)
Chair(s): Todd Mytkowicz (Microsoft Research)
13:30
6m
Talk
POSTER: In-situ Workflow Auto-tuning through Combining Component Models
Main Conference
Tong Shu (Southern Illinois University Carbondale), Yanfei Guo (Argonne National Laboratory), Justin Wozniak (Argonne National Laboratory), Xiaoning Ding (New Jersey Institute of Technology), Ian Foster (Argonne National Laboratory and University of Chicago), Tahsin Kurc (Stony Brook University)
Link to publication
13:36
6m
Talk
POSTER: Simplifying Low-Level GPU Programming with GAS
Main Conference
Da Yan (Hong Kong University of Science and Technology), Wei Wang (Hong Kong University of Science and Technology), Xiaowen Chu (Hong Kong Baptist University)
Link to publication
13:42
6m
Talk
POSTER: Corder: Cache-Aware Reordering for Optimizing Graph Analytics
Main Conference
YuAng Chen (The Chinese University of Hong Kong, Shenzhen), Yeh-Ching Chung (The Chinese University of Hong Kong, Shenzhen)
Link to publication
13:48
6m
Talk
POSTER: DFOGraph: An I/O- and Communication-Efficient System for Distributed Fully-out-of-Core Graph Processing
Main Conference
Jiping Yu (Tsinghua University), Wei Qin (Tsinghua University), Xiaowei Zhu (Tsinghua University), Zhenbo Sun (Tsinghua University), Jianqiang Huang (Tsinghua University), Xiaohan Li (Tsinghua University), Wenguang Chen (Tsinghua University)
Link to publication
13:54
6m
Talk
POSTER: An Efficient Uncertain Graph Processing Framework for Heterogeneous Architectures
Main Conference
Heng Zhang (Institute of Software, Chinese Academy of Sciences; University of Sydney), Lingda Li (Brookhaven National Laboratory), Donglin Zhuang (University of Sydney), Rui Liu (University of Chicago), Shuang Song (Facebook Inc.), Dingwen Tao (Washington State University), Yanjun Wu (Institute of Software, Chinese Academy of Sciences), Shuaiwen Leon Song (University of Sydney)
Link to publication
14:00
6m
Talk
POSTER: Dynamic Scaling for Low-Precision Learning
Main Conference
Ruobing Han (Peking University), Min Si (Argonne National Laboratory), James W. Demmel (UC Berkeley), Yang You (UC Berkeley)
Link to publication
14:06
6m
Talk
POSTER: Exploring Deep Reuse in Winograd CNN Inference
Main Conference
Ruofan Wu (Renmin University of China), Feng Zhang (Renmin University of China), Zhen Zheng (Alibaba Group), Xiaoyong Du (Renmin University of China), Xipeng Shen (North Carolina State University)
Link to publication
14:12
6m
Talk
POSTER: A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression
Main Conference
Sian Jin (Washington State University), Guanpeng Li (University of Iowa), Shuaiwen Leon Song (University of Sydney), Dingwen Tao (Washington State University)
Link to publication
14:18
6m
Talk
POSTER: FFT Blitz: The Tensor Cores Strike Back
Main Conference
Sultan Durrani (University of Illinois at Urbana-Champaign), Muhammad Saad Chughtai (Georgia Institute of Technology), Abdul Dakkak (University of Illinois at Urbana-Champaign), Wen-mei Hwu (University of Illinois at Urbana-Champaign), Lawrence Rauchwerger (UIUC)
Link to publication
14:24
6m
Break
Main Conference
14:30 - 15:30
Industry Sponsors (Main Conference)
14:30
60m
Meeting
Chat with Researchers from Microsoft
Main Conference
Madan Musuvathi (Microsoft Research), Tim Harris (Microsoft, UK), Abdul Dakkak (Microsoft)

Wed 3 Mar

09:00 - 10:00
09:00
60m
Keynote
A Journey to a Commercial-Grade Function-In-Memory (FIM) Chip Development
Main Conference
Nam Sung Kim (University of Illinois at Urbana-Champaign)
10:00 - 11:00
Session 8. Scientific Computing & Optimizations (Main Conference)
Chair(s): Tim Harris (Microsoft, UK)
10:00
15m
Talk
EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision
Main Conference
Boyuan Feng (UC Santa Barbara), Yuke Wang (UC Santa Barbara), Guoyang Chen (Alibaba Group US Inc.), Weifeng Zhang (Alibaba Group US Inc.), Yuan Xie (UCSB), Yufei Ding (UC Santa Barbara)
Link to publication
10:15
15m
Talk
Efficiently Running SpMV on Long Vector Architectures
Main Conference
Constantino Gómez (Barcelona Supercomputing Center), Filippo Mantovani (Barcelona Supercomputing Center), Erich Focht (NEC), Marc Casas (Barcelona Supercomputing Center)
Link to publication
10:30
15m
Talk
Improving Communication by Optimizing On-Node Data Movement with Data Layout
Main Conference
Tuowen Zhao (University of Utah), Mary Hall (University of Utah), Hans Johansen (Lawrence Berkeley National Laboratory), Samuel Williams (Lawrence Berkeley National Laboratory)
Link to publication
10:45
15m
Talk
Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory
Main Conference
Jiawen Liu (University of California, Merced), Jie Ren (University of California, Merced), Roberto Gioiosa (Pacific Northwest National Laboratory), Dong Li (University of California, Merced), Jiajia Li (Pacific Northwest National Laboratory)
Link to publication
11:00 - 11:10
11:10 - 12:10
Session 9. Tasks, Threads, and Fault Tolerance (Main Conference)
Chair(s): Pascal Felber (University of Neuchâtel)
11:10
15m
Talk
Advanced Synchronization Techniques for Task-based Runtime Systems
Main Conference
David Álvarez (Barcelona Supercomputing Center), Kevin Sala (Barcelona Supercomputing Center), Marcos Maroñas (Barcelona Supercomputing Center), Aleix Roca (Barcelona Supercomputing Center), Vicenç Beltran (Barcelona Supercomputing Center)
Link to publication
11:25
15m
Talk
An Ownership Policy and Deadlock Detector for Promises
Main Conference
Caleb Voss (Georgia Institute of Technology), Vivek Sarkar (Georgia Institute of Technology)
Link to publication
11:40
15m
Talk
Understanding a Program's Resiliency Through Error Propagation
Main Conference
Zhimin Li, Harshitha Menon (Lawrence Livermore National Laboratory), Kathryn Mohror (Lawrence Livermore National Laboratory), Peer-Timo Bremer (Lawrence Livermore National Laboratory), Yarden Livant (University of Utah), Valerio Pascucci (University of Utah)
Link to publication
11:55
15m
Talk
Lightweight Preemptive User-Level Threads
Main Conference
Shumpei Shiina (The University of Tokyo), Shintaro Iwasaki (Argonne National Laboratory), Kenjiro Taura (The University of Tokyo), Pavan Balaji (Argonne National Laboratory)
Link to publication
12:10 - 12:30
12:30 - 13:30
Session 10. Machine Learning and Software Engineering (Main Conference)
Chair(s): Albert Cohen (Google)
12:30
15m
Talk
TurboTransformers: An Efficient GPU Serving System For Transformer Models
Main Conference
Jiarui Fang (Tencent), Yang Yu, Chengduo Zhao (Tencent), Jie Zhou (Tencent)
Link to publication
12:45
15m
Talk
Extracting Clean Performance Models from Tainted Programs
Main Conference
Marcin Copik (ETH Zurich), Alexandru Calotoiu (ETH Zurich), Tobias Grosser (University of Edinburgh), Nicolas Wicki (ETH Zurich), Felix Wolf (TU Darmstadt), Torsten Hoefler (ETH Zurich)
Link to publication Pre-print
13:00
15m
Talk
Modernizing Parallel Code with Pattern Analysis
Main Conference
Roberto Castañeda Lozano (University of Edinburgh), Murray Cole (University of Edinburgh), Björn Franke (University of Edinburgh)
Link to publication
13:15
15m
Talk
DAPPLE: A Pipelined Data Parallel Approach for Training Large Models
Main Conference
Shiqing Fan (Alibaba Group), Yi Rong (Alibaba Group), Chen Meng (Alibaba Group), ZongYan Cao (Alibaba Group), Siyu Wang (Alibaba Group), Zhen Zheng (Alibaba Group), Chuan Wu (The University of Hong Kong), Guoping Long (Alibaba Group), Jun Yang (Alibaba Group), LiXue Xia (Alibaba Group), Lansong Diao (Alibaba Group), Xiaoyong Liu (Alibaba Group), Wei Lin (Alibaba Group)
Link to publication
13:30 - 15:00
Joint Session Panel (Main Conference)

Panelists: John L. Hennessy (Alphabet and Stanford), David Patterson (Google and U.C. Berkeley), Margaret Martonosi (NSF CISE and Princeton), Bill Dally (NVIDIA and Stanford), Natalie Enright Jerger (University of Toronto and ACM D&I Council), Kim Hazelwood (Facebook AI Research), Timothy M. Pinkston (USC)

13:30
90m
Other
Valuing Diversity, Equity, and Inclusion in Our Computing Community
Main Conference

Artifact Evaluation

Artifact evaluation submission site

Due: 11:59pm, November 28, 2020 (AoE) (extended from November 26, 2020)

Call for Artifacts

Authors of accepted PPoPP 2021 papers/posters are invited to formally submit their supporting materials to the Artifact Evaluation (AE) process. The Artifact Evaluation Committee attempts to reproduce the experiments (in broad strokes) and assess whether the submitted artifact supports the claims made in the paper/poster. Submission is voluntary and does not influence the final decision regarding paper/poster acceptance.

We invite every author of an accepted PPoPP paper/poster to consider submitting an artifact. It is good for the community as a whole. At PPoPP, we follow ACM's artifact reviewing and badging policy. ACM describes a research artifact as follows:

"By "artifact" we mean a digital object that was either created by the authors to be used as part of the study or generated by the experiment itself. For example, artifacts can be software systems, scripts used to run experiments, input datasets, raw data collected in the experiment, or scripts used to analyze results."

Submission Site

The submission site is located at https://ppopp21ae.hotcrp.com/.

Evaluation Process

Artifact evaluation is single-blind. Please take precautions (e.g., turning off analytics and logging) to help prevent accidentally learning the identities of reviewers. Each submitted artifact is evaluated by at least two members of the artifact evaluation committee.

During the process, authors and evaluators may communicate with each other anonymously to overcome technical difficulties. Ideally, we hope to see all submitted artifacts successfully pass artifact evaluation.

The evaluators are asked to evaluate the artifact based on the following criteria, which are defined by ACM.

ACM recommends awarding three different types of badges to communicate how the artifact has been evaluated. A single paper can receive up to three badges — one badge of each type.



The green Artifacts Available badge indicates that an artifact is publicly accessible in an archival repository. The paper does not have to be independently evaluated for this badge to be awarded. ACM requires that a qualified archival repository be used, for example Zenodo, figshare, or Dryad. Personal web pages, GitHub repositories, and the like are not sufficient, because their contents can be changed after the submission deadline.
The red Artifacts Evaluated badges indicate that a research artifact has successfully completed an independent audit: a reviewer has verified that the artifact is documented, complete, consistent, and exercisable, and includes appropriate evidence of verification and validation. Two levels are distinguished:

The lighter red Artifacts Evaluated — Functional badge indicates a basic level of functionality. The darker red Artifacts Evaluated — Reusable badge indicates a higher-quality artifact that significantly exceeds minimal functionality, so that reuse and repurposing are facilitated.

Artifacts need not be made publicly available to be considered for one of these badges. However, they do need to be made available to reviewers.
The blue Results Validated badges indicate that the main results of the paper have been successfully obtained by an independent reviewer. Two levels are distinguished:

The darker blue Results Reproduced badge indicates that the main results of the paper have been successfully obtained using the provided artifact. The lighter blue Results Replicated badge indicates that the main results of the paper have been independently obtained without using the author-provided research artifact.

Artifacts need not be made publicly available to be considered for one of these badges. However, they do need to be made available to reviewers.



At PPoPP, the artifact evaluation committee awards each successfully evaluated paper one of the two red Artifacts Evaluated badges as well as the darker blue Results Reproduced badge. We do not award the lighter blue Results Replicated badge in this artifact evaluation process. The green Artifacts Available badge does not require a formal audit and is therefore awarded directly by the publisher, provided the authors supply a link to the deposited artifact.

Note that variation of empirical and numerical results is tolerated; in fact, it is often unavoidable in computer systems research. See "How to report and compare empirical results?" in the AE FAQ on ctuning.org.

Packaging and Instructions

Your submission should consist of three pieces:

  1. The submission version of your paper/poster.
  2. A README file (PDF or plaintext format) that explains your artifact (details below).
  3. The artifact itself, packaged as a single archive file. Artifacts less than 600MB can be directly uploaded to the hotCRP submission site; for archives larger than 600MB, please provide a URL pointing to the artifact; the URL must protect the anonymity of the reviewers. Please use a widely available compressed archive format such as ZIP (.zip), tar and gzip (.tgz), or tar and bzip2 (.tbz2). Ensure the file has the suffix indicating its format. Those seeking the "Available" badge must additionally follow the appropriate instructions recommended by ACM on uploading the archive to a publicly available, immutable location to receive the badge.
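As a minimal sketch of step 3, the archive can be produced with standard tools; the directory layout and file names below are placeholders, not requirements:

```shell
# Hypothetical artifact layout; substitute your real contents.
mkdir -p artifact
printf 'See the README for the Getting Started Guide.\n' > artifact/README.txt
tar -czf artifact.tgz artifact      # .tgz suffix signals tar+gzip, as requested
tar -tzf artifact.tgz               # sanity check: the archive lists cleanly
```

Listing the archive before uploading is a cheap way to catch a truncated or mis-suffixed file.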

The README file should consist of two parts:

  1. a Getting Started Guide and
  2. Step-by-Step Instructions for how you propose to evaluate your artifact (with appropriate connections to the relevant sections of your paper);

The Getting Started Guide should contain setup instructions (including, for example, a pointer to the VM player software, its version, passwords if needed, etc.) and basic testing of your artifact that you expect a reviewer to be able to complete in 30 minutes. Reviewers will follow all the steps in the guide during an initial kick-the-tires phase. The Getting Started Guide should be as simple as possible, and yet it should stress the key elements of your artifact. Anyone who has followed the Getting Started Guide should have no technical difficulties with the rest of your artifact. In this step, you may want to include a single high-level "runme.sh" script that automatically compiles your artifact, runs it (printing some interesting events to the console), collects data (e.g., performance data), and produces files such as graphs or charts similar to the ones used in your paper.
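A "runme.sh" of the kind suggested above might look like the following sketch; every command is a placeholder standing in for your real build and run steps:

```shell
#!/bin/sh
# runme.sh sketch: placeholder commands, to be replaced by the artifact's real steps.
set -e
echo "[1/3] Building artifact..."
mkdir -p build results
# make -C src all                             # real compile step goes here
echo "[2/3] Running a small smoke test..."
# ./build/bench --small > results/smoke.log   # real run step goes here
echo "smoke-test placeholder" > results/smoke.log
echo "[3/3] Done: outputs collected under results/"
```

Keeping the whole kick-the-tires phase behind one script lets a reviewer verify the artifact builds and runs before investing in the full evaluation.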

The Step-by-Step Instructions explain how to reproduce any experiments or other activities that support the conclusions in your paper. Write them for readers who have a deep interest in your work and are studying it to improve it or compare against it. If your artifact runs for more than a few minutes, say so and explain how to run it on smaller inputs.

Where appropriate, include descriptions of and links to files (included in the archive) that represent expected outputs (e.g., the speedup comparison chart expected to be generated by your tool on the given inputs); if there are warnings that are safe to be ignored, explain which ones they are.

The artifact's documentation should include the following:

  • A list of claims from the paper supported by the artifact, and how/why.
  • A list of claims from the paper not supported by the artifact, and how/why not. Example: performance claims cannot be reproduced in a VM, or the authors are not allowed to redistribute specific benchmarks. Artifact reviewers can then center their evaluation on these specific claims.

If you are seeking a "reusable" badge, your documentation should suggest which aspects of the artifact the reviewer could exercise in a different setting. For example, you may want to point out which script to modify so that the reviewer can run your tool on a benchmark not used in the paper, or where to edit a script to change the number of CPU cores used for evaluation.
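One common way to expose such knobs is through environment variables with documented defaults; the script name and both variables below are hypothetical illustrations, not part of any PPoPP submission:

```shell
#!/bin/sh
# run_experiment.sh sketch: CORES and BENCH are hypothetical reviewer-facing knobs.
CORES="${CORES:-4}"        # reviewer overrides via: CORES=16 ./run_experiment.sh
BENCH="${BENCH:-default}"  # point reviewers at this line to swap in a new benchmark
echo "running benchmark '$BENCH' on $CORES cores"
```

The README can then say exactly which line (or variable) to change, which is what reviewers assessing reusability look for.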

Submission Guidelines

1. Carefully consider which badges you want.

In your hotCRP submission, be upfront about which badge(s) you are seeking.

  1. If making your code public is all you want to do, seek only the "available" badge. The reviewers will not exercise the artifact for its functionality or validate the claims.
  2. If you do not plan on making the artifact available to the public, do not seek the "available" badge.
  3. If you only plan to reproduce the claims without making your artifact documented, consistent, complete, and exercisable, do not seek the "functional" badge.

2. Minimize the artifact setup overhead

A well-packaged artifact is easily usable by the reviewers, saving them time and frustration, and more clearly conveying the value of your work during evaluation. A great way to package an artifact is as a Docker image or in a virtual machine that runs "out of the box" with very little system-specific configuration. Using a virtual machine provides a way to make an easily reproducible environment — it is less susceptible to bit rot. It also helps the AEC have confidence that errors or other problems cannot cause harm to their machines.
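A container of the kind described above can be sketched as follows; the base image, package list, and paths are all placeholder assumptions to adapt to your artifact:

```dockerfile
# Sketch of an AE container; base image and paths are placeholders.
FROM ubuntu:20.04
RUN apt-get update && apt-get install -y --no-install-recommends build-essential python3
COPY . /artifact
WORKDIR /artifact
# Kick-the-tires entry point: reviewers run the same script documented in the README.
CMD ["./runme.sh"]
```

Pinning the base image tag (rather than `latest`) is what makes the environment reproducible for reviewers months later.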

Giving AE reviewers remote access to your machines with preinstalled (proprietary) software is also possible.

The submission of an artifact is not the same as making it public. AEC members will be instructed that they may not publicize any part of your artifact during or after completing evaluation, nor retain any part of it after evaluation. Thus, you are free to include models, data files, proprietary binaries, and similar items in your artifact.

After preparing your artifact, download and test it on at least one fresh machine where you did not prepare the artifact; this will help you fix missing dependencies, if any.

3. Carefully think about how your artifact will work on a reviewer's machine

The reviewers will not have access to any special hardware or software beyond what their university or research team provides for their own research. More tips on preparing a submission are available on the ctuning website.

If you have an unusual experimental setup that requires specific hardware (e.g., custom hardware or oscilloscopes for measurements) or proprietary software, please contact the artifact evaluation chairs before the submission.

Discussion with Reviewers

Throughout the review period, reviews will be submitted to HotCRP and will be (approximately) continuously visible to authors. AEC reviewers will be able to continuously interact (anonymously) with authors for clarifications, system-specific patches, and other logistics to help ensure that the artifact can be evaluated. The goal of continuous interaction is to prevent rejecting artifacts for "wrong library version" types of problems.

For questions, please contact AE co-chairs, Milind Chabbi (milind@uber.com) and Xu Liu (xliu88@ncsu.edu).

Call for Papers

PPoPP 2021: 26th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming

Seoul, S. Korea (co-located with CC 2021, HPCA 2021, and CGO 2021). Dates: February 27–March 3, 2021.

Submission URL: https://ppopp21.hotcrp.com


Important dates:

  • Paper registration and abstract submission: August 6, 2020
  • Full paper submission: August 13, 2020
  • Early notification for papers not passing first stage: October 10, 2020
  • Author response period: October 30–November 2, 2020
  • Author Notification: November 16, 2020
  • Artifact submission to AE committee (tentative): November 26, 2020
  • Artifact notification by AE committee (tentative): December 26, 2020
  • Final paper due: January 2, 2021

All deadlines are at midnight anywhere on earth (AoE), and are firm.


Scope:

PPoPP is the premier forum for leading work on all aspects of parallel programming, including theoretical foundations, techniques, languages, compilers, runtime systems, tools, and practical experience. In the context of the symposium, “parallel programming” encompasses work on concurrent and parallel systems (multicore, multi-threaded, heterogeneous, clustered, and distributed systems; grids; data centers; clouds; and large scale machines). Given the rise of parallel architectures in the consumer market (desktops, laptops, and mobile devices) and data centers, PPoPP is particularly interested in work that addresses new parallel workloads and issues that arise out of extreme-scale applications or cloud platforms, as well as techniques and tools that improve the productivity of parallel programming or work towards improved synergy with such emerging architectures.

Specific topics of interest include (but are not limited to):

  • Compilers and runtime systems for parallel and heterogeneous systems
  • Concurrent data structures
  • Development, analysis, or management tools
  • Fault tolerance for parallel systems
  • Formal analysis and verification
  • High-performance / scientific computing
  • Libraries
  • Middleware for parallel systems
  • Parallel algorithms
  • Parallel applications and frameworks
  • Parallel programming for deep memory hierarchies including nonvolatile memory
  • Parallel programming languages
  • Parallel programming theory and models
  • Parallelism in non-scientific workloads: web, search, analytics, cloud, machine learning
  • Performance analysis, debugging and optimization
  • Programming tools for parallel and heterogeneous systems
  • Software engineering for parallel programs
  • Software for heterogeneous architectures
  • Software productivity for parallel programming
  • Synchronization and concurrency control

Papers should report on original research relevant to parallel programming and should contain enough background materials to make them accessible to the entire parallel programming research community. Papers describing experience should indicate how they illustrate general principles or lead to new insights. PPoPP submissions will be evaluated based on their technical merit and accessibility. Submissions should clearly motivate the importance of the problem being addressed, compare to the existing body of work on the topic, and explicitly and precisely state the paper’s key contributions and results towards addressing the problem. Submissions should strive to be accessible both to a broad audience and to experts in the area. Authors of papers that do not pass the first round of reviewing will receive a notification so that they can start working as early as possible on revising their papers and resubmitting them to other conferences or journals.


Paper Submission:

Conference submission site: https://ppopp21.hotcrp.com.

All submissions must be made electronically through the conference web site and must include an abstract (100–400 words), author contact information, and the full list of authors and their affiliations. Full paper submissions must be in PDF format, printable on both A4 and US letter size paper.

All papers must be prepared in ACM Conference Format using the two-column acmart format: use the SIGPLAN proceedings template acmart-sigplanproc-template.tex for LaTeX, and interim-layout.docx for Word. You may also want to consult the official ACM information on the Master Article Template and related tools. Important note: the Word template (interim-layout.docx) on the ACM website uses a 9pt font; you need to increase it to 10pt.

Papers should contain a maximum of 10 pages of text (in a typeface no smaller than 10 point) or figures, NOT INCLUDING references. There is no page limit for references, and they must include the names of all authors (not “et al.”). Appendices are not allowed, but authors may submit supplementary material, such as proofs or source code; all supplementary material must be in PDF format. Looking at supplementary material is at the discretion of the reviewers.

Submission is double-blind, and authors will need to identify any potential conflicts of interest with PC and Extended Review Committee members, as defined in the ACM SIGPLAN policy: http://www.sigplan.org/Resources/Policies/Review/.

PPoPP 2021 will employ a lightweight double-blind reviewing process. To facilitate this process, submissions should not reveal the identity of the authors in any way. Authors should leave out author names and affiliations from the body of their submission and from the supplementary material. They should also ensure that any references to the authors’ own related work are in the third person (e.g., not “We build on our previous work …” but rather “We build on the work of …”). The purpose of this process is to help the PC and external reviewers come to an initial judgment about the paper without bias, not to make it impossible for them to discover the authors if they were to try. Nothing should be done in the name of anonymity that weakens the submission or makes the job of reviewing the paper more difficult. In particular, important background references should not be omitted or anonymized. In addition, authors should feel free to disseminate their ideas or draft versions of their paper as they normally would. For instance, authors may post drafts of their papers on the web or give talks on their research ideas. Authors with further questions on double-blind reviewing are encouraged to contact the Program Chair by email.

Submissions should be in PDF and printable on both US Letter and A4 paper. Papers may be resubmitted to the submission site multiple times up until the deadline, but the last version submitted before the deadline will be the version reviewed. Papers that exceed the length requirement, that deviate from the expected format, or that are submitted late will be rejected.

All submissions that are not accepted for regular presentations will be automatically considered for posters. Two-page summaries of accepted posters will be included in the conference proceedings.

To allow reproducibility, we encourage authors of accepted papers to submit their papers for Artifact Evaluation (AE). The AE process begins after the acceptance notification and is run by a separate committee whose task is to assess how the artifacts support the work described in the papers. Artifact evaluation is voluntary and will not affect paper acceptance, but it will be taken into consideration when selecting papers for awards. Papers that go through the AE process successfully will receive one or several of the ACM reproducibility badges, printed on the papers themselves. More information will be posted on the AE website.

Deadlines expire at midnight anywhere on earth.


Publication Date:

The titles of all accepted papers are typically announced shortly after the author notification date (late November 2020). Note, however, that this is not the official publication date. The official publication date is the date the proceedings are made available in the ACM Digital Library. ACM will make the proceedings available via the Digital Library for one month, up to 2 weeks prior to the first day of the conference. The official publication date affects the deadline for any patent filings related to published work.

Accepted Papers

- A Fast Work-Efficient SSSP Algorithm for GPUs
- Advanced Synchronization Techniques for Task-based Runtime Systems
- An Ownership Policy and Deadlock Detector for Promises
- ApproxTuner: A Compiler and Runtime System for Adaptive Approximations
- Are Dynamic Memory Managers on GPUs Slow? A Survey and Benchmarks
- BiPart: A Parallel and Deterministic Hypergraph Partitioner
- Compiler Support for Near Data Computing
- Constant-Time Snapshots with Applications to Concurrent Data Structures
- DAPPLE: A Pipelined Data Parallel Approach for Training Large Models
- EGEMM-TC: Accelerating Scientific Computing on Tensor Cores with Extended Precision
- Efficient Algorithms for Persistent Transactional Memory
- Efficiently Reclaiming Memory in Concurrent Search Data Structures While Bounding Wasted Memory
- Efficiently Running SpMV on Long Vector Architectures
- Extracting Clean Performance Models from Tainted Programs
- GPTune: Multitask Learning for Autotuning Exascale Applications
- I/O Lower Bounds for Auto-tuning of Convolutions in CNNs
- Improving Communication by Optimizing On-Node Data Movement with Data Layout
- Investigating the Semantics of Futures in Transactional Memory Systems
- Lightweight Preemptive User-Level Threads
- Modernizing Parallel Code with Pattern Analysis
- NBR: Neutralization Based Reclamation
- OrcGC: Automatic Lock-Free Memory Reclamation
- POSTER: A Lock-free Relaxed Concurrent Queue for Fast Work Distribution
- POSTER: A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression
- POSTER: A more Pragmatic Implementation of the Lock-free, Ordered, Linked List
- POSTER: An Efficient Uncertain Graph Processing Framework for Heterogeneous Architectures
- POSTER: Asynchrony versus Bulk-Synchrony for a Generalized N-body Problem from Genomics
- POSTER: Bundled References: An Abstraction for Highly-Concurrent Linearizable Range Queries
- POSTER: Corder: Cache-Aware Reordering for Optimizing Graph Analytics
- POSTER: DFOGraph: An I/O- and Communication-Efficient System for Distributed Fully-out-of-Core Graph Processing
- POSTER: Dynamic Scaling for Low-Precision Learning
- POSTER: Exploring Deep Reuse in Winograd CNN Inference
- POSTER: Extending MapReduce Framework with Locality Keys
- POSTER: FFT Blitz: The Tensor Cores Strike Back
- POSTER: In-situ Workflow Auto-tuning through Combining Component Models
- POSTER: On Group Mutual Exclusion for Dynamic Systems
- POSTER: On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal LU Factorization
- POSTER: Simplifying Low-Level GPU Programming with GAS
- POSTER: Verifying C11-Style Weak Memory Libraries
- Parallel Binary Code Analysis
- Reasoning About Recursive Tree Traversals
- Scaling Implicit Parallelism via Dynamic Control Replication
- ShadowVM: Accelerating Data Plane for Data Analytics with Bare Metal CPUs and GPUs
- Sparta: High-Performance, Element-Wise Sparse Tensor Contraction on Heterogeneous Memory
- Synthesizing Optimal Collective Algorithms
- TurboTransformers: An Efficient GPU Serving System For Transformer Models
- Understanding a Program's Resiliency Through Error Propagation
- Understanding and Bridging the Gaps in Current GNN Performance Optimizations