HybridSIMD: A Super C++ SIMD Library with Integrated Auto-tuning Capabilities
This program is tentative and subject to change.
Single Instruction, Multiple Data (SIMD) technology is crucial for enhancing computational efficiency in High-Performance Computing (HPC) and Artificial Intelligence (AI). While automatic vectorization methods offer ease of use, they suffer from limitations in hardware utilization due to compilers’ static analysis capabilities. Manual vectorization, on the other hand, allows for fine-grained control and potentially better hardware utilization, but manual approaches using low-level intrinsics specifically introduce challenges in portability and development complexity. Existing C++ SIMD libraries aim to address these issues but introduce new challenges such as performance and usability fragmentation and underutilization of hardware potential due to limited support for variable vector element counts. To overcome these limitations, this paper introduces HybridSIMD, a novel unified and autotunable SIMD library. HybridSIMD is designed to resolve both fragmentation and hardware underutilization by enabling operator-level hybrid collaborative optimization across different SIMD libraries through a unified interface. A built-in auto-tuning mechanism, leveraging static analysis and hierarchical search, automatically optimizes and tunes programs for high performance across diverse hardware platforms without human intervention. Experimental results across six real-world HPC benchmarks on AVX2, AVX512, and NEON architectures demonstrate that HybridSIMD outperforms state-of-the-art SIMD libraries. Notably, the highest speedups achieved by HybridSIMD are 185.34$\times$ on AVX2, 97.80$\times$ on AVX512, and 71.32$\times$ on NEON, showcasing superior computational efficiency and adaptability.
This program is tentative and subject to change.
Mon 17 NovDisplayed time zone: Seoul change
11:00 - 12:30 | |||
11:00 12mTalk | AutoFid: Adaptive and Noise-Aware Fidelity Measurement for Quantum Programs via Circuit Graph Analysis Research Papers | ||
11:12 12mTalk | HybridSIMD: A Super C++ SIMD Library with Integrated Auto-tuning Capabilities Research Papers Haolin Pan Institute of Software, Chinese Academy of Sciences;School of Intelligent Science and Technology, HIAS, UCAS, Hangzhou;University of Chinese Academy of Sciences, Xulin Zhou Institute of Software, Chinese Academy of Sciences; University of Chinese Academy of Sciences, Mingjie Xing Institute of Software, Chinese Academy of Sciences, Yanjun Wu Institute of Software, Chinese Academy of Sciences | ||
11:25 12mTalk | PEACE: Towards Efficient Project-Level Performance Optimization via Hybrid Code Editing Research Papers Xiaoxue Ren Zhejiang University, Jun Wan Zhejiang University, Yun Peng The Chinese University of Hong Kong, Zhongxin Liu Zhejiang University, Ming Liang Ant Group, Dajun Chen Ant Group, Wei Jiang Ant Group, Yong Li Ant Group | ||
11:38 12mTalk | CoTune: Co-evolutionary Configuration Tuning Research Papers Gangda Xiong University of Electronic Science and Technology of China, Tao Chen University of Birmingham Pre-print | ||
11:51 12mTalk | It's Not Easy Being Green: On the Energy Efficiency of Programming Languages Research Papers Nicolas van Kempen University of Massachusetts Amherst, USA, Hyuk-Je Kwon University of Massachusetts Amherst, Dung Nguyen University of Massachusetts Amherst, Emery D. Berger University of Massachusetts Amherst and Amazon Web Services | ||
12:04 12mTalk | When Faster Isn't Greener: The Hidden Costs of LLM-Based Code Optimization Research Papers Tristan Coignion Université de Lille - Inria, Clément Quinton Université de Lille, Romain Rouvoy University Lille 1 and INRIA | ||
12:17 12mTalk | United We Stand: Towards End-to-End Log-based Fault Diagnosis via Interactive Multi-Task Learning Research Papers Minghua He Peking University, Chiming Duan Peking University, Pei Xiao Peking University, Tong Jia Institute for Artificial Intelligence, Peking University, Beijing, China, Siyu Yu The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), Lingzhe Zhang Peking University, China, Weijie Hong Peking university, Jing Han ZTE Corporation, Yifan Wu Peking University, Ying Li School of Software and Microelectronics, Peking University, Beijing, China, Gang Huang Peking University | ||