APLAS 2025
Mon 27 - Thu 30 October 2025 Bengaluru, India

Tensor program optimization is a non-convex optimization problem, and solving it efficiently while balancing optimization cost against execution performance remains challenging. Search-based tensor program compilers have proven effective by constructing large-scale exploration spaces that contain potentially high-performance program variants, thereby overcoming the performance bottlenecks of traditional program optimization methods. However, these approaches still face significant challenges in their search strategies: existing compilers often require hours or even days to identify the optimal program representation. This paper proposes ELTC, an end-to-end tensor program compilation framework based on large language models (LLMs), designed for efficient optimization of tensor programs in deep neural networks. ELTC formulates tensor program exploration as a generation task for language models. An offline-trained large language model generates transformation sequences for tensor programs end-to-end from their feature representations, which significantly improves optimization efficiency while preserving the broad search space. Moreover, we introduce a language-model-friendly intermediate representation that encodes key features of tensor programs in a structured textual format, and on top of this representation we construct a tensor program dataset tailored for language models. Experimental results demonstrate that ELTC achieves superior performance in both optimization quality and tuning speed. Compared with the fully converged Ansor-TenSet, ELTC achieves a 34.07× compilation speedup and an average performance improvement of 1.06× at convergence. Furthermore, ELTC outperforms the manually optimized kernel library TensorRT, achieving a 1.3× performance gain.

Keywords: Program Transformation · Compiler · Large Language Models.
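
To make the generation formulation above concrete, the following is a minimal, illustrative Python sketch (not taken from the paper) of how a tensor program's loop-nest features might be serialized into a structured textual IR and mapped to a transformation sequence. All names (`LoopFeature`, `to_textual_ir`, `generate_transform_sequence`) are hypothetical; in ELTC the transformation sequence would be decoded by the offline-trained LLM, which is stubbed here with a fixed example.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical, simplified feature record for one loop of a tensor operator.
@dataclass
class LoopFeature:
    name: str      # loop variable, e.g. "i"
    extent: int    # trip count
    kind: str      # "spatial" or "reduction"

# Hypothetical structured textual IR: encodes key features of a tensor
# program as plain text so a language model can consume it as a prompt.
def to_textual_ir(op_name: str, loops: List[LoopFeature]) -> str:
    lines = [f"op: {op_name}"]
    for lf in loops:
        lines.append(f"loop {lf.name} extent={lf.extent} kind={lf.kind}")
    return "\n".join(lines)

# Stub standing in for the offline-trained language model: maps the textual
# IR to a transformation (schedule) sequence. A real system would decode the
# sequence from the model instead of returning a fixed example.
def generate_transform_sequence(textual_ir: str) -> List[str]:
    return ["split i 32", "split j 32",
            "reorder i.outer j.outer i.inner j.inner",
            "vectorize j.inner"]

if __name__ == "__main__":
    matmul = [LoopFeature("i", 1024, "spatial"),
              LoopFeature("j", 1024, "spatial"),
              LoopFeature("k", 1024, "reduction")]
    ir_text = to_textual_ir("matmul_1024", matmul)
    print(ir_text)
    print(generate_transform_sequence(ir_text))
```

The design point this sketch assumes is that encoding program features as structured text lets schedule search be cast as sequence generation, so a single forward pass of the model replaces an iterative search over the transformation space.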