Fusion of Operators of Computational Graphs via Greedy Clustering: The XNNC Experience (CC 2025 - Main Conference)

Who

Michael Canesche, Vanderson Martins do Rosario, Edson Borin, Fernando Magno Quintão Pereira

Track

CC 2025 Main Conference

When

Sat 1 Mar 2025, 11:30 - 12:00 (GMT-08:00, Pacific Time), at Acacia A
Session: Compilers and Optimization
Chair(s): Jens Palsberg

Abstract

Tensor compilers like XLA, TVM, and TensorRT operate on computational graphs, where vertices represent operations and edges represent data flow between these operations. Operator fusion is a compiler optimization that merges operators within the computational graph to improve their efficiency. This paper presents the operator fusion algorithm recently deployed in the Xtensa Neural Network Compiler (XNNC)—Cadence Tensilica’s tensor compiler. The algorithm clusters nodes within the computational graph and iteratively grows these clusters until reaching a fixed point. A priority queue, sorted by the estimated profitability of merging cluster candidates, guides this iterative process. It balances precision and practicality, producing more efficient model implementations than XNNC’s previous fusion approach, which was based on a depth-first traversal of the computational graph. Moreover, unlike recently proposed exhaustive or evolutionary search methods, this algorithm terminates quickly while often yielding equally efficient models.
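
The clustering loop described in the abstract can be illustrated with a small sketch. This is not XNNC's implementation: the cost model (bytes of intermediate tensors that a merge would keep out of memory), the `max_cluster` size cap, the acyclicity check, and all names such as `greedy_fuse` and `edge_bytes` are assumptions made for illustration only. The sketch starts with one cluster per operator, keeps candidate merges in a priority queue ordered by estimated gain, and stops when no profitable, legal merge remains.

```python
import heapq
from itertools import count


def greedy_fuse(nodes, edge_bytes, max_cluster=3):
    """Greedy cluster growth (illustrative only; not XNNC's algorithm).

    nodes       -- operator names
    edge_bytes  -- {(producer, consumer): size of the tensor on that edge}
    max_cluster -- hypothetical cap standing in for limited local memory
    """
    cluster_of = {n: n for n in nodes}   # operator -> cluster id
    members = {n: {n} for n in nodes}    # cluster id -> operators

    def cluster_edges():
        # Edges between distinct clusters, sorted for determinism.
        return sorted({(cluster_of[u], cluster_of[v])
                       for u, v in edge_bytes
                       if cluster_of[u] != cluster_of[v]})

    def profit(a, b):
        # Assumed cost model: traffic saved by fusing a into b's producer.
        if len(members[a]) + len(members[b]) > max_cluster:
            return 0
        return sum(w for (u, v), w in edge_bytes.items()
                   if cluster_of[u] == a and cluster_of[v] == b)

    def creates_cycle(a, b):
        # Fusing a -> b is illegal if b is also reachable from a
        # through some third cluster.
        succ = {}
        for u, v in cluster_edges():
            succ.setdefault(u, set()).add(v)
        stack = [s for s in succ.get(a, ()) if s != b]
        seen = set()
        while stack:
            c = stack.pop()
            if c == b:
                return True
            if c not in seen:
                seen.add(c)
                stack.extend(succ.get(c, ()))
        return False

    heap, tie = [], count()

    def push(a, b):
        gain = profit(a, b)
        if gain > 0:
            # Sizes are stored so stale entries can be detected later.
            heapq.heappush(heap, (-gain, next(tie), a, b,
                                  len(members[a]), len(members[b])))

    for a, b in cluster_edges():
        push(a, b)

    while heap:  # an empty queue means a fixed point was reached
        _, _, a, b, size_a, size_b = heapq.heappop(heap)
        if (a not in members or b not in members
                or len(members[a]) != size_a or len(members[b]) != size_b):
            continue  # entry refers to clusters that have since changed
        if creates_cycle(a, b):
            continue
        for n in members[b]:
            cluster_of[n] = a
        members[a] |= members.pop(b)
        for u, v in cluster_edges():
            if a in (u, v):
                push(u, v)  # re-estimate merges involving the grown cluster

    return sorted(sorted(m) for m in members.values())


# Hypothetical graph: conv feeds relu and, via a skip connection, add.
edge_bytes = {("conv", "relu"): 300, ("relu", "add"): 200,
              ("conv", "add"): 500, ("add", "pool"): 100}
clusters = greedy_fuse(["conv", "relu", "add", "pool"], edge_bytes)
print(clusters)  # [['add', 'conv', 'relu'], ['pool']]
```

In this example the most profitable candidate, `conv` with `add`, is rejected at first because `relu` sits on a second path between them; once `conv` and `relu` are fused, the re-estimated merge with `add` becomes legal and is taken. The size cap then blocks fusing `pool`, so the loop terminates with two clusters.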

Michael Canesche, Cadence Design Systems

Vanderson Martins do Rosario, Cadence Design Systems

Edson Borin, State University of Campinas

Fernando Magno Quintão Pereira, Federal University of Minas Gerais, Brazil