CC 2025
Sat 1 - Sun 2 March 2025

Automatic differentiation (AD) is at the core of all machine learning frameworks and also has applications in scientific computing. Theoretical research on reverse-mode AD focuses on functional, higher-order languages, which allows AD to be formulated as a series of local, concise program rewrites. These theoretical approaches prioritize correctness but disregard efficiency. Practical implementations, by contrast, employ mutation and taping techniques to achieve efficiency, at the cost of intricate, low-level, and non-local program transformations.
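To make the functional formulation concrete, the following Haskell sketch illustrates reverse-mode AD as local rewrites: each primitive is paired with its pullback, and composing two functions rewrites locally into composing their pullbacks in reverse order. This is an illustration of the general technique only, not MimIrADe's actual rewrite system; the names D, compose, and grad are ours.

```haskell
-- A differentiable function from a to b maps an input to the primal
-- result together with a pullback: a linear map carrying an output
-- adjoint back to an input adjoint.
newtype D a b = D { runD :: a -> (b, b -> a) }

-- Sequential composition is a purely local rewrite:
-- primals compose forward, pullbacks compose in reverse order.
compose :: D b c -> D a b -> D a c
compose (D g) (D f) = D $ \x ->
  let (y, pbF) = f x
      (z, pbG) = g y
  in (z, pbF . pbG)

-- Primitive rewrites: each operation carries its local pullback.
dSin :: D Double Double
dSin = D $ \x -> (sin x, \dz -> cos x * dz)

dSquare :: D Double Double
dSquare = D $ \x -> (x * x, \dz -> 2 * x * dz)

-- grad evaluates the primal and pulls the adjoint 1 back to the input.
grad :: D Double Double -> Double -> Double
grad (D f) x = snd (f x) 1

main :: IO ()
main = print (grad (compose dSin dSquare) 2.0)
-- d/dx sin(x^2) = 2x * cos(x^2)
```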

In this work, we introduce MimIrADe, a functionally inspired AD technique implemented within a higher-order, graph-based (“sea of nodes”) intermediate representation (IR). Our method combines a streamlined implementation with standard optimizations, resulting in an efficient AD system. The higher-order nature of the IR lets us employ concise functional AD methods, expressing AD through local rewrites. This locality makes modular high-level extensions, such as matrix operations, straightforward. Additionally, the graph-based structure of the IR ensures that critical implementation aspects, particularly the handling of shared pullback invocations, are managed naturally and efficiently. Our AD pass supports a comprehensive set of features, including non-scalar types, pointers, and higher-order recursive functions.
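As a minimal sketch of why sharing pullback invocations matters (ours, not taken from the paper): when an intermediate value has several uses, reverse mode sums the adjoints from all use sites and invokes that value's pullback once. In a sea-of-nodes IR the shared node makes this sharing structural, where a purely term-based transform would have to duplicate the invocation or thread a tape.

```haskell
-- Differentiate f(x) = sin(x^2) + cos(x^2), where the intermediate
-- y = x^2 is shared by two uses. The adjoints from both use sites are
-- summed, and y's pullback pbY is invoked exactly once; in a
-- graph-based IR this single invocation is shared structurally.
sharedGrad :: Double -> Double
sharedGrad x =
  let (y, pbY) = (x * x, \dy -> 2 * x * dy)            -- shared node
      (_, pbS) = (sin y, \dz -> cos y * dz)            -- first use
      (_, pbC) = (cos y, \dz -> negate (sin y) * dz)   -- second use
  in pbY (pbS 1 + pbC 1)  -- sum use-site adjoints, one pbY call

main :: IO ()
main = print (sharedGrad 2.0)
-- d/dx (sin(x^2) + cos(x^2)) = 2x * (cos(x^2) - sin(x^2))
```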

We demonstrate on standard benchmarks that a suite of common optimizations effectively eliminates the overhead typically associated with functional AD approaches, producing differentiated code that performs on par with leading mutation- and taping-based techniques. At the same time, MimIrADe’s implementation is an order of magnitude less complex than that of its contenders.