Designing and implementing LLM guardrails components in production environments (CAIN 2025 - Research and Experience Papers)

Who

Mateus Devino, Evaline Ju, Paulo Marques Caldeira Junior

Track

CAIN 2025 Research and Experience Papers

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 27 Apr 2025 11:15 - 11:25 at 208 - Engineering AI systems with LLMs Chair(s): Justus Bogner

Abstract

With advancements in generative Artificial Intelligence (AI), there has been an increasing need for tools that rely on Large Language Models (LLMs). As these models may produce undesired answers, there is a need to prevent such events, especially in enterprise environments. Even if models are trained on safe data, user inputs and even model behavior can be unpredictable, leading to problems like leakage of confidential data that could result in revenue loss. In this paper, we describe our experiences on developing tools for “guardrailing” LLMs. We describe how we started with a quick monolith implementation, and later transitioned to a microservices architecture. As results, we share our lessons learned throughout the process, and how the re-architecture to microservices led to runtime performance gains, easier maintenance and extensibility, and also allowed us to open source the main component of the solution, so anyone can contribute and use it.

Mateus Devino

IBM

Brazil

Evaline Ju

IBM

Paulo Marques Caldeira Junior

IBM