Question 1

What is an AI guardrail?

Accepted Answer

An AI guardrail is a runtime check that inspects an AI output before it reaches a user or triggers an action, and blocks or modifies outputs that fail a quality check. Guardrails differ from offline evaluation: they run inline at inference time and affect the live system.

Question 2

Why are generic guardrails not enough for production AI?

Accepted Answer

Generic guardrails check for broad categories: toxicity, PII, basic hallucination. Domain-specific failures - a wrong medication dosage, a stale FX rate, a fabricated legal citation - do not match any generic pattern. They require guardrails calibrated to the specific ways your AI fails in your specific domain.

Question 3

How does Composo implement runtime guardrails?

Accepted Answer

Composo's evaluation runs at the inference boundary with sub-second latency. The same reward model that catches failures offline runs as a runtime pass/fail gate. One customer blocks 50% of tool calls in real time using this approach.

Question 4

What is the latency impact of Composo's guardrails?

Accepted Answer

Composo's guardrail inference is optimised for the inline case. Typical latency is 200 to 600 milliseconds depending on the complexity of the evaluation criteria, model choice, and whether ensembling is enabled.

Question 5

Can Composo guardrails block specific tool calls in agent systems?

Accepted Answer

Yes. Composo integrates at the tool-call boundary in agent frameworks (LangGraph, custom agents) so individual tool invocations can be allowed, blocked, or rewritten based on domain-specific criteria.

Block bad AI outputs before they reach customers.

Generic guardrails catch what is generic. They miss what is specific to your AI.

Generic guardrails catch

Composo catches

A calibrated reward model, running inline.

Sub-second latency

Pass / fail or rewrite

Calibrated to you

"50% of tool calls get blocked."

Frequently asked questions

What is an AI guardrail?

Why are generic guardrails not enough for production AI?

How does Composo implement runtime guardrails?

What is the latency impact of Composo's guardrails?

Can Composo guardrails block specific tool calls in agent systems?

See what your AI is about to produce.