Guardrails 2.0

Keep agents on-brand and compliant at scale

With pre-built protections and custom rules that run in real time with minimal added latency, Guardrails 2.0 makes it easy to control how agents behave in production.

Get started with Guardrails

Available for all ElevenAgents users

Minimal latency impact

AIUC-1 certified

Trusted by 7,500,000+ enterprises and creators

Enterprise-grade safety, ultra-low latency

As voice agents handle more complex, higher-impact workflows, a single unintended response can erode user trust, create compliance exposure, or trigger costly escalations. Guardrails 2.0 is designed to catch those responses before they reach the user.

Real-time response with minimal latency

Checks run in parallel with response generation

Validates every agent response against your policies

Two layers of protection

Proactive guidance

Guardrails harden the system prompt with additional instructions that keep agents anchored to their role, especially during longer or more complex calls where drift is most likely.

Real-time enforcement

Separate checks run on every response before it reaches the user. If a violation is detected, the response is blocked and the system takes the action you've defined: end, retry, or escalate.

You choose what happens next

When a custom Guardrail fires, you decide the outcome.

Custom Guardrail triggered

End the call

Conversation terminates immediately and the call is dropped

Deliver fallback

Agent replies with a preset message, then retries, ends the call, or continues

Escalate to human

Call is forwarded to a live human agent for manual handling

Every trigger is logged in your analytics dashboard with details on which Guardrail fired and why
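
As a rough illustration of the flow above, here is a minimal Python sketch of routing a triggered Custom Guardrail to one of the three outcomes and logging it. This is not the ElevenAgents API: the `Outcome` enum, the `Trigger` fields, the `handle_trigger` function, and the fallback wording are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List


class Outcome(Enum):
    # The three outcomes described above; the identifiers are hypothetical.
    END_CALL = "end_call"
    DELIVER_FALLBACK = "deliver_fallback"
    ESCALATE_TO_HUMAN = "escalate_to_human"


@dataclass
class Trigger:
    guardrail: str  # which Guardrail fired
    reason: str     # why it fired


def handle_trigger(trigger: Trigger, outcome: Outcome, log: List[Dict[str, str]]) -> str:
    """Record the trigger, then return a description of what happens next."""
    log.append({
        "guardrail": trigger.guardrail,
        "reason": trigger.reason,
        "outcome": outcome.value,
    })
    if outcome is Outcome.END_CALL:
        return "call ended"
    if outcome is Outcome.DELIVER_FALLBACK:
        # Assumed fallback wording, for illustration only.
        return "fallback: I can't help with that request."
    return "transferred to a live agent"


audit_log: List[Dict[str, str]] = []
result = handle_trigger(
    Trigger(guardrail="no-competitor-names", reason="response named a competitor"),
    Outcome.ESCALATE_TO_HUMAN,
    audit_log,
)
```

Logging before dispatching means every trigger leaves a record regardless of which outcome is configured.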

Configure your Guardrails

Pre-built protections and custom control

Focus

Keep agents on-topic in long, complex interactions. Detects drift from the agent's intended topic, role, or tone and reinforces the original intent.

Content

Ensure appropriate responses. Prevents disallowed content in agent responses before delivery, including sexually explicit or violent material.

Manipulation

Protect against bad actors and prompt injection. Blocks attempts to bypass system instructions, including overrides and adversarial attacks.

Try Guardrails today

Retail

Output

Never mention competitors by name, make unauthorized price exceptions, or issue refunds without confirmed eligibility.

8 Active

Healthcare

Output

Never give medical advice, diagnoses, or treatment recommendations. Book appointments and escalate clinical questions only.

3 Active

Finance

Output

Never recommend specific investment products, provide tax guidance, or offer individualized financial planning advice.

18 Active

Each custom rule is evaluated by a lightweight LLM running in parallel.
Configure as many as you need, and toggle them on or off without deleting them.
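
To make the toggle behavior concrete, here is a minimal Python sketch of a rule set where a disabled rule stays configured but is skipped during evaluation. Everything here is an assumption for illustration: `CustomRule`, `violated_rules`, and `keyword_judge` are hypothetical names, and the keyword check is only a stand-in for the LLM evaluator. The sketch also evaluates rules one after another for simplicity, rather than in parallel.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CustomRule:
    name: str
    policy: str           # the rule, written in natural language
    enabled: bool = True  # a disabled rule stays configured but is skipped


def violated_rules(
    response: str,
    rules: List[CustomRule],
    judge: Callable[[str, str], bool],
) -> List[str]:
    """Return the names of enabled rules that the judge says the response violates."""
    return [r.name for r in rules if r.enabled and judge(r.policy, response)]


def keyword_judge(policy: str, response: str) -> bool:
    # Stand-in for an LLM evaluator: flags a response only when both the
    # policy and the response mention refunds.
    return "refund" in policy.lower() and "refund" in response.lower()


rules = [
    CustomRule("refund-eligibility", "Never issue refunds without confirmed eligibility."),
    CustomRule("no-competitors", "Never mention competitors by name."),
]

response = "Sure, I have issued a full refund to your card."
before_toggle = violated_rules(response, rules, keyword_judge)

rules[0].enabled = False  # toggle off without deleting
after_toggle = violated_rules(response, rules, keyword_judge)
```

After the toggle the rule is still in `rules`, so setting `enabled` back to `True` restores it with its policy text intact.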

Automatically redact sensitive information

Before redaction

After a call ends, automatically detect and redact selected sensitive information from your transcripts, recordings, and webhook payloads.

After redaction

Detected entities are replaced with typed placeholders in text and bleeped in audio, keeping conversation context for review and future analysis.
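
For a sense of what typed placeholders look like in text, here is a minimal Python sketch. It assumes regex-based detection and a `[TYPE]` placeholder format, neither of which is confirmed by the product description. The `PATTERNS` table and the `redact` function are hypothetical, and the patterns are deliberately simple.

```python
import re
from typing import Dict, List, Pattern

# Hypothetical detection patterns, for illustration only.
PATTERNS: Dict[str, Pattern[str]] = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    # 13 to 16 digits, optionally separated by single spaces or hyphens.
    "CARD_NUMBER": re.compile(r"\b\d(?:[ -]?\d){12,15}\b"),
}


def redact(text: str, entity_types: List[str]) -> str:
    """Replace each selected entity type with a typed placeholder such as [EMAIL]."""
    for entity in entity_types:
        text = PATTERNS[entity].sub(f"[{entity}]", text)
    return text


transcript = "My email is jane@example.com and my card is 4111 1111 1111 1111."
fully_redacted = redact(transcript, ["EMAIL", "CARD_NUMBER"])
email_only = redact(transcript, ["EMAIL"])
```

Because the placeholder carries the entity type, a reviewer can still tell that the caller shared an email address and a card number without seeing either value.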

Part of a broader safety foundation

Guardrails 2.0 supports enterprise deployments of ElevenAgents, alongside pre-production safeguards, post-deployment monitoring, and access to the industry's first AI insurance policies, backed by AIUC-1 certification.

Commonly asked questions

Do Guardrails add latency?

Minimal. Guardrails run concurrently with response generation and typically complete before the full response is ready to deliver. In most cases, users notice no delay at all.
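
One way to picture why concurrent checks add little delay is the Python sketch below, which uses `asyncio`. It is an illustration under stated assumptions, not the actual implementation: the sentence-by-sentence streaming, the `generate` and `check` stand-ins, and the sleep timings are all hypothetical.

```python
import asyncio
from typing import AsyncIterator, List, Optional


async def generate(sentences: List[str]) -> AsyncIterator[str]:
    """Stand-in for a streaming model: yields the response sentence by sentence."""
    for sentence in sentences:
        await asyncio.sleep(0.01)  # simulated generation time
        yield sentence


async def check(sentence: str) -> bool:
    """Stand-in policy check: passes unless the sentence mentions a refund."""
    await asyncio.sleep(0.005)  # simulated check time
    return "refund" not in sentence.lower()


async def guarded_response(sentences: List[str]) -> Optional[str]:
    """Return the full response if every check passes, otherwise None (blocked)."""
    checks = []
    produced = []
    async for sentence in generate(sentences):
        produced.append(sentence)
        # Start the check right away; it runs while the next sentence is generated.
        checks.append(asyncio.create_task(check(sentence)))
    results = await asyncio.gather(*checks)
    return " ".join(produced) if all(results) else None


allowed = asyncio.run(guarded_response(["Your order ships Monday.", "Anything else?"]))
blocked = asyncio.run(guarded_response(["I have issued a full refund."]))
```

In this sketch each check overlaps with generation of the following sentence, so only the check on the final sentence can add waiting time once generation finishes.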

Are Guardrails on by default?

Pre-built Guardrails are currently off by default, so there’s no impact to existing agents when you upgrade. We recommend enabling them (especially Focus) for any production deployment. Soon, Guardrails will be on by default for new agents. You can toggle any individual Guardrail on or off at any time from your agent’s Security tab.

Do they cost money?

Pre-built Guardrails (Focus, Content, Manipulation) are included at no additional cost. Custom Guardrails are usage-based and costs are passed through like other LLM costs. You can also choose which model evaluates your rules.

What’s the difference between a Custom Guardrail and a system prompt rule?

Your system prompt guides your agent’s behavior by telling it what to do and how to respond. A Custom Guardrail independently evaluates every agent response against your rule after the model generates it, and blocks violations before they reach the user. Think of your system prompt as instructions and Guardrails as enforcement.

For your most critical policies, we recommend using both: the system prompt and Focus Guardrail shape behavior, and the Custom Guardrail catches anything that slips through, especially in long conversations where models are more likely to drift.

What happens to the user when a Guardrail triggers?

It depends on the Guardrail type and how you’ve configured it. For Custom Guardrails, you choose the exit strategy: end the conversation, transfer to another agent, or escalate to a human. For pre-built Guardrails (Focus, Content, Manipulation), the conversation currently terminates when triggered. Configurable exit strategies for these are coming soon. In all cases, users can start a new conversation immediately. The Guardrail blocks a specific response, not the user.

How do I know which Guardrail triggered?

Every trigger is logged in your conversation analytics. You’ll see which Guardrail fired, why, and the conversation context. Use this to review false positives and refine your rules over time.

Can I use Guardrails in regulated industries like healthcare or finance?

Yes. Guardrails are purpose-built for high-stakes deployments. Custom Guardrails let you define domain-specific policies in natural language, such as “do not provide medical diagnoses” or “do not recommend specific investments.” These rules are enforced independently across every conversation, helping reduce compliance exposure without requiring custom infrastructure.

Guardrails 2.0 also supports AIUC-1 compliance alignment and access to the industry’s first AI insurance policies, making it easier to get security and legal teams more comfortable with production deployments. While Guardrails significantly reduce risk, they work best as part of a broader compliance strategy rather than a standalone solution.

Does sensitive data get stored in conversation logs?

By default, conversation logs include transcript and audio data to support analytics, QA, and agent improvement. If you need to limit data exposure, Conversation History Redaction automatically removes selected sensitive information (such as names, payment card numbers, or other PII) from transcripts, recordings, and webhook payloads before they’re stored. Text is replaced with typed placeholders and audio is bleeped. You control exactly which entity types get redacted. Conversation History Redaction is available to enterprise clients.

Should I use Guardrails AND system prompt hardening?

Yes, and this is the recommended setup. System prompt hardening guides the agent toward the right responses. Guardrails independently enforce your rules as a safety net, so even if the model drifts in a long conversation, violations are caught before delivery. Together they create defense in depth.

Can I turn them off?

Yes. Each Guardrail can be toggled on or off individually. For most production deployments, we recommend keeping all Guardrails enabled (especially Focus). In some cases, a specific Guardrail may conflict with your agent’s intended use case. When in doubt, test before disabling.

Ready to deploy safer, more reliable agents?

Try Guardrails today in minutes and gain enterprise-grade control over your AI agents.

Get started with Guardrails