Open Source · Apache 2.0 · Self-Hosted

Govern AI before it governs your budget.

Name: gatez
Author: gatez

The self-hosted gateway that unifies API traffic, LLM calls, and autonomous agents in one control plane.
Three layers. One audit trail. Zero data leaving your network.

Deploy in 5 Minutes View on GitHub

terminal

$ docker compose up -d

[+] Running 8/8

$ curl localhost:9080/healthcheck

✓ API Gateway ready 50k TPS · P99 < 50ms

✓ AI Gateway ready 599 req/s cache-hit · Rust

✓ Agent Gateway ready MCP + A2A · Rust

Open Source | Built on Apache APISIX | L2 + L3 built in Rust | 13 LLM Providers | 3 layers, 1 docker compose

Your AI stack is three tools duct-taped together.

Platform engineers managing AI workloads are running separate tools for the edge, the LLM layer, and agents. Three auth systems. Three dashboards. Three 2am incidents.

Shadow AI burning budgets

Teams calling LLM APIs with no governance. A runaway prompt loop burns $50K over a weekend. Finance cannot forecast AI spend.

Three systems, zero unification

Kong for the edge. LiteLLM in the middle. A separate agent runtime. Three repos, three auth configs, three observability stacks.

Agents with zero governance

Autonomous agents calling tools with no allowlists, no session budgets, no human approval for high-risk operations. MCP + A2A are here, but who governs them?

There's a better way.

One gateway. Three layers. Shared everything.

Click a layer to explore. All three share tenant_id, Redis, ClickHouse, and OTel traces.

API Gateway — Apache APISIX

Battle-tested L7 proxy handling all inbound traffic. 50k TPS at P99 < 50ms on production hardware. Per-tenant rate limiting, JWT auth, and every request logged to ClickHouse.

●Per-tenant rate limiting — Redis-backed sliding window, bucket key: rl:{tenant_id}:{route}:{window}
●JWT auth + Keycloak — tenant_id extracted from JWT claims, validated before routing
●ClickHouse request logging — async via http-logger, Buffer engine for high-write throughput
●Key auth — service-to-service API key validation with tenant scoping
●Grafana dashboard — request rate, error rate, P99 latency per tenant

L1 performance (local Docker)

Throughput50k TPS

P99 Latency< 50ms

Plugin overhead< 1ms

Stack

APISIX Lua etcd Redis Keycloak

Agent Gateway — Custom Rust (axum + tokio)

Give autonomous agents MCP tool access with guardrails. A2A delegation with policy enforcement. HITL gates for high-risk operations. Session budgets. Tool allowlists.

●MCP protocol — Server registry, tool discovery, JSON-RPC forwarding, schema validation.
●A2A protocol — Agent registry, delegation policies, loop detection, chain depth limits.
●HITL gates — Configurable per-tool, per-tenant. Pending queue with approve/deny API.
●Tool allowlists — Deny-by-default. Per-session. Tenant-scoped. Tool poisoning protection.
●Session budgets — Per-session token limits. Budget check before every tool call.
●Audit trail — Every tool call, A2A hop, and session event logged to ClickHouse.

L3 capabilities

MCP tool forwarding Shipped

A2A delegation + policies Shipped

HITL approval gates Shipped

Tool poisoning detection Shipped

OpenAPI-to-MCP translation Shipped

Multi-tenant tool federation Shipped

Stack

Rust axum tokio Redis ClickHouse

Shared across all layers

tenant_id

Propagated end-to-end from JWT to ClickHouse

Redis

Rate limits, caches, sessions, budgets

ClickHouse

Request logs, AI usage, agent audit trail

OTel + Jaeger

Cross-layer distributed traces

Why developers choose gatez

Not marketing claims. Measured results from real deployments.

Rust Performance

L2 + L3 built with axum + tokio. 599 req/s on cache-hit path. 18ms P50. Zero-copy SSE streaming. No Python wrappers in the hot path.

Multi-Tenant by Default

tenant_id in every API call, every log entry, every cache key. Rate limit buckets, token budgets, tool allowlists — all tenant-isolated from day one. Not bolted on.

Cross-Layer Traces

One span tree from APISIX → AI Gateway → agent tool call. See an HTTP request become an LLM call become a tool invocation. OTel + Jaeger. No competitor has this.

Agent Governance

MCP + A2A protocol support. HITL gates for high-risk tool calls. Tool allowlists (deny-by-default). Session budgets. Tool poisoning detection. A2A loop detection.

Self-Hosted

Docker Compose to start. Kubernetes for production. Air-gap capable. Your data never leaves your infrastructure. HIPAA, GDPR, ISO 42001 ready.

Open Source

Apache 2.0. No "community edition" limitations. No license traps. No open-core bait-and-switch. Every feature ships in the open-source build.

How gatez compares

Not every tool does every job. Here is what each platform actually ships.

Feature	gatez	Kong	Portkey	agentgateway
API Gateway (L1)	✓	✓	✕	✕
AI Gateway (L2)	✓	Partial	✓	✕
Agent Gateway (L3)	✓	✕	✕	✓
All 3 layers unified	✓	✕	✕	✕
Cross-layer traces	✓	✕	✕	✕
Multi-tenant native	✓	Partial	✕	Partial
Self-hosted / air-gap	✓	✓	✕	✓
MCP + A2A protocols	✓	✕	✕	✓
HITL gates	✓	✕	✕	✕
Per-tenant token budgets	✓	Enterprise	✕	✕
Two-portal control plane	✓	Partial	✕	✕
API lifecycle governance	✓	Partial	✕	✕
Open source	✓	Partial	✕	✕

See full comparisons:

vs Kong | vs agentgateway

Simple, transparent pricing

Community

Free forever

L1+L2+L3 engine, Docker Compose, Prometheus, Grafana

Get Started

POPULAR

Pro

Everything in Community + Operator Portal, Developer Portal, SSO, audit export

Contact Sales

Enterprise

Everything in Pro + SCIM, Helm charts, SLA, dedicated support, compliance packs

Contact Sales

See full pricing details →

Running in 60 seconds

Clone, start, call. That request goes through rate limiting (L1), PII redaction (L2), and token budget enforcement (L2). All logged. All traced.

quick-start

# Clone and start the full stack

$ git clone https://github.com/gatez-dev/gatez.git

$ cd gatez && docker compose up -d

# Your first API call through all 3 layers

$ curl -H "Authorization: Bearer $TOKEN" \

-H "x-tenant-id: acme" \

http://localhost:9080/v1/chat/completions \

-d '{"model":"gpt-4o","messages":[{"role":"user","content":"Hello"}]}'

L1 rate-limited, JWT-validated, logged to ClickHouse

L2 PII-redacted, cache-checked, budget-enforced

L3 session-tracked, tool-allowlisted, audit-trailed

Solutions

AI Cost Control

Stop shadow AI from burning your budget. Token budgets, semantic caching, per-tenant cost analytics.

Learn more

Agent Governance

Give agents tools. Keep humans in control. HITL gates, tool allowlists, session budgets, A2A policies.

Learn more

Regulated AI

AI governance that satisfies your auditor. Full air-gap, PII redaction, complete audit trail, no data leaves your network.

Learn more

API Monetization

Meter, limit, and bill every API call per tenant. Self-service developer portal, API key workflows, usage export for billing.

Learn more

Open source. Built in public.

Apache 2.0 licensed. No open-core. No feature gates. Every feature ships in the open-source build. The code is public, so we have no incentive to lie about features.

Apache 2.0

License

3 Layers

All open source

Rust + Lua

Performance-first

Public

Roadmap on GitHub

Star on GitHub Join Discord Contribute

Need production support?

On-prem deployment assistance. Custom Keycloak/SSO integration. SLA guarantees. Architecture review. Dedicated support channel.

Contact for Enterprise Or just deploy it now

Govern AI before it governs your budget.

Your AI stack is three tools duct-taped together.

Shadow AI burning budgets

Three systems, zero unification

Agents with zero governance

One gateway. Three layers. Shared everything.

API Gateway — Apache APISIX

AI Gateway — Custom Rust (axum + tokio)

Agent Gateway — Custom Rust (axum + tokio)

Shared across all layers

Why developers choose gatez

Rust Performance

Multi-Tenant by Default

Cross-Layer Traces

Agent Governance

Self-Hosted

Open Source

How gatez compares

Simple, transparent pricing

Community

Pro

Enterprise

Running in 60 seconds

Solutions

AI Cost Control

Agent Governance

Regulated AI

API Monetization

Open source. Built in public.

Need production support?