NeuralTrust | Platform for Agent Security.

TrustGate ships a single binary (trustgate) that boots one HTTP server, chosen by its first argument. In production each pod runs the same image with a different argument, so the planes scale independently.

./trustgate          # → proxy (default)
./trustgate admin    # → admin
./trustgate mcp      # → MCP server
./trustgate run      # → admin + proxy together (single-node)

The three planes

Plane	Port	Responsibility
Admin	`8080`	REST CRUD for gateways, registries, consumers, auth, policies, roles, and catalogs. Applies DB migrations on boot.
Proxy	`8081`	Request routing, auth validation, policy execution, load balancing, provider forwarding, streaming, telemetry.
MCP	`8082`	Model Context Protocol server: exposes registered MCP servers and tools to agents, with an OAuth2 authorization server.

The Admin plane is the control plane (you configure it); the Proxy and MCP planes are the data plane (your traffic flows through them).

Request lifecycle (proxy)

A client calls the proxy with a consumer API key (or OAuth2/OIDC token, or client cert).
TrustGate resolves the gateway (from X-AG-Gateway-Slug or the host), the consumer (from the URL slug), and the applicable policies.
The applicable policies run at their stages — rate limit, LLM budget, request size, semantic cache, guardrails — sequentially or in parallel.
The load balancer picks a healthy registry from the consumer’s pool (round-robin, weighted, least-connections, random, or semantic), with fallback.
The request is forwarded to the selected provider adapter (OpenAI, Anthropic, Bedrock, …), streaming when the client asked for it.
The response returns, the semantic cache is populated, and a telemetry event is exported over OTLP.

Client ─▶ Proxy :8081
   │  middleware: request-id · CORS · access-log · metrics · recover · security-headers
   │  auth: API-key hash compare · OAuth2 / OIDC JWT · mTLS
   ├─ resolve gateway (header/subdomain) → consumer (slug) → policies
   ├─ policies run at their stages (parallel or sequential)
   ├─ load balancer → registry (+ fallback chain)
   ├─ provider adapter → upstream (stream or full)
   └─ response ─▶ client          telemetry ─▶ OTLP

Gateway discovery

GATEWAY_DISCOVERY_MODE controls how the proxy finds the gateway:

header (default, self-managed) — reads the X-AG-Gateway-Slug header, falling back to a Host match against {slug}.<GATEWAY_BASE_DOMAIN>.
subdomain (cloud) — Host-only.

Infrastructure

Component	Role
PostgreSQL	Source of truth for all configuration (gateways, registries, consumers, …). The Admin plane runs migrations on boot.
Redis	Rate-limit counters, the semantic-cache vector store, session store, and a pub/sub channel for cache invalidation.

Telemetry is exported asynchronously over OTLP, off the request critical path, so a slow or unavailable collector never adds latency to a user request. See Telemetry.

Caching & invalidation

To avoid a database round-trip per request, the proxy keeps an in-process TTL cache (CACHE_LOCAL_TTL, default 5m) of resolved gateways, consumers, and auths. Admin mutations publish invalidation events over Redis pub/sub, and the proxy flushes the affected entries — so config changes propagate without a restart.

Endpoints the proxy serves

All proxy traffic is shaped as /{consumer_slug}/..., and the inbound format is detected from the path:

Path	Format
`POST /{consumer_slug}/v1/chat/completions`	OpenAI Chat Completions
`POST /{consumer_slug}/v1/messages`	Anthropic Messages
`POST /{consumer_slug}/v1/responses`	OpenAI Responses API
`POST /{consumer_slug}/v1beta/models/{model}:generateContent` (and `:streamGenerateContent`)	Google Gemini

The inbound format is chosen by the path, independent of the upstream provider — TrustGate adapts between formats, so an OpenAI-format client can be routed to an Anthropic or Gemini upstream. Any other path returns 404. Streaming ("stream": true, or the Gemini :streamGenerateContent path) is supported on all routes; the proxy flushes each SSE chunk and surfaces mid-stream upstream failures as an explicit error event rather than a silent truncation.

Repository layout

TrustGate follows a hexagonal layout — domain entities and ports in pkg/domain, use-cases in pkg/app, and adapters in pkg/infra (providers, policies, load balancer, database, telemetry). Configuration is environment-only; see Configuration.

Introduction

Getting started

Core concepts

Routing

Policies

MCP

Observability

Operate

Admin API

API reference

Architecture

The three planes

Request lifecycle (proxy)

Gateway discovery

Infrastructure

Caching & invalidation

Endpoints the proxy serves

Repository layout

​The three planes

​Request lifecycle (proxy)

​Gateway discovery

​Infrastructure

​Caching & invalidation

​Endpoints the proxy serves

​Repository layout

The three planes

Request lifecycle (proxy)

Gateway discovery

Infrastructure

Caching & invalidation

Endpoints the proxy serves

Repository layout