TrustGate

module

v0.2.5 Latest Latest Go to latest Published: Jun 26, 2026 License: Apache-2.0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/NeuralTrust/TrustGate

Links

Open Source Insights

README ¶

TrustGate

The high-performance data-plane gateway for LLM and agent traffic — built from scratch in Go

Documentation | Quick Start | Architecture | Community

✨ Features

🚀 High Performance — Built in Go on top of Fiber, tuned for low latency and high concurrency.
🌍 Multi-Provider — First-class adapters for OpenAI, Anthropic, Azure OpenAI, AWS Bedrock, Google Gemini, Vertex AI, Groq, Mistral and DeepSeek.
🧭 Smart Routing & Load Balancing — Round-robin, weighted round-robin and IP-hash strategies with health checks and fallback targets.
🔌 Plugin System — Policy stages with built-in plugins: rate limiting, token rate limiting, request size guard, semantic cache and CORS.
🧠 Semantic Cache — Embedding-based response caching to cut cost and latency on repeated prompts.
🔒 Security & Multi-Tenancy — Per-gateway consumers, API-key auth, and policies scoped globally or per consumer.
📊 Observability — Built-in metrics, request telemetry streamed to Kafka by default, with an opt-in per-gateway OpenTelemetry (OTLP) exporter.
⚙️ Two Independent Planes — Admin and Proxy run as separate processes so you can scale them independently.
☁️ Cloud Agnostic — Single static binary, Docker image and Kubernetes manifests. Deploy anywhere.

🚀 Quick Start

One-line install

Clones the repo, seeds .env, brings up the full stack in Docker and — when Go is installed — compiles the trustgate binary and installs it on your PATH:

curl -fsSL https://raw.githubusercontent.com/NeuralTrust/TrustGate/main/scripts/install.sh | bash

Requires git, docker and Docker Compose (plus Go to build the CLI). Re-running updates the checkout and never overwrites your .env. Useful overrides:

AG_REF=develop — install a different branch/tag/commit
AG_DIR=/path/to/dir — where to clone
AG_BIN_DIR=~/.local/bin — where to install the trustgate CLI
AG_INSTALL_CLI=0 — skip building the CLI · AG_NO_START=1 — skip docker compose up

Using Docker Compose

# Clone the repository
git clone https://github.com/NeuralTrust/TrustGate.git
cd TrustGate

# Copy the env template and adjust as needed
cp .env.example .env

# One command to bring up everything (Postgres, Redis, Kafka, Zookeeper) + admin, proxy & mcp
make up

# Tail the logs / tear everything down
make logs
make down

Then hit the health probes:

curl localhost:8080/healthz       # admin
curl localhost:8081/healthz       # proxy
curl localhost:8082/healthz       # mcp
curl localhost:8080/__/version    # build info (version, commit, build date)

The image is pinned to linux/amd64 because confluent-kafka-go only bundles an amd64 librdkafka; on Apple Silicon the build runs under emulation out of the box.

Local Development

Run the infra in Docker and the binary on your machine so you can attach a debugger:

# 1. Boot the local dev infra (Postgres, Redis, Kafka, Zookeeper)
make compose-up

# 2a. Run admin + proxy together in a single process (simplest, single-node)
make run-all        # applies migrations, starts admin on :8080 and proxy on :8081

# 2b. ...or run each plane in its own terminal (closer to production)
make run-admin      # terminal 1 — applies migrations, starts admin on :8080
make run-proxy      # terminal 2 — applies migrations, starts proxy on :8081
make run-mcp        # terminal 3 — (optional) starts the MCP server on :8082

# 3. Stop the infra (add -v to wipe volumes)
make compose-down

Using Kubernetes

Manifests live under k8s/.

kubectl apply -k k8s/

Run Tests

make test            # unit tests
make test-race       # unit tests with the race detector
make test-cover      # unit tests with coverage profile
make test-functional # functional tests against a real admin server

🧪 Your first request

The Admin plane (:8080) configures gateways, providers and consumers; the Proxy plane (:8081) serves OpenAI-compatible traffic. The proxy resolves the gateway from the X-AG-Gateway-Slug header and the consumer from its X-AG-API-Key. End-to-end, from zero to a forwarded completion:

make up   # admin :8080, proxy :8081 + Postgres/Redis/Kafka

ADMIN="http://localhost:8080"
PROXY="http://localhost:8081"
TOKEN="$ADMIN_TOKEN"   # admin JWT, see "Admin token" below

# 1. Create a gateway (slug becomes its host/subdomain)
GW=$(curl -s -X POST "$ADMIN/v1/gateways" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -d '{"name":"My Gateway","slug":"demo"}')
GW_ID=$(echo "$GW" | jq -r .id); GW_SLUG=$(echo "$GW" | jq -r .slug)

# 2. Register an upstream LLM provider (OpenAI here)
REG=$(curl -s -X POST "$ADMIN/v1/gateways/$GW_ID/registries" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -d '{"name":"openai-primary","provider":"openai",
       "auth":{"type":"api_key","api_key":{"api_key":"'"$OPENAI_API_KEY"'"}}}')
REG_ID=$(echo "$REG" | jq -r .id)

# 3. Create a consumer bound to that registry
CON=$(curl -s -X POST "$ADMIN/v1/gateways/$GW_ID/consumers" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -d '{"name":"my-app","registries":[{"id":"'"$REG_ID"'"}]}')
CON_ID=$(echo "$CON" | jq -r .id); CON_SLUG=$(echo "$CON" | jq -r .slug)

# 4. Mint a consumer API key (returned in cleartext once)
AUTH=$(curl -s -X POST "$ADMIN/v1/gateways/$GW_ID/auths" \
  -H "Authorization: Bearer $TOKEN" -H "Content-Type: application/json" \
  -d '{"name":"my-app-key","type":"api_key"}')
AUTH_ID=$(echo "$AUTH" | jq -r .id); API_KEY=$(echo "$AUTH" | jq -r .api_key)

# 5. Attach the key to the consumer
curl -s -X POST "$ADMIN/v1/gateways/$GW_ID/consumers/$CON_ID/auths/$AUTH_ID" \
  -H "Authorization: Bearer $TOKEN"

# 6. Call the proxy (OpenAI-compatible)
curl -s -X POST "$PROXY/$CON_SLUG/v1/chat/completions" \
  -H "X-AG-Gateway-Slug: $GW_SLUG" -H "X-AG-API-Key: $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"gpt-4o-mini","messages":[{"role":"user","content":"Hello!"}]}'

From an application, point any OpenAI SDK at the proxy — no client changes beyond the base URL and two headers:

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8081/my-app",  # /{consumer_slug}
    api_key="unused",                          # the provider key lives in the gateway
    default_headers={
        "X-AG-Gateway-Slug": "demo",
        "X-AG-API-Key": "<consumer api key>",
    },
)

resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)

Other entrypoints follow the same /{consumer_slug}/... shape: /v1/messages (Anthropic format) and /v1/responses (OpenAI Responses format).

Admin token

The Admin API expects a JWT (HS256) signed with SERVER_SECRET_KEY from your .env. Mint a short-lived one for local use:

export SERVER_SECRET_KEY="$(grep ^SERVER_SECRET_KEY .env | cut -d= -f2-)"
export ADMIN_TOKEN=$(python3 - <<'PY'
import jwt, os, time
secret = os.environ["SERVER_SECRET_KEY"]
print(jwt.encode({"sub": "admin", "iat": int(time.time()), "exp": int(time.time()) + 3600}, secret, algorithm="HS256"))
PY
)

🏗️ Architecture

TrustGate ships a single binary that boots one HTTP server, selected by argv[1] (default: proxy). In production each pod runs one container with the appropriate argument, so the Admin, Proxy and MCP planes scale independently.

./trustgate              # → proxy (default)
./trustgate proxy        # → proxy
./trustgate admin        # → admin
./trustgate mcp          # → mcp (Model Context Protocol server)
./trustgate run          # → admin + proxy together in one process (single-node)

flowchart LR
    subgraph Clients["Clients & Agents"]
        APP["Apps / SDKs / Agents"]
    end

    subgraph AG["TrustGate"]
        direction TB
        ADMIN["Admin Plane :8080\nGateways · Registries · Consumers\nAuth · Policies · Catalog"]
        PROXY["Proxy Plane :8081\nRouting · Load Balancing\nPolicy Stages · Plugins"]
        MCP["MCP Plane :8082\nMCP targets & tools for agents"]
    end

    subgraph Plugins["Policy Plugins"]
        RL["Rate Limit"]
        TRL["Token Rate Limit"]
        RS["Request Size"]
        SC["Semantic Cache"]
        CORS["CORS"]
    end

    subgraph Providers["LLM Providers"]
        P1["OpenAI · Anthropic\nAzure · Bedrock"]
        P2["Gemini · Vertex\nGroq · Mistral"]
    end

    subgraph Infra["Infrastructure"]
        PG[("Postgres")]
        RD[("Redis")]
        KFK[["Kafka"]]
    end

    APP -->|API key| PROXY
    APP -->|MCP| MCP
    PROXY --> Plugins
    PROXY -->|load balance| Providers
    ADMIN -. config .-> PROXY
    ADMIN -. config .-> MCP
    ADMIN --- PG
    PROXY --- PG
    PROXY --- RD
    MCP --- PG
    PROXY -->|telemetry| KFK

Request lifecycle

A client calls the Proxy with a consumer API key.
The gateway resolves the consumer, gateway config and applicable policies.
Policy stages run their plugins (rate limit, token rate limit, request size, semantic cache, CORS).
The load balancer picks a healthy upstream target (round-robin / weighted / IP-hash) with fallback.
The request is forwarded to the selected provider adapter (OpenAI, Anthropic, Bedrock, …), streaming when supported.
The response is returned, the semantic cache is populated, and telemetry is emitted to Kafka.

Planes

Plane	Port	Responsibilities
Admin	`8080`	Gateway, registry, consumer, auth, policy and catalog management. Applies DB migrations.
Proxy	`8081`	Request routing, load balancing, policy & plugin execution, provider forwarding, telemetry.
MCP	`8082`	Model Context Protocol server: exposes registered MCP targets and tools to agents.

🔌 Plugins

Plugins run inside ordered policy stages and can execute sequentially or in parallel.

Plugin	Description
`ratelimit`	Per-consumer / per-gateway request rate limiting.
`tokenratelimit`	Token-based rate limiting for LLM cost control.
`requestsize`	Rejects requests above a configured body size.
`semanticcache`	Embedding-based response caching for repeated prompts.
`cors`	Cross-origin resource sharing for browser clients.

🌍 Providers

Provider	Provider	Provider	Provider
OpenAI	Anthropic	Azure OpenAI	AWS Bedrock
Google Gemini	Vertex AI	Groq	Mistral
DeepSeek

⚙️ Configuration

All configuration is read from environment variables. In development, copy .env.example to .env and godotenv loads it automatically. Production deployments inject env vars directly (Helm values, ECS task definitions, k8s ConfigMap + Secret).

# Server (HTTP listeners)
SERVER_ADMIN_PORT=8080
SERVER_PROXY_PORT=8081
SERVER_MCP_PORT=8082

# Host suffixes returned in the gateway response ({slug}.<base-domain>)
GATEWAY_BASE_DOMAIN=llm.neuraltrust.ai  # proxy plane host
MCP_BASE_DOMAIN=mcp.neuraltrust.ai      # mcp plane host

# Database (Postgres via pgx/pgxpool)
DB_HOST=localhost
DB_PORT=5432
DB_NAME=trustgate

# Redis & Kafka
REDIS_HOST=localhost
KAFKA_BROKERS=localhost:9092

# Telemetry & Metrics
TELEMETRY_ENABLED=true
TELEMETRY_KAFKA_TOPIC=trustgate.requests
METRICS_ENABLED=true

# OTLP exporter defaults (opt-in per gateway; per-gateway settings override these)
OTEL_EXPORTER_OTLP_ENDPOINT=collector:4317
OTEL_EXPORTER_OTLP_PROTOCOL=grpc

See .env.example for the full set with safe defaults.

Telemetry exporters (Kafka default + OTLP opt-in)

Every completed request is turned into a sanitized business event and fanned out to one or more exporters. Kafka is the config-driven default (feeds kafka-connect → ClickHouse → data-plane-api). A gateway can additionally opt into the otlp exporter, which ships the same event to an external OpenTelemetry Collector as a single OTLP log record per request (event.name="gateway.request"); the Collector fans out to any vendor backend. Enabling otlp does not replace Kafka — both fire (explicit exporters merge with the global defaults).

Add it to a gateway's telemetry.exporters:

{
  "telemetry": {
    "exporters": [
      {
        "name": "otlp",
        "settings": {
          "endpoint": "collector:4317",
          "protocol": "grpc",
          "headers": { "authorization": "Bearer <token>" },
          "compression": "gzip",
          "timeout": "10s",
          "insecure": false
        }
      }
    ]
  }
}

With Kafka still configured, the event is delivered to the Collector and Kafka (the ClickHouse path keeps populating with schema_version=2 intact).

otlp settings

Key	Type	Default	Notes
`endpoint`	string	— (required unless env fallback)	`host:port` or full URL of the Collector
`protocol`	string	`grpc`	`grpc` (`:4317`) or `http/protobuf` (`:4318`)
`signal`	string	`logs`	`logs`; `traces` is reserved and rejected
`headers`	map	`{}`	auth/tenant headers
`insecure`	bool	`false`	plaintext, no TLS (cannot combine with `tls`)
`tls`	object	—	`{ ca_file, cert_file, key_file, skip_verify }`
`timeout`	duration	`10s`	export + graceful-shutdown bound; must be `> 0`
`compression`	string	`gzip`	`gzip` or `none`
`max_body_bytes`	int	`4096`	request/response body truncation cap

Any key absent from a gateway's settings falls back to the process-level OTEL_EXPORTER_OTLP_ENDPOINT, OTEL_EXPORTER_OTLP_PROTOCOL, OTEL_EXPORTER_OTLP_HEADERS, OTEL_EXPORTER_OTLP_TIMEOUT, OTEL_EXPORTER_OTLP_INSECURE, and OTEL_EXPORTER_OTLP_COMPRESSION env vars (per-gateway settings win). OTEL_EXPORTER_OTLP_TIMEOUT accepts an integer of milliseconds per the OpenTelemetry spec (e.g. 10000); a Go duration string (e.g. 10s) is also accepted. Settings are validated structurally on gateway create/update with no network I/O; export is non-blocking (bounded queue, drop-on-full) so a slow or unreachable Collector never affects request latency or the Kafka path.

Migrations

Migrations are in-code Go files under pkg/infra/database/migrations/. Each file is named <unix_timestamp>_<snake_name>.go and registers itself via database.RegisterMigration in its init(). The pgx-backed runner commits each migration's DDL plus its migration_version row in a single transaction, applying any pending migrations automatically on boot.

📚 API Docs

The Admin API is fully annotated and ships Swagger 2.0 and OpenAPI 3 specs:

make swagger   # generate docs/swagger.{json,yaml} + docs.go
make openapi   # convert to docs/openapi.json (OpenAPI 3)
make docs      # regenerate everything

Specs live under docs/ (swagger.json, swagger.yaml, openapi.json).

🗂️ Repository layout

cmd/trustgate/         # entry point (single binary: proxy | admin | mcp | run)
pkg/version/           # ldflag-fed build info
pkg/config/            # env-only config loader (.env via godotenv in dev)
pkg/domain/            # domain entities, value objects and port interfaces
pkg/app/               # application services (use cases)
pkg/infra/providers/   # provider adapters (openai, anthropic, bedrock, …)
pkg/infra/plugins/     # policy plugins (ratelimit, semanticcache, …)
pkg/infra/loadbalancer/# routing strategies + health checks
pkg/infra/database/    # pgxpool + in-code Go migrations registry
pkg/infra/telemetry/   # Kafka (default) + OTLP exporters
pkg/api/handler/http/  # per-route HTTP handlers
pkg/server/            # Server interface + admin / proxy routers
pkg/container/         # dig DI container + one module per context

🤝 Contributing

We love contributions! To get started:

Fork the repository
Create your feature branch (git checkout -b feat/my-feature)
Run make lint && make test before committing
Push to your branch and open a Pull Request

📜 License

TrustGate is licensed under the Apache License 2.0 — see the LICENSE file for details.

📫 Community & Support

Made with ❤️ by NeuralTrust

Directories ¶

Path	Synopsis
cmd
trustgate command Command trustgate starts a server selected by argv[1] (default proxy): "admin", "proxy", "mcp", or "run" (admin + proxy together in one process).	Command trustgate starts a server selected by argv[1] (default proxy): "admin", "proxy", "mcp", or "run" (admin + proxy together in one process).
docs Package docs Code generated by swaggo/swag.	Package docs Code generated by swaggo/swag.
pkg
api/handler/http
api/handler/http/auth
api/handler/http/auth/request
api/handler/http/auth/response
api/handler/http/catalog
api/handler/http/catalog/response
api/handler/http/consumer
api/handler/http/consumer/request
api/handler/http/consumer/response
api/handler/http/gateway
api/handler/http/gateway/request
api/handler/http/gateway/response
api/handler/http/helpers
api/handler/http/mcp
api/handler/http/oauth
api/handler/http/playground
api/handler/http/policy
api/handler/http/policy/request
api/handler/http/policy/response
api/handler/http/proxy
api/handler/http/registry
api/handler/http/registry/request
api/handler/http/registry/response
api/handler/http/role
api/handler/http/role/request
api/handler/http/role/response
api/middleware
api/resolver
app/auth
app/auth/mocks
app/catalog
app/catalog/mocks
app/consumer
app/consumer/mocks
app/gateway
app/gateway/mocks
app/identity/sts
app/identity/sts/mocks
app/mcp
app/mcp/mocks
app/metrics
app/metrics/mocks
app/oauth
app/oauth/mocks
app/plugins
app/plugins/mocks
app/policy
app/policy/mocks
app/proxy
app/proxy/mocks
app/registry
app/registry/mocks
app/role
app/role/mocks
app/routing
app/routing/mocks
app/session
app/session/mocks
common/errors
common/secret
common/strutil Package strutil holds small, dependency-free string helpers.	Package strutil holds small, dependency-free string helpers.
config Package config loads configuration from environment variables.	Package config loads configuration from environment variables.
container Package container wraps uber/dig with per-context modules.	Package container wraps uber/dig with per-context modules.
container/modules
domain/auth
domain/auth/mocks
domain/catalog
domain/consumer
domain/consumer/mocks
domain/embedding
domain/embedding/mocks
domain/gateway
domain/gateway/mocks
domain/identity
domain/ids
domain/policy
domain/policy/mocks
domain/registry
domain/registry/mocks
domain/role
domain/role/mocks
domain/routing
domain/session
domain/session/mocks
domain/telemetry
domain/vault
domain/vault/mocks
infra/auth/introspection
infra/auth/jwt
infra/auth/jwt/mocks
infra/auth/mtls
infra/auth/oidc
infra/auth/session
infra/bedrock
infra/bedrock/mocks
infra/cache
infra/cache/cachetest Package cachetest provides cache test doubles shared across app-layer tests.	Package cachetest provides cache test doubles shared across app-layer tests.
infra/cache/channel
infra/cache/event
infra/cache/mocks
infra/cache/semantic Package semantic provides a Redis Stack (RediSearch) vector store used by the semantic cache plugin to look up and persist responses by embedding similarity, scoped per rule via a hashed tag.	Package semantic provides a Redis Stack (RediSearch) vector store used by the semantic cache plugin to look up and persist responses by embedding similarity, scoped per rule via a hashed tag.
infra/cache/semantic/mocks
infra/cache/subscriber
infra/catalog/modelsdev
infra/context
infra/crypto
infra/database
infra/database/migrations Package migrations registers in-code schema migrations via init().	Package migrations registers in-code schema migrations via init().
infra/embedding/factory
infra/embedding/factory/mocks
infra/embedding/openai
infra/identity/sts
infra/loadbalancer
infra/loadbalancer/algorithm
infra/loadbalancer/strategies
infra/logger
infra/mcp/client
infra/metrics
infra/metrics/events
infra/metrics/playground Package playground stores the metrics Event of playground proxy requests in Redis so the dashboard can fetch the trace by TraceID (the X-AG-Trace-Id echoed in the proxy response).	Package playground stores the metrics Event of playground proxy requests in Redis so the dashboard can fetch the trace by TraceID (the X-AG-Trace-Id echoed in the proxy response).
infra/oauth
infra/plugins/azurecontentsafety
infra/plugins/bedrockguardrail
infra/plugins/cors Package cors implements a PreRequest CORS plugin that validates the request origin, applies CORS response headers, and short-circuits preflight requests.	Package cors implements a PreRequest CORS plugin that validates the request origin, applies CORS response headers, and short-circuits preflight requests.
infra/plugins/costcap
infra/plugins/llmcost Package llmcost holds the model-pricing primitives shared by the LLM budget and cost cap plugins: glob matching of model slugs, price resolution, model downgrade, and the stateless cost cap decision engine.	Package llmcost holds the model-pricing primitives shared by the LLM budget and cost cap plugins: glob matching of model slugs, price resolution, model downgrade, and the stateless cost cap decision engine.
infra/plugins/modelallowlist
infra/plugins/openaimoderation
infra/plugins/pertoolratelimit
infra/plugins/pluginutil
infra/plugins/prompttemplate
infra/plugins/ratelimit
infra/plugins/requestsize
infra/plugins/semanticcache Package semanticcache implements a semantic response cache: on PreRequest it serves a cached response when a semantically similar request is found, and on PostResponse it stores successful upstream responses for future hits.	Package semanticcache implements a semantic response cache: on PreRequest it serves a cached response when a semantically similar request is found, and on PostResponse it stores successful upstream responses for future hits.
infra/plugins/tokenratelimit
infra/plugins/tool_call_validation
infra/plugins/toolallowlist
infra/plugins/tooltransform
infra/plugins/trustguard
infra/providers
infra/providers/adapter
infra/providers/anthropic
infra/providers/azure
infra/providers/bedrock
infra/providers/deepseek
infra/providers/factory
infra/providers/factory/mocks
infra/providers/google
infra/providers/groq
infra/providers/mistral
infra/providers/mocks
infra/providers/openai
infra/providers/openaicompat Package openaicompat implements a provider client for arbitrary OpenAI-compatible Chat Completions endpoints (Together, Fireworks, vLLM, Ollama, self-hosted gateways, ...).	Package openaicompat implements a provider client for arbitrary OpenAI-compatible Chat Completions endpoints (Together, Fireworks, vLLM, Ollama, self-hosted gateways, ...).
infra/providers/vertex
infra/repository/auth
infra/repository/catalog
infra/repository/consumer
infra/repository/gateway
infra/repository/policy
infra/repository/registry
infra/repository/role
infra/repository/session
infra/repository/vault
infra/telemetry
infra/telemetry/kafka
infra/telemetry/otlp
infra/trace
server Package server hosts the shared HTTP server lifecycle for admin and proxy.	Package server hosts the shared HTTP server lifecycle for admin and proxy.
server/router Package router declares the ServerRouter contract wired into BaseServer.	Package router declares the ServerRouter contract wired into BaseServer.
version Package version exposes build metadata injected via ldflags.	Package version exposes build metadata injected via ldflags.
seed
mcp-catalog Package mcpcatalog embeds the curated catalog of enterprise remote MCP servers so it ships inside the binary as the single source of truth for the MCP servers catalog endpoint.	Package mcpcatalog embeds the curated catalog of enterprise remote MCP servers so it ships inside the binary as the single source of truth for the MCP servers catalog endpoint.

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL