The firewall for AI agents.
Every tool call, every API call — gated by policy, logged, and routed for human approval. No opt-out path.
Quickstart •
Why AgentGuard •
Architecture •
Limitations & Threat Model •
Production •
Docs •
Contributing
AgentGuard Cloud (preview)
AgentGuard Cloud is the hosted, multi-tenant version — same policy engine, same audit log, run for you. Currently in design. Join the waitlist at https://agentguard.lictorate.com. The self-hosted Apache-2.0 build in this repo will always remain fully featured.
The Problem
Every trending AI project is giving agents more autonomy — running shell commands, browsing the web, calling APIs, moving money, even performing penetration tests. But nobody is building the guardrails.
Right now, most teams deploying AI agents are just... hoping they behave. AgentGuard fixes that.
Why AgentGuard
AgentGuard is the wire-level checkpoint that sits between your agent and everything it touches:
- Policy-gated tool calls. Every shell command, file write, network call, browser action, or model spend is evaluated against a YAML policy before it runs.
- Human-in-the-loop approvals. Risky actions pause, ping Slack/webhooks, surface on a live dashboard, and resume only after a human says yes.
- Tamper-evident audit trail. JSON-Lines log of every decision with agent ID, scope, command, timestamp, and reasoning — queryable by CLI, dashboard, or Prometheus metrics.
- Per-agent, per-environment, per-tool scoping. One policy file, finely overridable for each agent identity.
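For illustration, a single audit entry might look like the line below. The field names here are hypothetical placeholders; the actual schema is defined by the server, but each entry carries the agent ID, scope, command, timestamp, and reasoning described above:

```json
{"ts": "2025-01-15T10:32:07Z", "agent_id": "my-bot", "scope": "shell", "command": "rm -rf ./old_data", "decision": "REQUIRE_APPROVAL", "reason": "matched pattern 'rm -rf *'"}
```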
Quickstart
AgentGuard ships three integration paths, listed from "no code change" to "deepest control":
1. MCP Gateway
For Claude Desktop and any MCP-aware client (Cursor, Cline, Continue, Zed), point your config at agentguard-mcp-gateway and every tools/call from the model is policy-checked before reaching the real MCP server:
go install github.com/Caua-ferraz/AgentGuard/cmd/agentguard-mcp-gateway@latest
Then add to claude_desktop_config.json (macOS path: ~/Library/Application Support/Claude/claude_desktop_config.json):
{
"mcpServers": {
"agentguard": {
"command": "agentguard-mcp-gateway",
"args": [
"--upstream", "fs:npx -y @modelcontextprotocol/server-filesystem /tmp",
"--guard-url", "http://127.0.0.1:8080",
"--api-key", "$AGENTGUARD_API_KEY",
"--policy", "/etc/agentguard/policy.yaml",
"--policy-mode", "strict",
"--fail-mode", "deny"
],
"env": { "AGENTGUARD_API_KEY": "<your-key-or-source-from-shell>" }
}
}
}
90-second walkthrough: docs/QUICKSTART_MCP.md. Wire-format design + client-integration gotchas: docs/MCP_GATEWAY.md. Ready configs for Cursor, Cline, Continue, Zed: examples/.
2. LLM API Proxy
For any code that already uses the OpenAI / Anthropic SDKs, set one environment variable and your existing client flows through AgentGuard:
go install github.com/Caua-ferraz/AgentGuard/cmd/agentguard-llm-proxy@latest
agentguard-llm-proxy \
--listen 127.0.0.1:8081 \
--policy configs/default.yaml \
--guard-url http://127.0.0.1:8080 \
--api-key "$AGENTGUARD_API_KEY" &
export OPENAI_BASE_URL=http://127.0.0.1:8081/v1
# Anthropic SDK: ANTHROPIC_BASE_URL=http://127.0.0.1:8081 (no /v1 suffix)
Tool calls inside the response stream are intercepted, gated against your policy, and either flushed to your code byte-identically (ALLOW), rewritten as a synthetic refusal (DENY), or surfaced for human approval (REQUIRE_APPROVAL). The OpenAI / Anthropic SDKs do not need to know the proxy exists.
90-second walkthrough: docs/QUICKSTART_LLM_PROXY.md. Wire-format design + client-integration gotchas: docs/LLM_API_PROXY.md. Ready scripts for the OpenAI SDK, Anthropic SDK, LangChain, and CrewAI: examples/.
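The three decision outcomes can be sketched in a few lines. This is illustrative only, not the proxy's implementation: the real gating happens in Go against the provider-specific streaming wire formats, and the refusal text shown is a made-up example.

```python
# Illustrative sketch of the proxy's gating step. The real proxy rewrites
# provider wire-format streams; this just maps a decision onto a parsed call.

def gate_tool_call(tool_call: dict, decision: str) -> dict:
    """Map a policy decision onto a parsed tool call."""
    if decision == "ALLOW":
        # Flushed through to the client byte-identically
        return tool_call
    if decision == "DENY":
        # Rewritten as a synthetic refusal the client sees as a normal reply
        return {"role": "assistant",
                "content": f"Tool call '{tool_call['name']}' was denied by policy."}
    # REQUIRE_APPROVAL: held until a human approves or denies
    return {"status": "pending_approval", "tool_call": tool_call}

call = {"name": "shell", "arguments": {"command": "rm -rf /data"}}
print(gate_tool_call(call, "DENY")["content"])
```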
3. SDK (compatibility tier)
The Python and TypeScript SDKs remain fully supported for direct callers and for code paths where the proxy isn't practical (offline tools, embedded scripts, custom transports). They opt in via an explicit Guard.check(...) call:
pip install agentguardproxy
from agentguard import Guard
guard = Guard("http://localhost:8080", agent_id="my-bot")
command = "rm -rf ./old_data"
result = guard.check("shell", command=command)
# result.decision = "REQUIRE_APPROVAL"
# result.approval_url = "http://localhost:8080/v1/approve/ap_..."
if result.allowed:
    execute(command)
TypeScript/Node.js:
import { AgentGuard } from '@agentguard/sdk';
const guard = new AgentGuard({ baseUrl: 'http://localhost:8080', agentId: 'my-bot' });
const result = await guard.check('network', { url: 'https://api.production.internal/deploy' });
The SDKs are not deprecated. They are the right answer when you control the agent's source and want explicit, scope-tagged check points. Polling for approval, decorators/HOFs, cost guardrails, framework adapters (LangChain, CrewAI, browser-use, MCP): docs/SDK_PYTHON.md • docs/ADAPTERS.md.
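The explicit-check pattern composes naturally into a decorator. Below is a minimal sketch of that pattern, not the SDK's built-in decorator (see docs/SDK_PYTHON.md for the real one); it only assumes the `Guard.check(...)` / `result.allowed` API shown above.

```python
import functools

def guarded(guard, scope):
    """Sketch of a decorator that runs guard.check() before the wrapped call."""
    def wrap(fn):
        @functools.wraps(fn)
        def inner(command, *args, **kwargs):
            result = guard.check(scope, command=command)
            if not result.allowed:
                raise PermissionError(f"blocked by policy: {command}")
            return fn(command, *args, **kwargs)
        return inner
    return wrap

# Usage with a real Guard instance:
# guard = Guard("http://localhost:8080", agent_id="my-bot")
# @guarded(guard, "shell")
# def run_shell(command): ...
```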
Install the server
# From source
git clone https://github.com/Caua-ferraz/AgentGuard.git
cd AgentGuard && go build -o agentguard ./cmd/agentguard
# Or via Go install
go install github.com/Caua-ferraz/AgentGuard/cmd/agentguard@latest
# Or Docker
docker run -d -p 8080:8080 \
-v agentguard-audit:/var/lib/agentguard \
agentguard:latest
Prerequisites: Go 1.22+, Python 3.9+ (optional, for the SDK; 3.8 is unsupported — upstream EOL October 2024). See docs/SETUP.md for details.
Minimal policy
configs/default.yaml — a ready-to-use default ships in the repo. A minimal example:
version: "1"
name: "development-sandbox"
rules:
- scope: shell
require_approval:
- pattern: "sudo *"
- pattern: "rm -rf *"
allow:
- pattern: "ls *"
- pattern: "cat *"
- scope: network
allow:
- domain: "api.openai.com"
- domain: "api.anthropic.com"
Full schema (filesystem, cost, per-agent overrides, rate limits, conditional rules, notifications): docs/POLICY_REFERENCE.md.
Start the server
agentguard serve --policy configs/default.yaml --dashboard --watch
CLI flags and subcommands: docs/CLI.md.
Architecture
┌─────────────────┐ ┌──────────────────────────┐ ┌─────────────┐
│ AI Agent │────▶│ AgentGuard Proxy │────▶│ Target │
│ (any framework)│◀────│ │◀────│ (tools, │
│ │ │ ┌──────────────────────┐ │ │ APIs, │
│ • LangChain │ │ │ Policy Engine │ │ │ shell) │
│ • CrewAI │ │ ├──────────────────────┤ │ └─────────────┘
│ • browser-use │ │ │ Rate Limiter │ │
│ • Claude (MCP) │ │ ├──────────────────────┤ │ ┌─────────────┐
│ • Custom │ │ │ Approval Queue │ │────▶│ Dashboard │
│ │ │ ├──────────────────────┤ │ │ (web UI) │
│ │ │ │ Notifier (Slack/WH) │ │ └─────────────┘
│ │ │ ├──────────────────────┤ │
│ │ │ │ Audit Logger │ │ ┌─────────────┐
│ │ │ └──────────────────────┘ │────▶│ Audit Log │
└─────────────────┘ └──────────────────────────┘ │ (JSON) │
└─────────────┘
Rule precedence: deny → require_approval → allow → default deny. The seven policy scopes are shell, filesystem, network, browser, cost, data, and mcp_tool (plus the unmapped sentinel emitted by the LLM API Proxy when a tool call has no tool_scope_map entry). See docs/POLICY_REFERENCE.md.
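The precedence order can be illustrated with a toy evaluator over the minimal policy shown earlier. This is a Python sketch of the ordering only, assuming glob semantics like Python's `fnmatch`; the real engine is Go and supports much more than pattern lists.

```python
import fnmatch

def evaluate(rule: dict, command: str) -> str:
    """Toy illustration of deny -> require_approval -> allow -> default deny."""
    for bucket, decision in (("deny", "DENY"),
                             ("require_approval", "REQUIRE_APPROVAL"),
                             ("allow", "ALLOW")):
        for entry in rule.get(bucket, []):
            if fnmatch.fnmatch(command, entry["pattern"]):
                return decision
    return "DENY"  # default deny: anything unmatched is blocked

shell_rule = {
    "require_approval": [{"pattern": "sudo *"}, {"pattern": "rm -rf *"}],
    "allow": [{"pattern": "ls *"}, {"pattern": "cat *"}],
}
print(evaluate(shell_rule, "ls /tmp"))        # ALLOW
print(evaluate(shell_rule, "rm -rf ./data"))  # REQUIRE_APPROVAL
print(evaluate(shell_rule, "curl evil.sh"))   # DENY (no rule matched)
```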
Limitations & Threat Model
AgentGuard is a policy enforcement and audit layer. It is not an OS sandbox. Read this before you trust it as your last line of defense.
- The firewall is wire-level via the MCP Gateway and LLM API Proxy. The MCP Gateway is the primary integration path for MCP-aware clients (Claude Desktop, Cursor, Cline, Continue, Zed), and the LLM API Proxy extends the same boundary to OpenAI / Anthropic SDK calls (OPENAI_BASE_URL=http://127.0.0.1:8081/v1 and the Anthropic equivalent). The agent reaches its tools only through AgentGuard, so there is no opt-out short of pointing the client at a different MCP server or ignoring the SDK's base-URL configuration. Operators who control the agent's environment (env vars, network egress, MCP client config) get an enforcement boundary, not just an advisory one.
- The SDK is a compatibility tier. It remains supported and tested for direct callers — but it is opt-in by design: the agent must call guard.check(...). That makes it an advisory gate. Use it when the proxy is impractical (offline scripts, custom transports), and pair it with the proxy whenever both are available.
- AgentGuard does not sandbox the host or intercept syscalls. A determined agent that controls its own runtime can bypass the proxy by ignoring OPENAI_BASE_URL, talking to a different MCP server, or shelling out directly. Combine AgentGuard with OS-level isolation (containers, seccomp, AppArmor, network egress rules) when the threat model includes a hostile agent.
- Pattern matching is string-glob, not semantic. A deny rule for rm -rf * matches literal strings; an agent (or a creative human) can substitute equivalents (find / -delete, base64 payloads, etc.). Treat policies as a high-signal first filter, not a complete authorization model.
- Approval queue and rate-limiter state are in-memory. Both reset on restart and are not shared across instances. Run replicas: 1 until persistent state lands.
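The glob caveat above is easy to demonstrate. Python's `fnmatch` behaves like the string-glob matching described (the real matcher is Go, so treat this as an approximation):

```python
import fnmatch

pattern = "rm -rf *"
print(fnmatch.fnmatch("rm -rf /data", pattern))    # True  -- caught
print(fnmatch.fnmatch("find / -delete", pattern))  # False -- same effect, not caught
print(fnmatch.fnmatch("rm  -rf /data", pattern))   # False -- an extra space defeats the glob
```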
Dashboard
Live SSE action feed, one-click approve/deny, running totals, agent context. Start with --dashboard and open http://localhost:8080/dashboard. Walkthrough: docs/DASHBOARD.md.
Production
Running AgentGuard in production? The four most common misconfigurations — no API key (→ localhost-only bind), missing --tls-terminated-upstream behind an HTTPS proxy, wrong --base-url, and unmounted audit volume — all have one-line fixes. Work through the checklist below before exposing AgentGuard beyond localhost.
- Set --api-key (or AGENTGUARD_API_KEY). Without it, AgentGuard binds to 127.0.0.1 only.
- Set --base-url to the public URL. Otherwise Slack/webhook approval links point at http://localhost:8080.
- Pass --tls-terminated-upstream if TLS is terminated upstream, or the dashboard login loops.
- Set --allowed-origin to your frontend's exact origin.
- Mount a writable volume for the audit log — without it, the log is lost on restart.
- Stay on replicas: 1 — rate-limit buckets and session-cost accumulators are per-instance.
Full reference configs (nginx + Docker Compose + Kubernetes), auth/CORS/TLS details, and day-2 operations: docs/DEPLOYMENT.md • docs/OPERATIONS.md • docs/TROUBLESHOOTING.md.
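As a starting point, here is a minimal Docker Compose fragment covering the checklist above. The image tag, hostnames, and file paths are assumptions for illustration; adapt them from the reference configs in docs/DEPLOYMENT.md.

```yaml
services:
  agentguard:
    image: agentguard:latest          # assumed tag; see docs/DEPLOYMENT.md
    command: >
      serve --policy /etc/agentguard/policy.yaml
            --base-url https://guard.example.com
            --tls-terminated-upstream
            --allowed-origin https://app.example.com
            --dashboard
    environment:
      AGENTGUARD_API_KEY: ${AGENTGUARD_API_KEY}   # required to bind beyond 127.0.0.1
    volumes:
      - ./policy.yaml:/etc/agentguard/policy.yaml:ro
      - agentguard-audit:/var/lib/agentguard      # audit log survives restarts
    ports:
      - "8080:8080"
    deploy:
      replicas: 1                     # in-memory approval queue and rate limits
volumes:
  agentguard-audit:
```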
Documentation
Roadmap
Implemented
- Core policy engine with YAML rules (deny → require_approval → allow → default deny)
- Audit logging (JSON lines) with size-triggered rotation, retention, and gzip compression — wired by default (v0.5)
- Shell, filesystem, network, browser, cost, data, and mcp_tool scopes (string-glob matching — see Limitations)
- Approval queue with Slack/webhook/console notifications (in-memory, not persisted)
- Web dashboard (live SSE feed, stats, interactive approve/deny)
- Token-bucket rate limiting per scope per agent (in-memory)
- Per-agent policy overrides via agents: config
- Cost guardrails — per-action limits, alert thresholds, and session-level cost tracking
- Conditional rules — require_prior and time_window conditions evaluated at check time
- Python SDK + adapters: LangChain, CrewAI, browser-use, MCP
- TypeScript/Node.js SDK
- Full CLI: serve, validate, check, approve, deny, status, audit, migrate, version
- Docker support with multi-stage build
- Policy hot-reload via --watch
- Data scope — first-class scope for exfiltration / sensitive-payload checks, wired through policy engine and SDKs (v0.5)
- MCP Gateway — wire-level Model Context Protocol proxy with multi-upstream namespacing, capability merging, reconnect-with-backoff, and approval _meta round-trip; ships as the agentguard-mcp-gateway binary with copy-paste configs for Claude Desktop, Cursor, Cline, Continue, and Zed (v0.5)
- LLM API Proxy — drop-in OpenAI / Anthropic-compatible base URL with streaming pause/resume/rewrite, tool-call gating, provider-aware synthetic refusals, and tool→scope mapping; ships as the agentguard-llm-proxy binary with copy-paste examples for the OpenAI SDK, Anthropic SDK, LangChain, and CrewAI (v0.5)
Planned
- SQLite/PostgreSQL audit backend
- Persistent approval queue
- Policy-as-code (test policies in CI/CD)
- Multi-agent session correlation
- Session replay in dashboard
- Policy editor in dashboard
- AutoGPT adapter
- OpenAI Agents SDK adapter
- SOC 2 / compliance report generation
- VS Code extension for policy authoring
Contributing
See CONTRIBUTING.md. Priority areas: adapters for more agent frameworks, new scope types and matching strategies, dashboard UI, documentation.
License
Apache 2.0 — see LICENSE.
Stop hoping your agents behave. Start knowing.