flex-agent-runtime

module

v0.2.0 Latest Latest Go to latest Published: Apr 3, 2026 License: MIT

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/dcosson/flex-agent-runtime

Links

Open Source Insights

README ¶

flex-agent-runtime

Building blocks for an AI agent runtime in Go. Work in progress.

What's here

This project provides composable components for building agent systems. Some examples of what it includes today:

LLM & Embedding client library — unified streaming interface across Anthropic, OpenAI, Google, and Cohere with provider registry, model catalog, cost tracking, extended thinking support, and context overflow detection
A Pi-inspired agent loop — LLM-to-tools-to-LLM turn loop with a state machine, event bus, control queue for steering (pause, follow-up, exit), and session persistence
Sandbox execution — run tools in isolated environments with ZFS copy-on-write snapshots, gVisor containers, per-session resource quotas, and automatic snapshot rollback
Built-in tools — bash, file read/write/edit, grep, glob, git operations, and a code interpreter, split into tiers (in-process vs. isolated)
Terminal multiplexer — PTY session management with ANSI/VT100 emulation, multi-client streaming, and real-time activity monitoring
RPC layer — ConnectRPC-based API for sandbox lifecycle, agent events, and terminal streaming with auth, versioning, and WebSocket relay
Stubserver — test double that serves canned SSE and JSON responses with fault injection (TCP reset, malformed responses, backpressure) for testing provider integrations without real API calls

Supported providers

Provider	Chat/LLM	Embeddings	Key Env Var
Anthropic	Yes	-	`ANTHROPIC_API_KEY`
OpenAI	Yes	Yes	`OPENAI_API_KEY`
Google	Yes	Yes	`GOOGLE_API_KEY` / `GEMINI_API_KEY`
Cohere	-	Yes	`COHERE_API_KEY`
OpenRouter	Yes	-	`OPENROUTER_API_KEY`

Provider configs (base URLs, API client types, headers) and the model catalog are embedded at build time from internal/ai/models/catalog.json. API keys are resolved automatically from environment variables via ResolveEndpoint.

Quick start

# Build everything
make build

# Run the interactive LLM demo
export ANTHROPIC_API_KEY=your-key-here
./bin/llm-demo chat

# Run the embedding similarity demo
export OPENAI_API_KEY=your-key-here
./bin/embedding-demo rank --query "how to cook pasta" \
  --text "boil water and add spaghetti" \
  --text "the weather is sunny today" \
  --text "Italian recipes for beginners"

Using as a library

API clients are stateless protocol implementations. Provider configs (base URL, API keys, headers) are loaded from the embedded catalog at init and resolved per-call. Set the appropriate env var and go:

import (
    "github.com/dcosson/flex-agent-runtime/ai"
    "github.com/dcosson/flex-agent-runtime/ai/provider/anthropic"
)

func init() {
    anthropic.Register(anthropic.ClientConfig{}) // registers the API client
    // Provider config + model catalog loaded automatically from embedded JSON
}

// Stream a response — API key resolved from ANTHROPIC_API_KEY env var
model, _ := ai.GetModel("anthropic", "claude-sonnet-4-6")
es := ai.StreamSimple(ctx, model, ai.Context{
    Messages: messages,
    Tools:    tools,
}, ai.SimpleStreamOptions{})

for event := range es.C {
    // handle streaming events
}
msg, err := es.Result()

import (
    "github.com/dcosson/flex-agent-runtime/ai"
    "github.com/dcosson/flex-agent-runtime/ai/provider/openai"
)

func init() {
    openai.RegisterEmbeddingClient(openai.ClientConfig{})
}

// Embed — API key resolved from OPENAI_API_KEY env var
resp, err := ai.Embed(ctx, "text-embedding-3-small", ai.EmbeddingRequest{
    Texts: []string{"hello world", "goodbye world"},
})

For custom or third-party endpoints:

ai.RegisterCustomProvider(ai.CustomProviderConfig{
    ProviderConfig: ai.ProviderConfig{
        Name:          "my-proxy",
        APIClientType: "openai-completions",
        BaseURL:       "https://my-proxy.example.com/v1",
        KeyEnvVars:    []string{"PROXY_API_KEY"},
    },
})

Deployment modes

The orchestrator supports flexible agent deployment across four independent dimensions:

Dimensions

Agent Loop Placement — where the agent loop process runs
- Orchestrator — in-process on the orchestrator host
- Remote — on a separate host
Tools Placement — where tool execution happens
- Co-located — tools run wherever the agent loop is
- Sandbox — tools dispatch to a sandbox-host via RPC
Agent Type — what drives the agent loop
- Native — built-in AgentLoopService (flex-agent-runtime's own loop)
- Terminal coding agent — 3rd party agent driven via terminal multiplexer (e.g. Claude Code, Codex)
Execution Environment — isolation level for the host
- Bare instance — EC2/VPS with VM-level isolation only (static-host or fleet)
- gVisor — sandbox-host with gVisor containers + ZFS snapshots (static-host or fleet)
- (Future: E2B, Daytona, Fly)

Valid combinations

Agent Type + Environment	Orch + Co-located	Orch + Sandbox	Remote + Co-located	Remote + Sandbox
Native + Bare	dev/local	—	—	—
Native + gVisor	—	tools-sandbox	—	—
Native + E2B	—	(future)	—	—
Native + Daytona	—	(future)	—	—
Native + Fly	—	(future)	—	—
Terminal Agent + Bare	—	—	agent-direct	(future)
Terminal Agent + gVisor	—	—	agent-sandbox	(future)
Terminal Agent + E2B	—	—	(future)	(future)
Terminal Agent + Daytona	—	—	(future)	(future)
Terminal Agent + Fly	—	—	(future)	(future)

A single orchestrator can serve multiple modes concurrently. Each agent session specifies its placement mode at creation time.

Build & test

make help          # list all targets
make build         # build all binaries into ./bin/
make check         # gofmt + vet + staticcheck
make test          # quick tests
make test-race     # race-enabled tests
make test-harness  # property/determinism/concurrency tests
make test-bench    # benchmarks

License

MIT

Directories ¶

Path	Synopsis
ai
provider
provider/anthropic
provider/cohere
provider/google
provider/openai
cmd
flexagent command Package main provides the unified flexagent binary with serve subcommands.	Package main provides the unified flexagent binary with serve subcommands.
flexchat command Package main provides a minimal terminal chat client for the flexagent orchestrator RPC endpoint.	Package main provides a minimal terminal chat client for the flexagent orchestrator RPC endpoint.
sandbox-host command
stubserver command
demos
embedding-demo command
llm-demo command
internal
agent
agent/agenttest
agent/api
ai
ai/provider/anthropic
ai/provider/cohere
ai/provider/google
ai/provider/openai
ai/sse
ai/testutil/stubserver Package stubserver provides a configurable HTTP test server for provider testing.	Package stubserver provides a configurable HTTP test server for provider testing.
orchestrator
rpc
rpc/api
rpc/client
rpc/codec
rpc/server
rpc/transport
rpc/wsrelay Package wsrelay bridges WebSocket connections to the TerminalService RPC.	Package wsrelay bridges WebSocket connections to the TerminalService RPC.
sandbox
sandbox/control Package control defines the SandboxControl interface for orchestrator-level sandbox lifecycle management.	Package control defines the SandboxControl interface for orchestrator-level sandbox lifecycle management.
sandbox/control/cloud Package cloud provides placeholder stubs for cloud-based SandboxControl providers (E2B, Daytona, Fly).	Package cloud provides placeholder stubs for cloud-based SandboxControl providers (E2B, Daytona, Fly).
sandbox/control/direct Package direct implements DirectSandboxControl, which manages raw EC2 instances as agent execution environments.	Package direct implements DirectSandboxControl, which manages raw EC2 instances as agent execution environments.
sandbox/control/fleet Package fleet implements FleetSandboxControl, a SandboxControl adapter that manages a fleet of sandbox-host instances with auto-scaling, health monitoring, warm pool maintenance, and capacity-aware routing.	Package fleet implements FleetSandboxControl, a SandboxControl adapter that manages a fleet of sandbox-host instances with auto-scaling, health monitoring, warm pool maintenance, and capacity-aware routing.
sandbox/control/fleet/ec2 Package ec2 implements the instance.InstanceProvisioner interface using the AWS EC2 API (SDK v2).	Package ec2 implements the instance.InstanceProvisioner interface using the AWS EC2 API (SDK v2).
sandbox/control/instance Package instance defines the InstanceProvisioner interface and associated types for cloud instance lifecycle management.	Package instance defines the InstanceProvisioner interface and associated types for cloud instance lifecycle management.
sandbox/control/node Package node implements NodeSandboxControl, which wraps our sandbox-host RPC service to satisfy the SandboxControl interface.	Package node implements NodeSandboxControl, which wraps our sandbox-host RPC service to satisfy the SandboxControl interface.
sandbox/environment
sandbox/environment/daytona
sandbox/environment/e2b
sandbox/environment/fly
sandbox/environment/local
sandbox/environment/native
sandbox/environment/restapi Package restapi provides shared HTTP/JSON helpers for remote ExecutionEnvironment providers (E2B, Daytona, Fly).	Package restapi provides shared HTTP/JSON helpers for remote ExecutionEnvironment providers (E2B, Daytona, Fly).
sandbox/gvisor
sandbox/zfs
termmux
termmux/driver Package driver defines the TermmuxDriverAdapter interface and canonical conversation types for bidirectional session log conversion.	Package driver defines the TermmuxDriverAdapter interface and canonical conversation types for bidirectional session log conversion.
termmux/driver/claudecode Package claudecode implements the TermmuxDriverAdapter for Claude Code CLI.	Package claudecode implements the TermmuxDriverAdapter for Claude Code CLI.
termmux/driver/codex Package codex implements the TermmuxDriverAdapter for the Codex CLI.	Package codex implements the TermmuxDriverAdapter for the Codex CLI.
termmux/eventsrc/eventstore Package eventstore provides JSONL event persistence.	Package eventstore provides JSONL event persistence.
termmux/eventsrc/otelserver Package otelserver provides a per-session HTTP server that receives OTEL log, metric, and trace data from agent CLIs.	Package otelserver provides a per-session HTTP server that receives OTEL log, metric, and trace data from agent CLIs.
termmux/eventsrc/sessionlog Package sessionlog provides a JSONL file tailer that polls for new lines.	Package sessionlog provides a JSONL file tailer that polls for new lines.
termmux/monitor Package monitor provides the agent state machine, metrics, and event fan-out.	Package monitor provides the agent state machine, metrics, and event fan-out.
tools
tools/codeinterp
tools/codeinterp/datastore
tools/envtools Package envtools bridges the circular-import gap between internal/agent, internal/tools, and internal/sandbox/environment.	Package envtools bridges the circular-import gap between internal/agent, internal/tools, and internal/sandbox/environment.
tests
external Package external contains end-to-end tests for the flex-agent-runtime organized in three tiers:	Package external contains end-to-end tests for the flex-agent-runtime organized in three tiers:
external/common
integration/harness
integration/mode2/harness
integration/mode3/harness
integration/runtime/harness
integration/runtime/workloads
integration/testutil

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL