adk-utils-go

v0.13.0 · Published: Mar 30, 2026 · License: Apache-2.0

ADK Utils Go

Utilities and implementations for Google's Agent Development Kit (ADK) in Go.

This repository provides production-ready implementations for:

  • LLM Clients: OpenAI and Anthropic clients compatible with ADK
  • Session Management: Redis-based session persistence
  • Long-term Memory: PostgreSQL + pgvector for semantic search
  • Memory Tools: Toolsets for agent-controlled memory operations
  • Artifact Storage: Filesystem-based artifact persistence with versioning
  • Context Guard: Automatic context window management with LLM-powered summarization
  • Langfuse: Observability plugin — traces every LLM call to Langfuse with full prompt/response payloads and token usage

Structure

├── genai/            # LLM client implementations
│   ├── openai/       # OpenAI client (works with Ollama, OpenRouter, etc.)
│   └── anthropic/    # Anthropic Claude client
├── session/          # Session service implementations
│   └── redis/        # Redis session service
├── memory/           # Memory service implementations
│   └── postgres/     # PostgreSQL + pgvector memory service
├── tools/            # Tool and toolset implementations
│   └── memory/       # Memory toolset for agents
├── artifact/         # Artifact service implementations
│   └── filesystem/   # Filesystem artifact service (versioned, user-scoped)
├── plugin/           # ADK plugin implementations
│   ├── contextguard/ # Context window management plugin + CrushRegistry
│   └── langfuse/     # Langfuse observability plugin (OTLP/HTTP traces)
└── examples/         # Working examples

Installation

go get github.com/achetronic/adk-utils-go

LLM Clients

OpenAI Client

Works with OpenAI API and any OpenAI-compatible API (Ollama, OpenRouter, Azure OpenAI, etc.):

import genaiopenai "github.com/achetronic/adk-utils-go/genai/openai"

llmModel := genaiopenai.New(genaiopenai.Config{
    APIKey:    os.Getenv("OPENAI_API_KEY"),
    BaseURL:   "http://localhost:11434/v1", // For Ollama
    ModelName: "gpt-4o",                     // Or "qwen3:8b" for Ollama
})

agent, _ := llmagent.New(llmagent.Config{
    Name:  "assistant",
    Model: llmModel,
})
Anthropic Client

Native Anthropic Claude support:

import genaianthropic "github.com/achetronic/adk-utils-go/genai/anthropic"

llmModel := genaianthropic.New(genaianthropic.Config{
    APIKey:    os.Getenv("ANTHROPIC_API_KEY"),
    ModelName: "claude-sonnet-4-5-20250929",
})

agent, _ := llmagent.New(llmagent.Config{
    Name:  "assistant",
    Model: llmModel,
})
Extended Thinking

Claude can generate an internal reasoning chain before producing its final response. Thinking tokens are output tokens — Claude writes the reasoning as text (it just isn't shown to the user). Set ThinkingBudgetTokens to reserve a portion of the output budget for this reasoning. The remaining tokens (MaxOutputTokens - ThinkingBudgetTokens) are available for the final response.

llmModel := genaianthropic.New(genaianthropic.Config{
    APIKey:               os.Getenv("ANTHROPIC_API_KEY"),
    ModelName:            "claude-sonnet-4-5-20250929",
    MaxOutputTokens:      16000,
    ThinkingBudgetTokens: 10000, // must be >= 1024 and < MaxOutputTokens
})
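As a sanity check on the budget arithmetic above, here is a small self-contained helper (illustrative only; `responseBudget` is a hypothetical name, not part of the library) that computes the tokens left for the visible answer and enforces the documented constraints:

```go
package main

import (
	"errors"
	"fmt"
)

// responseBudget returns how many output tokens remain for the final
// answer once the thinking budget is reserved, enforcing the documented
// rule: ThinkingBudgetTokens >= 1024 and < MaxOutputTokens.
func responseBudget(maxOutput, thinkingBudget int) (int, error) {
	if thinkingBudget < 1024 || thinkingBudget >= maxOutput {
		return 0, errors.New("ThinkingBudgetTokens must be >= 1024 and < MaxOutputTokens")
	}
	return maxOutput - thinkingBudget, nil
}

func main() {
	n, err := responseBudget(16000, 10000)
	fmt.Println(n, err) // tokens available for the final response
}
```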
Custom HTTP Headers

Both clients support custom HTTP headers via HTTPOptions, useful for beta features, auth proxies, or provider-specific flags:

import "net/http"

llmModel := genaianthropic.New(genaianthropic.Config{
    APIKey:    os.Getenv("ANTHROPIC_API_KEY"),
    ModelName: "claude-sonnet-4-5-20250929",
    HTTPOptions: genaianthropic.HTTPOptions{
        Headers: http.Header{
            "anthropic-beta": []string{"context-1m-2025-08-07"},
        },
    },
})
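Because http.Header is just map[string][]string from the standard library, multi-value headers behave the usual Go way. This standalone sketch (nothing here is library-specific; the second flag name is made up) shows how repeated Add calls accumulate values under one key:

```go
package main

import (
	"fmt"
	"net/http"
)

// betaHeaders builds a header carrying two values under a single key.
// Add appends a value; Set would replace all existing values instead.
func betaHeaders() http.Header {
	h := http.Header{}
	h.Add("anthropic-beta", "context-1m-2025-08-07")
	h.Add("anthropic-beta", "some-other-beta-flag") // hypothetical flag
	return h
}

func main() {
	// Header keys are canonicalized, so lookups are case-insensitive.
	fmt.Println(betaHeaders().Values("anthropic-beta"))
}
```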
Supported Features

Both clients support:

  • Streaming and non-streaming responses
  • System instructions
  • Tool/function calling
  • Image inputs (base64)
  • Temperature, TopP, MaxOutputTokens, StopSequences
  • Extended thinking (ThinkingBudgetTokens)
  • Usage metadata
  • Custom HTTP headers (multi-value)

Session Service (Redis)

Persistent session storage with Redis:

import sessionredis "github.com/achetronic/adk-utils-go/session/redis"

sessionService, _ := sessionredis.NewRedisSessionService(sessionredis.RedisSessionServiceConfig{
    Addr:     "localhost:6379",
    Password: "",
    DB:       0,
    TTL:      24 * time.Hour,
})
defer sessionService.Close()

runner, _ := runner.New(runner.Config{
    SessionService: sessionService,
})

Memory Service (PostgreSQL + pgvector)

Long-term memory with semantic search:

import memorypostgres "github.com/achetronic/adk-utils-go/memory/postgres"

memoryService, _ := memorypostgres.NewPostgresMemoryService(ctx, memorypostgres.PostgresMemoryServiceConfig{
    ConnString: "postgres://postgres:postgres@localhost:5432/postgres?sslmode=disable",
    EmbeddingModel: memorypostgres.NewOpenAICompatibleEmbedding(memorypostgres.OpenAICompatibleEmbeddingConfig{
        BaseURL: "http://localhost:11434/v1",
        Model:   "nomic-embed-text",
    }),
})
defer memoryService.Close()

runner, _ := runner.New(runner.Config{
    MemoryService: memoryService,
})

Memory Toolset

Give agents explicit control over long-term memory:

import memorytools "github.com/achetronic/adk-utils-go/tools/memory"

memoryToolset, _ := memorytools.NewToolset(memorytools.ToolsetConfig{
    MemoryService: memoryService,
    AppName:       "my_app",
})

agent, _ := llmagent.New(llmagent.Config{
    Toolsets: []tool.Toolset{memoryToolset},
})

The toolset provides:

  • search_memory: Semantic search across stored memories
  • save_to_memory: Save information for future recall

Artifact Service (Filesystem)

Versioned artifact storage backed by the local filesystem. Agents can save, load, list, and delete files (code, documents, data) that are delivered to the user as downloadable content.

import artifactfs "github.com/achetronic/adk-utils-go/artifact/filesystem"

artifactService, _ := artifactfs.NewFilesystemService(artifactfs.FilesystemServiceConfig{
    BasePath: "data/artifacts",
})

// Use with ADK launcher
launcherCfg := &launcher.Config{
    SessionService:  sessionService,
    AgentLoader:     agentLoader,
    ArtifactService: artifactService,
}

Artifacts are stored at {BasePath}/{appName}/{userID}/{sessionID}/{fileName}/{version}.json. Filenames prefixed with user: are scoped to the user across all sessions, making them accessible from any conversation.

Langfuse Plugin

Traces every agent invocation and LLM call to Langfuse via OTLP/HTTP. Enriches generate_content spans with full request/response payloads and token usage so Langfuse can display costs, latency, and prompt/completion content.

Supports all ADK agent topologies: single agents, sequential delegation, SequentialAgent, LoopAgent, and ParallelAgent.

Setup
import "github.com/achetronic/adk-utils-go/plugin/langfuse"

pluginCfg, shutdown, err := langfuse.Setup(&langfuse.Config{
    PublicKey:   os.Getenv("LANGFUSE_PUBLIC_KEY"),
    SecretKey:   os.Getenv("LANGFUSE_SECRET_KEY"),
    Host:        "https://cloud.langfuse.com", // or self-hosted URL
    Environment: "production",
    ServiceName: "my-agent",
})
if err != nil { log.Fatal(err) }
defer shutdown(context.Background())

runnr, _ := runner.New(runner.Config{
    Agent:        myAgent,
    PluginConfig: pluginCfg,
})
Combining with ContextGuard
langfusePluginCfg, shutdown, _ := langfuse.Setup(langfuseCfg) // langfuseCfg: *langfuse.Config from above
defer shutdown(context.Background())
guardCfg := guard.PluginConfig()

combined := runner.PluginConfig{
    Plugins: append(langfusePluginCfg.Plugins, guardCfg.Plugins...),
}
Per-Request Context

Inject per-request attributes via context (typically in HTTP middleware):

ctx = langfuse.WithUserID(ctx, "user-123")
ctx = langfuse.WithTags(ctx, []string{"beta", "internal"})
ctx = langfuse.WithTraceName(ctx, "customer-support")
ctx = langfuse.WithTraceMetadata(ctx, map[string]string{"tenant": "acme"})
Config
| Field | Required | Default | Description |
|---|---|---|---|
| PublicKey | Yes | | Langfuse project public key (Basic Auth user) |
| SecretKey | Yes | | Langfuse project secret key (Basic Auth pass) |
| Host | No | https://cloud.langfuse.com | Langfuse server URL |
| Environment | No | | Deployment environment tag |
| Release | No | | Application version tag |
| ServiceName | No | langfuse-adk | OTel service.name resource attribute |
| Insecure | No | false | Disable TLS for the OTLP/HTTP exporter (for self-hosted plain-HTTP instances) |

Use cfg.IsEnabled() to conditionally skip setup when credentials are absent.

Context Guard Plugin

Automatic context window management that prevents conversations from exceeding the LLM's token limit. It works as an ADK BeforeModelCallback plugin — before every LLM call, it checks whether the conversation is approaching the limit and summarizes older messages to make room.

Strategies
| Strategy | Trigger | Best for |
|---|---|---|
| threshold | Token count approaches the context window limit | Maximizing context usage; models with known limits |
| sliding_window | Turn count exceeds a configured maximum | Predictable compaction; long-running conversations |
Setup

The plugin requires a ModelRegistry to look up context window sizes. A built-in CrushRegistry is provided that fetches model metadata from Crush's provider.json and refreshes every 6 hours:

import "github.com/achetronic/adk-utils-go/plugin/contextguard"

// 1. Start the registry (built-in, fetches from Crush)
registry := contextguard.NewCrushRegistry()
registry.Start(ctx)
defer registry.Stop()

// 2. Create the guard and add agents
guard := contextguard.New(registry)
guard.Add("assistant", llmModel)

// 3. Pass to ADK runner
runnr, _ := runner.New(runner.Config{
    Agent:        myAgent,
    PluginConfig: guard.PluginConfig(),
})

Per-agent options are available via functional options:

guard := contextguard.New(registry)

// Threshold strategy (default) — summarizes when tokens approach the limit
guard.Add("assistant", llmModel)

// Sliding window — summarizes after N turns regardless of token count
guard.Add("researcher", llmResearcher, contextguard.WithSlidingWindow(30))

// Manual context window override — bypasses the registry for this agent
guard.Add("writer", llmWriter, contextguard.WithMaxTokens(1_000_000))

// Custom compaction retry limit (default: 3) — applies to both strategies
guard.Add("analyst", llmAnalyst, contextguard.WithMaxCompactionAttempts(5))

Multi-agent setup is the same API — just call Add multiple times:

guard := contextguard.New(registry)
for _, agentDef := range agents {
    guard.Add(agentDef.ID, llmMap[agentDef.ID], optsFromDef(agentDef)...)
}
Custom Model Registry

You can implement your own ModelRegistry instead of using CrushRegistry:

type myRegistry struct{}

func (r *myRegistry) ContextWindow(modelID string) int {
    windows := map[string]int{
        "claude-sonnet-4-5-20250929": 200000,
        "gpt-4o":                     128000,
    }
    if w, ok := windows[modelID]; ok {
        return w
    }
    return 128000
}

func (r *myRegistry) DefaultMaxTokens(modelID string) int {
    return 4096
}

guard := contextguard.New(&myRegistry{})
guard.Add("assistant", llmModel)
How it works
  1. Before every LLM call, the plugin checks the configured strategy for the agent
  2. Threshold: estimates total tokens and triggers summarization when remaining capacity drops below a safety buffer (fixed 20k for windows >200k, 20% for smaller ones)
  3. Sliding window: counts Content entries since the last compaction and triggers when the limit is exceeded
  4. When triggered, the conversation is split into "old" (summarized by the agent's own LLM) and "recent" (kept verbatim)
  5. Both strategies retry compaction up to 3 times (maxCompactionAttempts) if the resulting summary still exceeds the threshold. After exhausting all attempts the request is sent as-is (best-effort)
  6. The summary is persisted in session state and injected on subsequent requests until the next compaction
  7. Tool call chains (tool_use + tool_result) are never split mid-chain to prevent provider errors
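The threshold rule in step 2 can be sketched as a pure function (safetyBuffer and shouldCompact are illustrative names for this sketch, not the plugin's actual internals):

```go
package main

import "fmt"

// safetyBuffer mirrors the documented rule: a fixed 20k-token buffer for
// context windows larger than 200k, otherwise 20% of the window.
func safetyBuffer(contextWindow int) int {
	if contextWindow > 200_000 {
		return 20_000
	}
	return contextWindow / 5
}

// shouldCompact triggers summarization once the estimated conversation
// size eats into the safety buffer.
func shouldCompact(estimatedTokens, contextWindow int) bool {
	return contextWindow-estimatedTokens < safetyBuffer(contextWindow)
}

func main() {
	fmt.Println(shouldCompact(170_000, 200_000))   // 30k left vs. a 40k buffer: compact
	fmt.Println(shouldCompact(900_000, 1_000_000)) // 100k left vs. a fixed 20k buffer: keep going
}
```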

Examples

Complete working examples in the examples/ directory:

| Example | Description |
|---|---|
| openai-client | OpenAI/Ollama client usage |
| anthropic-client | Anthropic Claude client usage |
| session-memory | Session management with Redis |
| long-term-memory | Long-term memory with PostgreSQL + pgvector |
| full-memory | Combined session + long-term memory |
| context-guard | ContextGuard plugin with CrushRegistry, manual thresholds, and sliding window |
Quick Start
# Start services
docker run -d --name postgres -e POSTGRES_PASSWORD=postgres -p 5432:5432 pgvector/pgvector:pg16
docker run -d --name redis -p 6379:6379 redis:alpine
ollama pull qwen3:8b
ollama pull nomic-embed-text

# Run an example
go run ./examples/openai-client
Environment Variables
| Variable | Default | Description |
|---|---|---|
| OPENAI_API_KEY | - | OpenAI API key (not needed for Ollama) |
| OPENAI_BASE_URL | - | OpenAI-compatible API endpoint |
| ANTHROPIC_API_KEY | - | Anthropic API key |
| MODEL_NAME | gpt-4o / claude-sonnet-4-5-20250929 | Model name |
| EMBEDDING_BASE_URL | http://localhost:11434/v1 | Embedding API endpoint |
| EMBEDDING_MODEL | nomic-embed-text | Embedding model |
| POSTGRES_URL | postgres://postgres:postgres@localhost:5432/postgres?sslmode=disable | PostgreSQL connection string |
| REDIS_ADDR | localhost:6379 | Redis address |

Requirements

License

Apache 2.0

Directories

| Path | Synopsis |
|---|---|
| artifact | |
| examples | |
| examples/context-guard (command) | Context Guard Example |
| examples/full-memory (command) | |
| examples/openai-client (command) | |
| examples/session-memory (command) | |
| genai | |
| genai/openai | Package openai provides an OpenAI-compatible LLM implementation for the ADK. |
| memory | |
| plugin | |
| plugin/contextguard | Package contextguard implements an ADK plugin that prevents conversations from exceeding the LLM's context window. |
| plugin/langfuse | Package langfuse wires ADK Go telemetry to a Langfuse instance via OTLP/HTTP. |
| session | |
| tools | |