package delegate

Version: v0.3.0. Published: May 4, 2026. License: MIT. Imports: 15. Imported by: 0.

Repository

github.com/SocialGouv/iterion

Documentation ¶

Overview ¶

Package delegate provides the Backend interface and types for executing agent/judge nodes via pluggable backends (CLI agents like claude-code/codex, or API-based backends like claw).

When a node has `backend: "claude_code"`, the executor invokes the named Backend which handles execution (subprocess, API call, etc.).

Index ¶

Constants
type Backend
type ClaudeCodeBackend
- func (b *ClaudeCodeBackend) Execute(ctx context.Context, task Task) (Result, error)
type CodexBackend
- func (b *CodexBackend) Execute(ctx context.Context, task Task) (Result, error)
type ErrAskUser
- func (e *ErrAskUser) Error() string
type Registry
- func DefaultRegistry(logger *iterlog.Logger) *Registry
- func NewRegistry() *Registry
- func (r *Registry) Register(name string, b Backend)
- func (r *Registry) Resolve(name string) (Backend, error)
type Result
type Task
- func (t Task) SystemPromptWithInteraction() string
type ToolDef

Constants ¶

const (
	BackendClaw       = "claw"
	BackendClaudeCode = "claude_code"
	BackendCodex      = "codex"
)

Backend name constants used for registration and dispatch.

const (
	PriorAskUserQuestionKey   = "_prior_ask_user_question"
	PriorAskUserAnswerKey     = "_prior_ask_user_answer"
	ResumeConversationKey     = "_resume_conversation"
	ResumePendingToolUseIDKey = "_resume_pending_tool_use_id"
	ResumeAnswerKey           = "_resume_answer"
)

Reserved input keys used to relay ask_user pause/resume state across runtime → executor → backend. Owned by the delegate package because they are part of the ask_user contract and both pkg/runtime and pkg/backend/model already import delegate.

PriorAskUser* keys carry the question/answer text for the prompt-side fallback (claude_code, codex). Resume* keys carry the persisted backend conversation, the pending tool_use ID, and the user's answer for in-process backends (claw) that can rehydrate the LLM mid-loop.
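
As a rough sketch of how a caller might pick between the two key groups on resume: the helper `resumeInputs`, the value types stored in the map, and the either/or split are all assumptions made for illustration. Only the key names come from the constants above, which are redeclared locally so the block compiles on its own.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Local copies of the documented reserved keys.
const (
	PriorAskUserQuestionKey   = "_prior_ask_user_question"
	PriorAskUserAnswerKey     = "_prior_ask_user_answer"
	ResumeConversationKey     = "_resume_conversation"
	ResumePendingToolUseIDKey = "_resume_pending_tool_use_id"
	ResumeAnswerKey           = "_resume_answer"
)

// resumeInputs is a hypothetical helper. When a persisted conversation and
// tool_use ID are available (in-process backends), it fills the Resume* keys;
// otherwise it falls back to the PriorAskUser* keys used by the prompt-side
// path. The real runtime may populate these differently.
func resumeInputs(question, answer string, conversation json.RawMessage, toolUseID string) map[string]interface{} {
	if len(conversation) > 0 && toolUseID != "" {
		return map[string]interface{}{
			ResumeConversationKey:     conversation,
			ResumePendingToolUseIDKey: toolUseID,
			ResumeAnswerKey:           answer,
		}
	}
	return map[string]interface{}{
		PriorAskUserQuestionKey: question,
		PriorAskUserAnswerKey:   answer,
	}
}

func main() {
	// No persisted conversation: prompt-side fallback keys are used.
	in := resumeInputs("Which branch?", "main", nil, "")
	fmt.Println(len(in), in[PriorAskUserAnswerKey])
}
```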

const AskUserQuestionKey = "ask_user_response"

AskUserQuestionKey is the canonical key under which iterion files an ask_user question in the Interaction record (and looks up the answer on resume). Stable across runs so workflow authors can reference {{input.ask_user_response}} in their prompts if they want explicit handling beyond the auto-prepended context block.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type Backend ¶

type Backend interface {
	// Execute runs the CLI agent with the given task and returns structured output.
	Execute(ctx context.Context, task Task) (Result, error)
}

Backend is the interface for delegation execution. Each backend wraps a CLI agent (e.g. claude, codex) or an API-based client (e.g. claw) and handles prompt delivery, tool forwarding, and output collection.
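
A minimal sketch of a custom backend satisfying this interface. `Task` and `Result` below are trimmed local stand-ins for the package's types so the block compiles on its own, and `EchoBackend` and `runEcho` are hypothetical names introduced only for illustration.

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// Task and Result are trimmed stand-ins for delegate.Task and
// delegate.Result, reduced to the fields this sketch needs.
type Task struct {
	NodeID     string
	UserPrompt string
}

type Result struct {
	Output      map[string]interface{}
	Duration    time.Duration
	BackendName string
}

// Backend mirrors the documented single-method interface.
type Backend interface {
	Execute(ctx context.Context, task Task) (Result, error)
}

// EchoBackend is a hypothetical backend that returns the user prompt as
// its output instead of calling a CLI or an API.
type EchoBackend struct{}

func (EchoBackend) Execute(ctx context.Context, task Task) (Result, error) {
	start := time.Now()
	// Honor cancellation before doing any work.
	if err := ctx.Err(); err != nil {
		return Result{}, err
	}
	return Result{
		Output:      map[string]interface{}{"text": task.UserPrompt},
		Duration:    time.Since(start),
		BackendName: "echo",
	}, nil
}

// runEcho executes EchoBackend through the interface with a background context.
func runEcho(prompt string) (Result, error) {
	var b Backend = EchoBackend{}
	return b.Execute(context.Background(), Task{NodeID: "n1", UserPrompt: prompt})
}

func main() {
	res, err := runEcho("hello")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(res.BackendName, res.Output["text"])
}
```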

type ClaudeCodeBackend ¶

type ClaudeCodeBackend struct {
	// Command overrides the CLI binary path (default: "claude").
	Command string
	// Logger is the leveled logger for diagnostic output.
	Logger *iterlog.Logger
}

ClaudeCodeBackend delegates work to the `claude` CLI (claude-code) via the Claude Agent SDK.

func (*ClaudeCodeBackend) Execute ¶

func (b *ClaudeCodeBackend) Execute(ctx context.Context, task Task) (Result, error)

Execute runs the claude CLI with the given task using the Claude Agent SDK.

type CodexBackend ¶

type CodexBackend struct {
	// Command overrides the CLI binary path (default: "codex").
	Command string
	// Logger is the leveled logger for diagnostic output.
	Logger *iterlog.Logger
}

CodexBackend delegates work to the `codex` CLI (OpenAI Codex) via the Codex Agent SDK.

func (*CodexBackend) Execute ¶

func (b *CodexBackend) Execute(ctx context.Context, task Task) (Result, error)

Execute runs the codex CLI with the given task using the Codex Agent SDK.

type ErrAskUser ¶

type ErrAskUser struct {
	Question         string
	PendingToolUseID string
	Conversation     json.RawMessage
}

ErrAskUser is returned by the iterion-wired `ask_user` tool's handler when an LLM calls it during the agent loop. It propagates up through the generation layer to the backend, which converts it into a standard _needs_interaction Result. Iterion's existing pause/resume flow then surfaces the question in the developer's terminal and re-invokes the node with the answer.

Conversation and PendingToolUseID enable mid-tool-loop resume: when set, they let the backend rehydrate the LLM's exact pre-pause state on the next turn (the persisted message history plus a tool_result block answering the captured tool_use). The opaque json.RawMessage type keeps the delegate package agnostic of any specific LLM SDK's message shape.
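
The conversion described above might look roughly like the following. This is a sketch under stated assumptions: `ErrAskUser` is redeclared locally with an invented `Error` text, `Result` is a trimmed stand-in, the helpers `toInteractionResult` and `simulateToolLoop` are hypothetical, and the shape of the `_interaction_questions` value is a guess rather than the package's actual format.

```go
package main

import (
	"encoding/json"
	"errors"
	"fmt"
)

// ErrAskUser mirrors the documented struct; the Error text is an assumption.
type ErrAskUser struct {
	Question         string
	PendingToolUseID string
	Conversation     json.RawMessage
}

func (e *ErrAskUser) Error() string { return "ask_user: " + e.Question }

// Result is a trimmed stand-in for delegate.Result.
type Result struct {
	Output              map[string]interface{}
	PendingConversation json.RawMessage
	PendingToolUseID    string
}

// toInteractionResult looks for an *ErrAskUser anywhere in err's chain and,
// if found, builds a _needs_interaction result carrying the resume state.
// The second return value is false when err is nil or some other error.
func toInteractionResult(err error) (Result, bool) {
	var ask *ErrAskUser
	if !errors.As(err, &ask) {
		return Result{}, false
	}
	return Result{
		Output: map[string]interface{}{
			"_needs_interaction": true,
			// Assumed shape: question text keyed by "ask_user_response".
			"_interaction_questions": map[string]interface{}{
				"ask_user_response": ask.Question,
			},
		},
		PendingConversation: ask.Conversation,
		PendingToolUseID:    ask.PendingToolUseID,
	}, true
}

// simulateToolLoop is a hypothetical stand-in for a generation layer that
// wraps the tool handler's error on its way up to the backend.
func simulateToolLoop() error {
	return fmt.Errorf("generation step: %w", &ErrAskUser{
		Question:         "Which branch should I target?",
		PendingToolUseID: "toolu_123", // illustrative ID
		Conversation:     json.RawMessage(`[]`),
	})
}

func main() {
	res, ok := toInteractionResult(simulateToolLoop())
	fmt.Println(ok, res.PendingToolUseID)
}
```

Using `errors.As` rather than a direct type assertion keeps the check working even when intermediate layers wrap the error with `%w`.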

func (*ErrAskUser) Error ¶

func (e *ErrAskUser) Error() string

type Registry ¶

type Registry struct {
	// contains filtered or unexported fields
}

Registry maps backend names to Backend implementations.

func DefaultRegistry ¶

func DefaultRegistry(logger *iterlog.Logger) *Registry

DefaultRegistry returns a registry pre-loaded with the standard claude_code and codex backends.

func NewRegistry ¶

func NewRegistry() *Registry

NewRegistry creates an empty delegation backend registry.

func (*Registry) Register ¶

func (r *Registry) Register(name string, b Backend)

Register adds a backend under the given name.

func (*Registry) Resolve ¶

func (r *Registry) Resolve(name string) (Backend, error)

Resolve looks up a backend by name. Returns an error if not found.
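
To illustrate the register-then-resolve flow, here is a self-contained sketch. The map-backed `Registry` below is a hypothetical stand-in that only copies the documented method signatures (the real type's fields are unexported, and its error text and concurrency behavior are not stated here); `stubBackend` and `dispatch` are likewise invented for the example.

```go
package main

import (
	"context"
	"fmt"
)

// Trimmed stand-ins for the package's Task, Result, and Backend.
type Task struct{ UserPrompt string }
type Result struct{ BackendName string }

type Backend interface {
	Execute(ctx context.Context, task Task) (Result, error)
}

// Registry is a hypothetical map-backed stand-in with the documented API.
type Registry struct {
	backends map[string]Backend
}

func NewRegistry() *Registry {
	return &Registry{backends: make(map[string]Backend)}
}

func (r *Registry) Register(name string, b Backend) { r.backends[name] = b }

func (r *Registry) Resolve(name string) (Backend, error) {
	b, ok := r.backends[name]
	if !ok {
		return nil, fmt.Errorf("backend %q not registered", name)
	}
	return b, nil
}

// stubBackend is a hypothetical backend that only reports its own name.
type stubBackend struct{ name string }

func (s stubBackend) Execute(ctx context.Context, task Task) (Result, error) {
	return Result{BackendName: s.name}, nil
}

// dispatch resolves the backend named by a node and runs the task on it.
func dispatch(r *Registry, backendName string, task Task) (Result, error) {
	b, err := r.Resolve(backendName)
	if err != nil {
		return Result{}, err
	}
	return b.Execute(context.Background(), task)
}

func main() {
	reg := NewRegistry()
	reg.Register("claude_code", stubBackend{name: "claude_code"})

	res, err := dispatch(reg, "claude_code", Task{UserPrompt: "review this diff"})
	fmt.Println(res.BackendName, err)

	// An unregistered name surfaces the Resolve error to the caller.
	_, err = dispatch(reg, "claw", Task{})
	fmt.Println(err)
}
```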

type Result ¶

type Result struct {
	// Output is the parsed structured output from the CLI agent.
	Output map[string]interface{}

	// Tokens is an estimate of total tokens consumed (if available from CLI metadata).
	Tokens int

	// Duration is the wall-clock time of the subprocess execution.
	Duration time.Duration

	// ExitCode is the process exit code (0 on success).
	ExitCode int

	// Stderr contains captured stderr output (warnings, progress info).
	Stderr string

	// BackendName identifies which backend produced this result (e.g. "claude_code", "codex").
	BackendName string

	// RawOutputLen is the byte length of raw stdout before parsing.
	RawOutputLen int

	// ParseFallback is true when structured output was expected (OutputSchema set)
	// but JSON parsing fell back to wrapping plain text as {"text": "..."}.
	ParseFallback bool

	// FormattingPassUsed is true when a two-pass execution was performed:
	// Pass 1 with tools (no output format), Pass 2 with WithOutputFormat
	// (no tools) to guarantee structured output conforming to the schema.
	FormattingPassUsed bool

	// SessionID is the session ID returned by the CLI agent (empty if unavailable).
	SessionID string

	// PendingConversation is the persisted LLM conversation captured at
	// the moment the agent loop was suspended by an ask_user call. The
	// runtime serializes this opaque blob into the checkpoint so that
	// resume can replay it via Task.ResumeConversation, preserving the
	// LLM's mid-tool-loop state across the pause. Backends that cannot
	// persist conversation state (CLI-based: claude_code, codex) leave
	// this nil and rely on the [PRIOR INTERACTION] prompt-side fallback.
	PendingConversation json.RawMessage

	// PendingToolUseID is the ID of the tool_use block awaiting an
	// answer in PendingConversation. Required when PendingConversation
	// is non-nil.
	PendingToolUseID string
}

Result contains the output from a delegation backend.
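
The ParseFallback behavior described in the field comments can be sketched as follows. `parseOutput` is a hypothetical helper, not the package's implementation, and its handling of the no-schema case (always wrap as text, never flag a fallback) is an assumption.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// parseOutput turns raw backend stdout into an output map. schemaSet stands
// for "Task.OutputSchema was non-nil". The second return value plays the
// role of Result.ParseFallback.
func parseOutput(raw []byte, schemaSet bool) (map[string]interface{}, bool) {
	if !schemaSet {
		// Free-form output: wrap as text; nothing was expected to parse.
		return map[string]interface{}{"text": string(raw)}, false
	}
	var out map[string]interface{}
	if err := json.Unmarshal(raw, &out); err == nil && out != nil {
		return out, false
	}
	// Structured output was expected but the payload is not a JSON object:
	// wrap the plain text and report the fallback.
	return map[string]interface{}{"text": string(raw)}, true
}

func main() {
	out, fallback := parseOutput([]byte(`{"verdict":"pass"}`), true)
	fmt.Println(out["verdict"], fallback)

	out, fallback = parseOutput([]byte("looks good to me"), true)
	fmt.Println(out["text"], fallback)
}
```

The `out != nil` check matters because a literal JSON `null` unmarshals into a nil map without returning an error.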

type Task ¶

type Task struct {
	// NodeID is the IR node identifier, used for observability hooks.
	NodeID string

	// SystemPrompt is the fully resolved system prompt text.
	SystemPrompt string

	// UserPrompt is the fully resolved user message text.
	UserPrompt string

	// AllowedTools is the list of tool names the CLI agent may use.
	// Used by CLI-based backends; API-based backends use ToolDefs instead.
	AllowedTools []string

	// ToolDefs provides full tool definitions for backends that manage tool
	// loops internally (e.g. claw). CLI-based backends ignore this field.
	ToolDefs []ToolDef

	// OutputSchema is the JSON Schema for the expected structured output.
	// Nil means free-form text output.
	OutputSchema json.RawMessage

	// Model is the resolved model spec (e.g. "anthropic/claude-sonnet-4-6").
	// Required for API-based backends; ignored by CLI-based backends.
	Model string

	// HasTools indicates whether the node has tools, enabling backends to
	// choose between structured-output and text-with-tools generation strategies.
	HasTools bool

	// ToolMaxSteps is the maximum number of tool-use iterations (0 = default).
	ToolMaxSteps int

	// MaxTokens caps the LLM response length per call. Honored by API-based
	// backends (claw); CLI-based backends (claude_code, codex) ignore it.
	// Zero means "use the backend default" (typically 8192).
	MaxTokens int

	// WorkDir is the working directory for the CLI subprocess.
	WorkDir string

	// BaseDir is the allowed base directory for WorkDir validation.
	// If set, WorkDir must resolve to a path within BaseDir.
	BaseDir string

	// ReasoningEffort is the reasoning effort level.
	// Valid values: "low", "medium", "high", "xhigh", "max".
	ReasoningEffort string

	// CompactThresholdRatio is the resolved compaction trigger as a
	// fraction of the model's context window (0 = use backend default).
	// Backends that maintain their own session history (claw) honor this;
	// CLI-based backends ignore it (claude_code does its own compaction).
	CompactThresholdRatio float64

	// CompactPreserveRecent is the number of recent messages kept verbatim
	// during compaction (0 = use backend default of 4).
	CompactPreserveRecent int

	// SessionID is an optional session ID to resume (empty = fresh session).
	SessionID string

	// ForkSession, when true, forks from the resumed session instead of
	// continuing it. Requires SessionID to be set. The forked session gets
	// a new ID and does not mutate the original session.
	ForkSession bool

	// InteractionEnabled, when true, instructs the delegate to signal when
	// it needs user input by including _needs_interaction and
	// _interaction_questions fields in its output.
	InteractionEnabled bool

	// ResumeConversation, when non-nil, instructs the backend to skip
	// rendering the system+user prompts from scratch and instead replay
	// the persisted conversation history captured at the previous pause.
	// The backend appends a tool_result content block (tool_use_id =
	// ResumePendingToolUseID, content = ResumeAnswer) to answer the
	// pending ask_user call, then continues the agent loop. The opaque
	// json.RawMessage shape lets each backend choose its own message
	// representation (e.g. claw uses []api.Message).
	ResumeConversation json.RawMessage

	// ResumePendingToolUseID is the ID of the tool_use block waiting
	// for an answer in the persisted conversation. Required when
	// ResumeConversation is set.
	ResumePendingToolUseID string

	// ResumeAnswer is the human-supplied answer to the captured
	// ask_user call, sent back to the LLM as the tool_result content.
	ResumeAnswer string
}

Task describes the work to execute on a backend.
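
One way the WorkDir/BaseDir constraint could be checked is sketched below. `validateWorkDir` is a hypothetical helper; it performs a purely lexical containment check and does not resolve symlinks, which the real validation may well do.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// validateWorkDir reports an error when workDir lies outside baseDir.
// An empty baseDir means no constraint is configured.
func validateWorkDir(baseDir, workDir string) error {
	if baseDir == "" {
		return nil
	}
	base, err := filepath.Abs(baseDir)
	if err != nil {
		return err
	}
	work, err := filepath.Abs(workDir)
	if err != nil {
		return err
	}
	// Rel yields a path starting with ".." when work is not under base.
	rel, err := filepath.Rel(base, work)
	if err != nil {
		return err
	}
	if rel == ".." || strings.HasPrefix(rel, ".."+string(filepath.Separator)) {
		return fmt.Errorf("work dir %q escapes base dir %q", workDir, baseDir)
	}
	return nil
}

func main() {
	fmt.Println(validateWorkDir("/srv/repo", "/srv/repo/sub"))
	fmt.Println(validateWorkDir("/srv/repo", "/srv/repo/../other"))
}
```

Comparing against `".."` plus the separator, rather than a bare `".."` prefix, avoids rejecting legitimate child directories whose names merely begin with two dots.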

func (Task) SystemPromptWithInteraction ¶

func (t Task) SystemPromptWithInteraction() string

SystemPromptWithInteraction returns the task's SystemPrompt augmented with the interaction protocol instructions when InteractionEnabled is true. Backends should call this instead of reading SystemPrompt directly so the LLM consistently learns how to escalate to a human.

type ToolDef ¶

type ToolDef struct {
	Name        string
	Description string
	InputSchema json.RawMessage
	Execute     func(ctx context.Context, input json.RawMessage) (string, error)
}

ToolDef is a fully resolved tool definition for backends that execute tools internally (e.g. claw). CLI-based backends use AllowedTools (string names) instead.
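
As an illustration of how a ToolDef might be built and invoked by a backend that runs its own tool loop, consider the sketch below. The struct is redeclared locally from the definition above; the `upper` tool, `newUpperTool`, and `callTool` are hypothetical and exist only for this example.

```go
package main

import (
	"context"
	"encoding/json"
	"fmt"
	"strings"
)

// ToolDef is a local copy of the documented struct.
type ToolDef struct {
	Name        string
	Description string
	InputSchema json.RawMessage
	Execute     func(ctx context.Context, input json.RawMessage) (string, error)
}

// newUpperTool builds a hypothetical tool that upper-cases its "text" input.
func newUpperTool() ToolDef {
	return ToolDef{
		Name:        "upper",
		Description: "Upper-cases the given text.",
		InputSchema: json.RawMessage(`{"type":"object","properties":{"text":{"type":"string"}},"required":["text"]}`),
		Execute: func(ctx context.Context, input json.RawMessage) (string, error) {
			var args struct {
				Text string `json:"text"`
			}
			if err := json.Unmarshal(input, &args); err != nil {
				return "", fmt.Errorf("upper: invalid input: %w", err)
			}
			return strings.ToUpper(args.Text), nil
		},
	}
}

// callTool invokes a tool the way an in-process tool loop might: raw JSON
// arguments from the model go in, a string tool result comes out.
func callTool(t ToolDef, rawInput string) (string, error) {
	return t.Execute(context.Background(), json.RawMessage(rawInput))
}

func main() {
	out, err := callTool(newUpperTool(), `{"text":"hello"}`)
	fmt.Println(out, err)
}
```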

Directories ¶

Path	Synopsis
claudesdk	Package claudesdk provides a Go SDK for the Claude Code CLI.