multimodal_input

command

v0.0.2 Latest Latest Go to latest Published: Apr 24, 2026 License: GPL-3.0 Imports: 8 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/ethpandaops/opencode-agent-sdk-go

Links

Open Source Insights

Documentation ¶

Overview ¶

Demonstrates multimodal prompt input via the opencodesdk.QueryContent one-shot helper.

This example shows three entry points:

opencodesdk.QueryContent — one-shot text + image prompt
opencodesdk.ImageFileInput — load an image from disk
opencodesdk.Text / Blocks — ergonomic block constructors

The legacy string-only opencodesdk.Query still works for plain text — QueryContent is only needed when you want to attach images, embedded resources, or other non-text content blocks.

Image support requires the agent to advertise the "image" prompt capability during ACP initialize; opencode advertises it when attached to a multimodal-capable model. If the capability isn't advertised, the SDK rejects image blocks with ErrCapabilityUnavailable before the prompt reaches opencode.

go run ./examples/multimodal_input

Source Files ¶

View all Source files

main.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL