Documentation
Index ¶
- Constants
- type ChatRequest
- type Effort
- type Options
- type PluginConfig
- type QueryLifecycleNotifier
- type SessionMetricsRecorder
- type SystemPromptPreset
- type ThinkingConfig
- type ThinkingConfigAdaptive
- type ThinkingConfigDisabled
- type ThinkingConfigEnabled
- type ToolsConfig
- type ToolsList
- type ToolsPreset
- type Transport
- type VLLMAPIMode
Constants ¶
const DefaultBaseURL = "http://127.0.0.1:8000/v1"
DefaultBaseURL is the default vLLM OpenAI-compatible base URL.
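A common first step is joining the base URL with an endpoint path. The sketch below is self-contained (it redeclares the documented constant locally) and shows one careful way to do the join without doubling slashes; the `chat/completions` path matches the endpoint named under VLLMAPIMode.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// DefaultBaseURL mirrors the documented constant.
const DefaultBaseURL = "http://127.0.0.1:8000/v1"

// endpoint joins a base URL and an API path, normalizing the slashes
// between them so "/v1" + "/chat/completions" yields "/v1/chat/completions".
func endpoint(base, path string) (string, error) {
	u, err := url.Parse(base)
	if err != nil {
		return "", err
	}
	u.Path = strings.TrimRight(u.Path, "/") + "/" + strings.TrimLeft(path, "/")
	return u.String(), nil
}

func main() {
	ep, _ := endpoint(DefaultBaseURL, "chat/completions")
	fmt.Println(ep) // http://127.0.0.1:8000/v1/chat/completions
}
```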
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type ChatRequest ¶
type ChatRequest struct {
Model string
Models []string
Messages []map[string]any
Tools []map[string]any
Stream bool
ToolChoice any
MaxTokens *int
MaxOutputTokens *int
Temperature *float64
TopP *float64
TopK *float64
PresencePenalty *float64
FrequencyPenalty *float64
Seed *int64
Stop []string
Logprobs *bool
TopLogprobs *int
ParallelToolCalls *bool
ResponseFormat map[string]any
ResponseText map[string]any
Metadata map[string]any
Provider map[string]any
Plugins []map[string]any
Route string
Reasoning map[string]any
SessionID string
Trace *bool
Modalities []string
ImageConfig map[string]any
User string
Instructions string
PreviousResponseID string
PromptCacheKey string
MaxToolCalls *int
ServiceTier string
Truncation string
Include []string
Background *bool
SafetyIdentifier string
Store *bool
Prompt map[string]any
Extra map[string]any
}
ChatRequest is the normalized vLLM chat request used by transports.
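Note that many ChatRequest fields are pointers (`*int`, `*float64`, `*bool`) so that "unset" is distinguishable from an explicit zero value. The sketch below uses a trimmed local mirror of the struct (a few of the documented fields only, for illustration) and a generic pointer helper:

```go
package main

import "fmt"

// ChatRequest here is a trimmed local mirror of the documented struct,
// reduced to a handful of fields for illustration.
type ChatRequest struct {
	Model       string
	Messages    []map[string]any
	Stream      bool
	MaxTokens   *int
	Temperature *float64
}

// ptr returns a pointer to v, letting optional fields distinguish
// "explicitly set to zero" from "not set at all".
func ptr[T any](v T) *T { return &v }

func main() {
	req := ChatRequest{
		Model: "my-model", // hypothetical model name
		Messages: []map[string]any{
			{"role": "user", "content": "Hello"},
		},
		Stream:      true,
		MaxTokens:   ptr(256),
		Temperature: ptr(0.2),
	}
	fmt.Println(req.Model, *req.MaxTokens, *req.Temperature)
}
```

A nil pointer field is simply omitted from the outgoing request, whereas `ptr(0)` sends an explicit zero.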
type Options ¶
type Options struct {
Logger *slog.Logger
SystemPrompt string
SystemPromptPreset *SystemPromptPreset
Model string
PermissionMode string
MaxTurns int
Cwd string
User string
Hooks map[hook.Event][]*hook.Matcher
Thinking ThinkingConfig
Effort *Effort
IncludePartialMessages bool
MaxBudgetUSD *float64
MCPServers map[string]mcp.ServerConfig
MCPConfig string
Tools ToolsConfig
AllowedTools []string
DisallowedTools []string
CanUseTool permission.Callback
OnUserInput userinput.Callback
Resume string
ForkSession bool
SessionStorePath string
FallbackModel string
PermissionPromptToolName string
Plugins []*PluginConfig
OutputFormat map[string]any
EnableFileCheckpointing bool
Transport Transport
// Observability
MeterProvider metric.MeterProvider
TracerProvider trace.TracerProvider
PrometheusRegisterer prometheus.Registerer
// MetricsRecorder is the internal observability recorder created from OTel providers.
// This field is set by the SDK at runtime; users should not set it directly.
MetricsRecorder SessionMetricsRecorder
// Observer is the shared observability helper used for SDK-level span and
// duration instrumentation beyond message-based recording (hook dispatch,
// explicit tool spans, etc.). Set by the SDK at runtime alongside
// MetricsRecorder; consumers should not set this directly.
Observer *observability.Observer
// VLLM specific
APIKey string
BaseURL string
VLLMAPIMode VLLMAPIMode
HTTPReferer string
XTitle string
RequestTimeout *time.Duration
MaxToolIterations int
// VLLM request fields
VLLMTopP *float64
VLLMTemperature *float64
VLLMMaxTokens *int
VLLMTopK *float64
VLLMPresencePenalty *float64
VLLMFrequencyPenalty *float64
VLLMSeed *int64
VLLMStop []string
VLLMLogprobs *bool
VLLMTopLogprobs *int
VLLMParallelToolCalls *bool
VLLMToolChoice any
VLLMProvider map[string]any
VLLMPlugins []map[string]any
VLLMRoute string
VLLMReasoning map[string]any
VLLMSessionID string
VLLMTrace *bool
VLLMModalities []string
VLLMImageConfig map[string]any
VLLMModels []string
VLLMMetadata map[string]any
VLLMInstructions string
VLLMPreviousResponseID string
VLLMPromptCacheKey string
VLLMPrompt map[string]any
VLLMText map[string]any
VLLMMaxOutputTokens *int
VLLMMaxToolCalls *int
VLLMServiceTier string
VLLMTruncation string
VLLMInclude []string
VLLMBackground *bool
VLLMSafetyIdentifier string
VLLMStore *bool
VLLMExtra map[string]any
}
Options contains all SDK options.
func (*Options) ApplyDefaults ¶
func (o *Options) ApplyDefaults()
ApplyDefaults fills missing option defaults.
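The exact defaults are not documented beyond DefaultBaseURL, but the fill-if-zero pattern can be sketched with a trimmed local mirror of Options. The MaxToolIterations default of 10 below is an assumption for illustration, not a documented value:

```go
package main

import "fmt"

// DefaultBaseURL mirrors the documented constant.
const DefaultBaseURL = "http://127.0.0.1:8000/v1"

// Options is a trimmed local mirror; the real struct has many more fields.
type Options struct {
	BaseURL           string
	MaxToolIterations int
}

// ApplyDefaults sketches the documented fill-missing-defaults behavior:
// only zero-valued fields are overwritten, so caller-supplied values win.
func (o *Options) ApplyDefaults() {
	if o.BaseURL == "" {
		o.BaseURL = DefaultBaseURL
	}
	if o.MaxToolIterations == 0 {
		o.MaxToolIterations = 10 // assumed default for illustration
	}
}

func main() {
	o := Options{MaxToolIterations: 3}
	o.ApplyDefaults()
	fmt.Println(o.BaseURL, o.MaxToolIterations) // BaseURL filled, 3 kept
}
```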
type PluginConfig ¶
PluginConfig configures a plugin to load.
type QueryLifecycleNotifier ¶ added in v0.0.2
type QueryLifecycleNotifier interface {
MarkQueryStart()
}
QueryLifecycleNotifier is optionally implemented by SessionMetricsRecorder implementations that need query lifecycle notifications for TTFT tracking.
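This is Go's optional-interface upgrade pattern: the SDK type-asserts the recorder and calls MarkQueryStart only when supported. A self-contained sketch, using local mirrors of the two interfaces (the real SessionMetricsRecorder has methods not shown here):

```go
package main

import "fmt"

// SessionMetricsRecorder stands in for the documented recorder interface;
// its real methods are omitted in this local mirror.
type SessionMetricsRecorder interface{}

// QueryLifecycleNotifier mirrors the documented optional interface.
type QueryLifecycleNotifier interface {
	MarkQueryStart()
}

// recorder implements both interfaces.
type recorder struct{ started int }

func (r *recorder) MarkQueryStart() { r.started++ }

// notifyQueryStart shows the upgrade: assert for the optional interface
// and only call MarkQueryStart when the recorder supports it.
func notifyQueryStart(rec SessionMetricsRecorder) bool {
	if n, ok := rec.(QueryLifecycleNotifier); ok {
		n.MarkQueryStart()
		return true
	}
	return false
}

func main() {
	r := &recorder{}
	fmt.Println(notifyQueryStart(r), r.started) // true 1
}
```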
type SessionMetricsRecorder ¶ added in v0.0.2
SessionMetricsRecorder is the narrow observability interface used by the SDK runtime. When configured via WithMeterProvider or WithTracerProvider, the SDK creates a recorder that emits OpenTelemetry metrics and traces at existing observation points. The context parameter enables trace correlation and exemplar propagation.
type SystemPromptPreset ¶
type SystemPromptPreset struct {
Type string `json:"type"` // "preset"
Preset string `json:"preset"` // backend-defined preset identifier
Append *string `json:"append,omitempty"`
}
SystemPromptPreset defines a system prompt preset configuration.
type ThinkingConfig ¶
type ThinkingConfig interface {
// contains filtered or unexported methods
}
ThinkingConfig is a marker interface for thinking settings.
type ThinkingConfigAdaptive ¶
type ThinkingConfigAdaptive struct{}
ThinkingConfigAdaptive enables adaptive thinking.
type ThinkingConfigDisabled ¶
type ThinkingConfigDisabled struct{}
ThinkingConfigDisabled disables thinking.
type ThinkingConfigEnabled ¶
type ThinkingConfigEnabled struct {
BudgetTokens int
}
ThinkingConfigEnabled enables thinking with a token budget.
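The three variants are selected with a type switch on the marker interface. A self-contained sketch using local mirrors (the real package uses an unexported method; `isThinking` stands in for it here):

```go
package main

import "fmt"

// ThinkingConfig mirrors the documented marker interface; isThinking
// stands in for the package's unexported method.
type ThinkingConfig interface{ isThinking() }

type ThinkingConfigDisabled struct{}
type ThinkingConfigAdaptive struct{}
type ThinkingConfigEnabled struct{ BudgetTokens int }

func (ThinkingConfigDisabled) isThinking() {}
func (ThinkingConfigAdaptive) isThinking() {}
func (ThinkingConfigEnabled) isThinking()  {}

// describe shows how a consumer might dispatch on the concrete variant.
func describe(c ThinkingConfig) string {
	switch t := c.(type) {
	case ThinkingConfigDisabled:
		return "disabled"
	case ThinkingConfigAdaptive:
		return "adaptive"
	case ThinkingConfigEnabled:
		return fmt.Sprintf("enabled (budget %d tokens)", t.BudgetTokens)
	default:
		return "unknown"
	}
}

func main() {
	fmt.Println(describe(ThinkingConfigEnabled{BudgetTokens: 2048}))
}
```

The marker-interface design means the set of variants is closed: only types in the package can implement the unexported method.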
type ToolsConfig ¶
type ToolsConfig interface {
// contains filtered or unexported methods
}
ToolsConfig is an interface for configuring available tools.
type ToolsPreset ¶
type ToolsPreset struct {
Type string `json:"type"` // "preset"
Preset string `json:"preset"` // backend-defined preset identifier
}
ToolsPreset represents a preset configuration for available tools.
type Transport ¶
type Transport interface {
Start(ctx context.Context) error
CreateStream(ctx context.Context, req *ChatRequest) (<-chan map[string]any, <-chan error)
Close() error
}
Transport defines the runtime transport interface.
type VLLMAPIMode ¶
type VLLMAPIMode string
VLLMAPIMode is retained for public compatibility. The vLLM backend serves chat-completions requests and may emulate responses-mode behavior locally where practical.
const (
	// VLLMAPIModeChatCompletions uses /chat/completions.
	VLLMAPIModeChatCompletions VLLMAPIMode = "chat_completions"
	// VLLMAPIModeResponses uses /responses.
	VLLMAPIModeResponses VLLMAPIMode = "responses"
)
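A sketch of mapping a mode to its documented endpoint path (type and constants mirrored locally so the example is self-contained; unknown modes fall back to chat completions, which matches the backend's documented default behavior):

```go
package main

import "fmt"

// VLLMAPIMode mirrors the documented string type and constants.
type VLLMAPIMode string

const (
	VLLMAPIModeChatCompletions VLLMAPIMode = "chat_completions"
	VLLMAPIModeResponses       VLLMAPIMode = "responses"
)

// pathFor maps a mode to its documented endpoint path, defaulting
// to /chat/completions for anything else.
func pathFor(m VLLMAPIMode) string {
	if m == VLLMAPIModeResponses {
		return "/responses"
	}
	return "/chat/completions"
}

func main() {
	fmt.Println(pathFor(VLLMAPIModeChatCompletions), pathFor(VLLMAPIModeResponses))
}
```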