llm

package
v0.8.0
Published: Mar 25, 2026 License: MIT Imports: 9 Imported by: 2

README

github.com/tyemirov/utils/llm

Overview

llm provides the reusable plumbing for chat-completion-based flows. It exposes:

  • Message, ChatRequest, and the ChatClient interface so call sites can build prompts without depending on HTTP details.
  • Client, a minimal HTTP adapter to /chat/completions with request validation, timeout handling, and error trimming.
  • Factory, a wrapper that layers configurable retry/backoff semantics and still satisfies the ChatClient interface.
  • Helper types (Config, ResponseFormat, RetryPolicy, etc.) so packages can describe their needs declaratively.

The package is dependency-free beyond the Go standard library.

Usage

Construct a client (or factory) with llm.Config:

llmConfig := llm.Config{
	BaseURL:             os.Getenv("OPENAI_BASE_URL"),
	APIKey:              os.Getenv("OPENAI_API_KEY"),
	Model:               "gpt-4.1-mini",
	MaxCompletionTokens: 512,
	Temperature:         0.2,
	HTTPClient:          &http.Client{Timeout: 30 * time.Second},
	RequestTimeout:      60 * time.Second,
	RetryAttempts:       3,
	RetryInitialBackoff: 200 * time.Millisecond,
	RetryMaxBackoff:     2 * time.Second,
	RetryBackoffFactor:  2,
}
  • Use client, err := llm.NewClient(llmConfig) when you want a single request/response without automatic retries.
  • Use factory, err := llm.NewFactory(llmConfig) when you want retry/backoff semantics. The factory still implements ChatClient.

Documentation

Overview

Package llm provides reusable client and factory implementations for chat-based large language model integrations. It exposes lightweight interfaces (`ChatClient`), request/response types (`Message`, `ChatRequest`), and configurable HTTP plumbing so higher-level packages can inject or mock LLM dependencies without duplicating API code.

Index

Constants

This section is empty.

Variables

var ErrEmptyResponse = errors.New("llm returned empty response")

ErrEmptyResponse indicates the LLM returned only whitespace or an empty payload.

Functions

This section is empty.

Types

type ChatClient

type ChatClient interface {
	Chat(ctx context.Context, request ChatRequest) (string, error)
}

ChatClient issues chat completion requests.

type ChatRequest

type ChatRequest struct {
	Model          string
	Messages       []Message
	MaxTokens      int
	Temperature    *float64
	ResponseFormat *ResponseFormat
}

ChatRequest describes a chat completion request.

type Client

type Client struct {
	// contains filtered or unexported fields
}

Client communicates with an LLM chat completion endpoint.

func NewClient

func NewClient(configuration Config) (*Client, error)

NewClient constructs a client from configuration.

func (*Client) Chat

func (client *Client) Chat(ctx context.Context, request ChatRequest) (string, error)

Chat returns a trimmed response string for the provided request.

type Config

type Config struct {
	BaseURL             string
	APIKey              string
	Model               string
	MaxCompletionTokens int
	Temperature         float64
	HTTPClient          HTTPClient
	RequestTimeout      time.Duration
	RetryAttempts       int
	RetryInitialBackoff time.Duration
	RetryMaxBackoff     time.Duration
	RetryBackoffFactor  float64
}

Config configures an LLM client.

type Factory

type Factory struct {
	// contains filtered or unexported fields
}

Factory constructs chat clients and enforces retry/backoff behaviour.

func NewFactory

func NewFactory(configuration Config, options ...FactoryOption) (*Factory, error)

NewFactory builds a Factory using the provided configuration.

func (*Factory) Chat

func (factory *Factory) Chat(ctx context.Context, request ChatRequest) (string, error)

Chat executes the chat request with retry/backoff safeguards.

type FactoryOption

type FactoryOption func(*Factory)

FactoryOption customises factory behaviour.

func WithRetryPolicy

func WithRetryPolicy(policy RetryPolicy) FactoryOption

WithRetryPolicy overrides the retry policy applied by the factory.

func WithSleepFunc

func WithSleepFunc(fn SleepFunc) FactoryOption

WithSleepFunc overrides the sleep function used between retries (intended for tests).

type HTTPClient

type HTTPClient interface {
	Do(request *http.Request) (*http.Response, error)
}

HTTPClient issues HTTP requests.

type Message

type Message struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

Message represents a chat message role/content pair.

type ResponseFormat

type ResponseFormat struct {
	Type   string          `json:"type"`
	Name   string          `json:"name"`
	Schema json.RawMessage `json:"schema,omitempty"`
	Strict bool            `json:"strict,omitempty"`
}

ResponseFormat configures structured responses.

type RetryPolicy

type RetryPolicy struct {
	MaxAttempts       int
	InitialBackoff    time.Duration
	MaxBackoff        time.Duration
	BackoffMultiplier float64
}

RetryPolicy controls retry behaviour for chat completion requests.

type SleepFunc

type SleepFunc func(ctx context.Context, duration time.Duration) error

SleepFunc waits for the provided duration or returns early if the context is cancelled.
