ai-provider-kit

module

v1.0.40 Latest Latest Go to latest Published: Dec 12, 2025 License: MIT

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/cecil-the-coder/ai-provider-kit

Links

Open Source Insights

README ¶

AI Provider Kit

A comprehensive Go library for creating and managing AI provider instances with support for multiple AI services.

Features

Multi-Provider Support: OpenAI, Anthropic, Gemini, Cerebras, OpenRouter, Qwen, and more
Interface Segregation: Focused interfaces for specific capabilities
Standardized Core API: Consistent request/response patterns across providers
Provider Extensions: Preserve provider-specific capabilities while using standardized API
Multimodal Content: First-class support for images, documents, and audio with automatic provider translation
Factory Pattern: Dynamic provider creation and configuration
Authentication Support: API keys, OAuth 2.0, and custom authentication methods
Health Monitoring: Automatic health checks and metrics collection
Load Balancing: Multiple API keys with round-robin distribution
Extensible: Easy to add new providers through the factory pattern
Local Models: Support for LM Studio, Ollama, and Llama.cpp
Streaming: Real-time response streaming
Tool Calling: Built-in support for function calling
Comprehensive Testing: 100% test coverage with unit and integration tests

Architecture

The AI Provider Kit features a modern, flexible architecture with interface segregation and standardized patterns:

Interface Segregation: 9 focused interfaces instead of a monolithic provider interface
Standardized Core API: Universal request/response types with provider-specific extensions
Extensible Design: Easy to add new providers and capabilities

For complete architecture documentation, see the Architecture Guide.

Installation

go get github.com/cecil-the-coder/ai-provider-kit@latest

Quick Start

package main

import (
    "context"
    "log"

    "github.com/cecil-the-coder/ai-provider-kit/pkg/factory"
    "github.com/cecil-the-coder/ai-provider-kit/pkg/types"
)

func main() {
    // Create a new provider factory
    f := factory.NewProviderFactory()

    // Initialize default providers (with real implementations)
    factory.RegisterDefaultProviders(f)

    // Configure an OpenAI provider
    config := types.ProviderConfig{
        Type:         types.ProviderTypeOpenAI,
        Name:         "openai-primary",
        APIKey:       "your-api-key",
        DefaultModel: "gpt-4",
    }

    // Create provider instance
    provider, err := f.CreateProvider(types.ProviderTypeOpenAI, config)
    if err != nil {
        log.Fatal(err)
    }

    // Generate completion
    stream, err := provider.GenerateChatCompletion(context.Background(), types.GenerateOptions{
        Prompt: "Hello, world!",
        Stream: false,
    })
    if err != nil {
        log.Fatal(err)
    }
    defer stream.Close()
}

Supported Providers

Cloud Providers

OpenAI: GPT models, embeddings, and more
Anthropic: Claude models
Google Gemini: Gemini models
Cerebras: High-performance inference
OpenRouter: Multi-provider routing
Qwen: Alibaba Cloud models

Local Providers

LM Studio: Local model serving
Ollama: Local model management
Llama.cpp: Efficient local inference

Configuration

Each provider can be configured with the following options:

type ProviderConfig struct {
    Type              ProviderType
    Name              string
    APIKey            string
    BaseURL           string
    DefaultModel      string
    Timeout           time.Duration
    MaxRetries        int
    RateLimit         RateLimitConfig
    OAuth             *OAuthConfig
    Headers           map[string]string
}

Authentication

API Key Authentication

config := types.ProviderConfig{
    Type:   types.ProviderTypeOpenAI,
    APIKey: "your-api-key-here",
}

OAuth 2.0 Authentication

config := types.ProviderConfig{
    Type: types.ProviderTypeAnthropic,
    OAuth: &types.OAuthConfig{
        ClientID:     "your-client-id",
        ClientSecret: "your-client-secret",
        TokenURL:     "https://api.anthropic.com/oauth/token",
    },
}

Load Balancing

Configure multiple API keys for load balancing:

provider, _ := f.CreateProvider(types.ProviderTypeOpenAI, types.ProviderConfig{
    Type:   types.ProviderTypeOpenAI,
    APIKeys: []string{
        "key1",
        "key2",
        "key3",
    },
    LoadBalancer: &types.LoadBalancerConfig{
        Strategy: types.LoadBalancerStrategyRoundRobin,
    },
})

Streaming

Enable streaming for real-time responses:

stream, err := provider.GenerateChatCompletion(context.Background(), types.GenerateOptions{
    Prompt: "Explain quantum computing",
    Stream: true,
})

for {
    chunk, err := stream.Next()
    if err != nil {
        break
    }

    if chunk.Done {
        break
    }

    fmt.Print(chunk.Content)
}

Multimodal Content

ai-provider-kit provides first-class support for multimodal content including images, documents, and audio. The library handles provider-specific format translations automatically while maintaining a clean, unified API.

Design Philosophy

The multimodal system is built on primitives, not patterns. We provide the building blocks (ContentPart, MediaSource) that let you compose rich multimodal interactions without imposing high-level abstractions. This design enables you to build your own patterns - whether that's caching layers, agent delegation systems, or preprocessing pipelines.

Quick Example

import "github.com/cecil-the-coder/ai-provider-kit/pkg/types"

// Simple text message (backwards compatible)
msg := types.ChatMessage{
    Role:    "user",
    Content: "Hello, world!",
}

// Image with text using Parts
msg := types.ChatMessage{
    Role: "user",
    Parts: []types.ContentPart{
        types.NewTextPart("What's in this image?"),
        types.NewImagePart("image/png", base64Data),
    },
}

// Image from URL
msg := types.ChatMessage{
    Role: "user",
    Parts: []types.ContentPart{
        types.NewTextPart("Describe this image"),
        types.NewImageURLPart("image/jpeg", "https://example.com/image.jpg"),
    },
}

// Document (PDF)
msg := types.ChatMessage{
    Role: "user",
    Parts: []types.ContentPart{
        types.NewTextPart("Summarize this document"),
        types.NewDocumentPart("application/pdf", pdfBase64Data),
    },
}

Supported Content Types

The library supports the following content types through the ContentPart struct:

text - Plain text content
image - Images (PNG, JPEG, GIF, WebP)
document - Documents (PDF, etc.)
audio - Audio files (provider-dependent)
tool_use - Tool/function calls from the model
tool_result - Results returned from tool execution
thinking - Extended reasoning/thinking blocks

Media Sources

Media content can be provided in two ways:

// Base64-encoded data
source := &types.MediaSource{
    Type:      types.MediaSourceBase64,
    MediaType: "image/png",
    Data:      base64EncodedString,
}

// URL reference
source := &types.MediaSource{
    Type:      types.MediaSourceURL,
    MediaType: "image/jpeg",
    URL:       "https://example.com/image.jpg",
}

Helper Methods

The ChatMessage type includes several helper methods for working with multimodal content:

// Get unified content as []ContentPart (handles both Content string and Parts)
parts := message.GetContentParts()

// Extract all text content from message
text := message.GetTextContent()

// Check if message contains images
if message.HasImages() {
    // Handle image content
}

// Check if message contains any media (images, documents, audio)
if message.HasMedia() {
    // Handle media content
}

// Add content parts dynamically
message.AddContentPart(types.NewTextPart("Additional text"))
message.AddContentPart(types.NewImagePart("image/png", imageData))

Provider Support

Different providers support different content types. The library automatically handles format translation for each provider:

Content Type	Anthropic	OpenAI	Gemini
text	✅	✅	✅
image (base64)	✅	✅ (data URL)	✅ (inlineData)
image (url)	✅	✅	✅ (fileData)
document	✅	❌	✅
audio	❌	❌	✅
tool_use	✅	✅	✅
tool_result	✅	✅	✅

Provider-Specific Notes:

OpenAI: Images are converted to data URLs (data:image/png;base64,...). Documents and audio are not supported in chat completions.
Anthropic: Native support for images and PDFs through content blocks. Images and documents use the source structure.
Gemini: Most comprehensive support including audio. Images/documents use inlineData for base64 and fileData for URLs.

Advanced Use Cases

The primitive-based design enables sophisticated patterns:

Image Caching Layer

// Build your own caching system for frequently-used images
type ImageCache struct {
    cache map[string]string // URL -> base64
}

func (c *ImageCache) GetOrFetch(url string) types.ContentPart {
    if data, exists := c.cache[url]; exists {
        return types.NewImagePart("image/jpeg", data)
    }
    // Fetch and cache...
}

Agent-Based Delegation

// Route multimodal requests to specialized agents
if message.HasImages() {
    // Send to vision-specialized model
    return visionAgent.Process(message)
} else if message.HasMedia() {
    // Send to multimodal-capable model
    return multimodalAgent.Process(message)
}

Preprocessing Pipelines

// Transform content before sending to provider
for i, part := range message.Parts {
    if part.Type == types.ContentTypeImage {
        // Resize, compress, or convert format
        message.Parts[i] = preprocessImage(part)
    }
}

Backwards Compatibility

The multimodal system is fully backwards compatible. Existing code using Content string continues to work:

// Legacy approach - still works
msg := types.ChatMessage{
    Role:    "user",
    Content: "Hello!",
}

// Multimodal approach - new capability
msg := types.ChatMessage{
    Role: "user",
    Parts: []types.ContentPart{
        types.NewTextPart("Hello!"),
        types.NewImagePart("image/png", data),
    },
}

Both approaches use the same ChatMessage type, and the library handles the translation automatically through GetContentParts().

Tool Calling

ai-provider-kit provides comprehensive tool calling support across all providers with format translation, validation, and advanced control features.

Quick Example

tools := []types.Tool{
    {
        Name:        "get_weather",
        Description: "Get current weather for a location",
        InputSchema: map[string]interface{}{
            "type": "object",
            "properties": map[string]interface{}{
                "location": map[string]interface{}{
                    "type":        "string",
                    "description": "The city and state, e.g. San Francisco, CA",
                },
                "unit": map[string]interface{}{
                    "type": "string",
                    "enum": []string{"celsius", "fahrenheit"},
                },
            },
            "required": []string{"location"},
        },
    },
}

options := types.GenerateOptions{
    Prompt: "What's the weather in San Francisco?",
    Tools:  tools,
}

stream, err := provider.GenerateChatCompletion(context.Background(), options)
if err != nil {
    log.Fatal(err)
}
defer stream.Close()

chunk, err := stream.Next()
if err != nil {
    log.Fatal(err)
}

// Handle tool calls
if len(chunk.Choices) > 0 && len(chunk.Choices[0].Message.ToolCalls) > 0 {
    for _, toolCall := range chunk.Choices[0].Message.ToolCalls {
        // Execute tool locally
        result := executeToolCall(toolCall)

        // Send result back
        // (see TOOL_CALLING.md for complete multi-turn example)
    }
}

For a comprehensive guide, see TOOL_CALLING.md

Features

Universal Format: Define tools once, use across all providers
ToolChoice Control: Fine-grained control over tool selection
- auto - Model decides whether to use tools
- required - Must use at least one tool
- none - Don't use tools
- specific - Force a specific tool
Parallel Tool Calling: Handle multiple tool calls in a single response
Tool Validation: Optional validation via pkg/toolvalidator
Streaming Support: Tool calls work with streaming responses
Format Translation: Automatic conversion between provider formats

Supported Providers

All 6 providers support tool calling:

OpenAI (native format)
Anthropic (content blocks format)
Gemini (function declarations format)
Cerebras (OpenAI-compatible)
Qwen (OpenAI-compatible)
OpenRouter (OpenAI-compatible)

Health Monitoring

Monitor provider health and performance:

// Health check
err := provider.HealthCheck(context.Background())
if err != nil {
    log.Printf("Provider unhealthy: %v", err)
}

// Get metrics
metrics := provider.GetMetrics()
log.Printf("Requests: %d, Success Rate: %.2f%%",
    metrics.RequestCount,
    float64(metrics.SuccessCount)/float64(metrics.RequestCount)*100)

Custom Providers

Add your own provider implementations:

// Implement the Provider interface
type CustomProvider struct {
    // your fields
}

func (p *CustomProvider) GenerateChatCompletion(ctx context.Context, options types.GenerateOptions) (types.ChatCompletionStream, error) {
    // your implementation
}

// Register your provider
factory.RegisterProvider(types.ProviderTypeCustom, func(config types.ProviderConfig) types.Provider {
    return &CustomProvider{config: config}
})

Development

Prerequisites

Go 1.23 or later
Docker (optional)

Running Tests

# Run all tests
go test ./...

# Run with coverage
go test -coverprofile=coverage.out ./...
go tool cover -html=coverage.out

# Run benchmarks
go test -bench=. ./...

Building

# Build library
go build ./...

# Build CLI tool (if available)
go build -o ai-provider-kit ./cmd/ai-provider-kit

Docker

# Build Docker image
docker build -t ai-provider-kit .

# Run with Docker
docker run -p 8080:8080 ai-provider-kit

Documentation

Comprehensive documentation is available in the /docs directory:

Getting Started - Quick start guide
API Reference - Complete API documentation
Error Code Mapping - Error handling and troubleshooting
Best Practices - Recommended patterns
Provider Guides - Provider-specific documentation
OAuth Manager - Enterprise credential management
Metrics - Monitoring and observability

See the complete documentation index for all available guides.

Contributing

Contributions are welcome! Please see our Contributing Guidelines for details.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License.

Summary: You are free to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the software, and to permit persons to whom the software is furnished to do so, subject to the following conditions:

The copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
The software is provided "AS IS", without warranty of any kind.

For the full license text, see the LICENSE file.

Changelog

See CHANGELOG.md for a list of changes and version history.

For detailed upgrade instructions between versions, including breaking changes and API modifications, see the Version-Specific Migration Guide.

Support

Directories ¶

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

Path	Synopsis
examples
ollama command
ollama-structured-outputs command
config module
internal
http Package http provides HTTP client utilities and helpers for AI providers.	Package http provides HTTP client utilities and helpers for AI providers.
testutil Package testutil provides shared testing utilities, mocks, and fixtures for use across the ai-provider-kit test suite.	Package testutil provides shared testing utilities, mocks, and fixtures for use across the ai-provider-kit test suite.
pkg
auth Package auth provides authentication and authorization utilities for AI providers.	Package auth provides authentication and authorization utilities for AI providers.
backend Package backend provides HTTP server infrastructure for building AI-powered backend services.	Package backend provides HTTP server infrastructure for building AI-powered backend services.
backend/extensions Package extensions provides a plugin system for extending backend functionality.	Package extensions provides a plugin system for extending backend functionality.
backend/handlers Package handlers provides HTTP request handlers for the AI Provider Kit backend.	Package handlers provides HTTP request handlers for the AI Provider Kit backend.
backend/middleware Package middleware provides HTTP middleware components for the backend server.	Package middleware provides HTTP middleware components for the backend server.
backendtypes Package backendtypes defines types for backend server configuration and API communication.	Package backendtypes defines types for backend server configuration and API communication.
factory Package factory provides the provider factory pattern for creating and managing AI provider instances.	Package factory provides the provider factory pattern for creating and managing AI provider instances.
metrics Package metrics provides a centralized metrics collection system for ai-provider-kit.	Package metrics provides a centralized metrics collection system for ai-provider-kit.
oauthmanager Package oauthmanager provides OAuth credential management and rotation for AI providers.	Package oauthmanager provides OAuth credential management and rotation for AI providers.
providers/anthropic Package anthropic provides an Anthropic Claude AI provider implementation.	Package anthropic provides an Anthropic Claude AI provider implementation.
providers/anthropic/bedrock Package bedrock provides AWS Bedrock integration middleware for the Anthropic provider.	Package bedrock provides AWS Bedrock integration middleware for the Anthropic provider.
providers/anthropic/vertex Package vertex provides Google Vertex AI integration for Anthropic Claude models.	Package vertex provides Google Vertex AI integration for Anthropic Claude models.
providers/base Package base provides common functionality and utilities for AI providers.	Package base provides common functionality and utilities for AI providers.
providers/cerebras Package cerebras provides a Cerebras AI provider implementation.	Package cerebras provides a Cerebras AI provider implementation.
providers/common Package common provides shared utilities and infrastructure for AI provider implementations.	Package common provides shared utilities and infrastructure for AI provider implementations.
providers/common/auth Package auth provides authentication utilities and helper functions for AI provider implementations	Package auth provides authentication utilities and helper functions for AI provider implementations
providers/common/config Package config provides configuration utilities for AI provider implementations	Package config provides configuration utilities for AI provider implementations
providers/common/errors Package errors provides rich error context for AI provider operations.	Package errors provides rich error context for AI provider operations.
providers/common/middleware Package middleware provides a flexible middleware infrastructure for AI provider requests and responses.	Package middleware provides a flexible middleware infrastructure for AI provider requests and responses.
providers/common/models Package models provides model metadata, caching, and registry functionality for AI providers, including capability tracking and discovery.	Package models provides model metadata, caching, and registry functionality for AI providers, including capability tracking and discovery.
providers/common/retry Package retry provides retry infrastructure for AI provider implementations.	Package retry provides retry infrastructure for AI provider implementations.
providers/common/streaming Package streaming provides streaming utilities for AI provider implementations.	Package streaming provides streaming utilities for AI provider implementations.
providers/common/streaming/decoders Package decoders provides pluggable stream decoders for various streaming formats including Server-Sent Events (SSE), NDJSON, and EventStream.	Package decoders provides pluggable stream decoders for various streaming formats including Server-Sent Events (SSE), NDJSON, and EventStream.
providers/gemini Package gemini provides extension implementation for Google Gemini AI provider.	Package gemini provides extension implementation for Google Gemini AI provider.
providers/ollama Package ollama provides an Ollama AI provider implementation.	Package ollama provides an Ollama AI provider implementation.
providers/openai Package openai provides integration with OpenAI's GPT models including chat completions, streaming, tool calling, and authentication support.	Package openai provides integration with OpenAI's GPT models including chat completions, streaming, tool calling, and authentication support.
providers/openrouter Package openrouter provides an OpenRouter AI provider implementation.	Package openrouter provides an OpenRouter AI provider implementation.
providers/qwen Package qwen provides integration with Qwen (Alibaba Cloud) AI models supporting both API key and OAuth authentication, streaming, and tool calling.	Package qwen provides integration with Qwen (Alibaba Cloud) AI models supporting both API key and OAuth authentication, streaming, and tool calling.
providers/virtual Package virtual provides composite provider implementations that combine multiple underlying providers.	Package virtual provides composite provider implementations that combine multiple underlying providers.
providers/virtual/fallback Package fallback provides a virtual provider that implements fallback logic.	Package fallback provides a virtual provider that implements fallback logic.
providers/virtual/loadbalance Package loadbalance provides a virtual provider that distributes requests across multiple providers.	Package loadbalance provides a virtual provider that distributes requests across multiple providers.
providers/virtual/racing Package racing provides a virtual provider that races multiple providers concurrently.	Package racing provides a virtual provider that races multiple providers concurrently.
ratelimit Package ratelimit provides rate limiting functionality for AI provider API requests.	Package ratelimit provides rate limiting functionality for AI provider API requests.
toolvalidator Package toolvalidator provides validation utilities for tool definitions and tool calls.	Package toolvalidator provides validation utilities for tool definitions and tool calls.
types Package types defines the core types and interfaces for the AI provider kit.	Package types defines the core types and interfaces for the AI provider kit.
utils Package utils provides utility functions for token estimation, tool call validation, and embedded error detection.	Package utils provides utility functions for token estimation, tool call validation, and embedded error detection.