embedding

package

v1.0.2 Latest Latest Go to latest Published: Mar 9, 2026 License: MIT Imports: 6 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/DotNetAge/gorag

Links

Open Source Insights

Documentation ¶

Index ¶

type BatchOptions
- func DefaultBatchOptions() BatchOptions
type BatchProcessor
- func NewBatchProcessor(provider Provider, opts BatchOptions) *BatchProcessor
type BatchProvider
- func NewBatchProvider(provider Provider, opts BatchOptions) *BatchProvider
type ProgressCallback
type Provider
type RetryConfig
- func DefaultRetryConfig() RetryConfig
type Stats

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

This section is empty.

Types ¶

type BatchOptions ¶ added in v1.0.1

type BatchOptions struct {
	BatchSize   int
	MaxWorkers  int
	RateLimit   time.Duration // Minimum time between requests
	RetryConfig RetryConfig
}

BatchOptions configures batch processing

func DefaultBatchOptions ¶ added in v1.0.1

func DefaultBatchOptions() BatchOptions

DefaultBatchOptions returns default batch options

type BatchProcessor ¶ added in v1.0.1

type BatchProcessor struct {
	// contains filtered or unexported fields
}

BatchProcessor handles batch processing of embeddings with optimization

func NewBatchProcessor ¶ added in v1.0.1

func NewBatchProcessor(provider Provider, opts BatchOptions) *BatchProcessor

NewBatchProcessor creates a new batch processor

func (*BatchProcessor) Close ¶ added in v1.0.2

func (bp *BatchProcessor) Close()

Close stops the rate limiter ticker and releases resources

func (*BatchProcessor) Process ¶ added in v1.0.1

func (bp *BatchProcessor) Process(ctx context.Context, texts []string) ([][]float32, error)

Process processes texts in optimized batches

func (*BatchProcessor) ProcessWithProgress ¶ added in v1.0.1

func (bp *BatchProcessor) ProcessWithProgress(
	ctx context.Context,
	texts []string,
	callback ProgressCallback,
) ([][]float32, error)

ProcessWithProgress processes with progress tracking

type BatchProvider ¶ added in v1.0.1

type BatchProvider struct {
	// contains filtered or unexported fields
}

BatchProvider wraps a Provider with batch processing optimization

func NewBatchProvider ¶ added in v1.0.1

func NewBatchProvider(provider Provider, opts BatchOptions) *BatchProvider

NewBatchProvider creates a new batch provider

func (*BatchProvider) Dimension ¶ added in v1.0.1

func (bp *BatchProvider) Dimension() int

Dimension returns the embedding dimension Implements the Provider interface

func (*BatchProvider) Embed ¶ added in v1.0.1

func (bp *BatchProvider) Embed(ctx context.Context, texts []string) ([][]float32, error)

Embed generates embeddings with batch processing optimization Implements the Provider interface

func (*BatchProvider) GetProcessor ¶ added in v1.0.1

func (bp *BatchProvider) GetProcessor() *BatchProcessor

GetProcessor returns the underlying batch processor

func (*BatchProvider) ProcessWithProgress ¶ added in v1.0.1

func (bp *BatchProvider) ProcessWithProgress(
	ctx context.Context,
	texts []string,
	callback ProgressCallback,
) ([][]float32, error)

ProcessWithProgress processes embeddings with progress tracking

type ProgressCallback ¶ added in v1.0.1

type ProgressCallback func(processed, total int)

ProgressCallback is called during batch processing

type Provider ¶

type Provider interface {
	// Embed generates embeddings for the given texts
	//
	// Parameters:
	// - ctx: Context for cancellation and timeout
	// - texts: Slice of texts to embed
	//
	// Returns:
	// - [][]float32: Slice of embeddings, one for each text
	// - error: Error if embedding generation fails
	Embed(ctx context.Context, texts []string) ([][]float32, error)

	// Dimension returns the dimension of the embeddings generated by this provider
	//
	// Returns:
	// - int: Embedding dimension (e.g., 1536 for text-embedding-ada-002)
	Dimension() int
}

Provider defines the interface for embedding providers

This interface is implemented by all embedding model providers (OpenAI, Ollama, Cohere, Voyage) and allows the RAG engine to generate embeddings for text chunks.

Example implementation:

type OpenAIProvider struct {
    client *openai.Client
    model  string
}

func (p *OpenAIProvider) Embed(ctx context.Context, texts []string) ([][]float32, error) {
    // Call OpenAI API to generate embeddings
}

func (p *OpenAIProvider) Dimension() int {
    return 1536 // text-embedding-ada-002 dimension
}

type RetryConfig ¶ added in v1.0.1

type RetryConfig struct {
	MaxRetries int
	BaseDelay  time.Duration
	MaxDelay   time.Duration
	Multiplier float64
}

RetryConfig configures retry behavior

func DefaultRetryConfig ¶ added in v1.0.1

func DefaultRetryConfig() RetryConfig

DefaultRetryConfig returns default retry configuration

type Stats ¶ added in v1.0.1

type Stats struct {
	TotalBatches     int
	ProcessedBatches int
	FailedBatches    int
	TotalTexts       int
	ProcessingTime   time.Duration
}

Stats returns batch processor statistics

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
cohere
ollama
openai
voyage

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL