gorag

module

v0.5.0 Latest Latest Go to latest Published: Mar 6, 2026 License: MIT

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/DotNetAge/gorag

Links

Open Source Insights

README ¶

GoRAG

GoRAG - Production-ready RAG (Retrieval-Augmented Generation) framework for Go

Features

High Performance - Built for production with low latency and high throughput
Modular Design - Pluggable parsers, vector stores, and LLM providers
Cloud Native - Kubernetes friendly, single binary deployment
Type Safe - Full type safety with Go's strong typing
Production Ready - Observability, metrics, and error handling built-in
Hybrid Retrieval - Combine vector and keyword search for better results
Reranking - LLM-based result reranking for improved relevance
Streaming Responses - Real-time streaming for better user experience
Plugin System - Extensible architecture for custom functionality
CLI Tool - Command-line interface for easy usage
Comprehensive Testing - 85%+ test coverage with integration tests using Testcontainers
Multi-modal Support - Process images and other media types
Excel and PPT Parsing - Support for Excel and PowerPoint files
Domestic LLM Support - Support for popular Chinese LLMs (qwen, seed2, minmax, kimi, glm5, etc.)
Azure OpenAI Support - Full support for Azure OpenAI Service
Configuration Management - Flexible YAML and environment variable configuration
Custom Prompt Templates - Create custom prompt formats with placeholders
Performance Benchmarks - Built-in benchmarking for performance optimization

Quick Start

package main

import (
    "context"
    "log"
    "os"
    
    embedder "github.com/DotNetAge/gorag/embedding/openai"
    llm "github.com/DotNetAge/gorag/llm/openai"
    "github.com/DotNetAge/gorag/parser/text"
    "github.com/DotNetAge/gorag/rag"
    "github.com/DotNetAge/gorag/vectorstore/memory"
)

func main() {
    ctx := context.Background()
    apiKey := os.Getenv("OPENAI_API_KEY")
    
    // Create RAG engine
    embedderInstance, _ := embedder.New(embedder.Config{APIKey: apiKey})
    llmInstance, _ := llm.New(llm.Config{APIKey: apiKey})
    
    engine, err := rag.New(
        rag.WithParser(text.NewParser()),
        rag.WithVectorStore(memory.NewStore()),
        rag.WithEmbedder(embedderInstance),
        rag.WithLLM(llmInstance),
    )
    if err != nil {
        log.Fatal(err)
    }
    
    // Index documents
    err = engine.Index(ctx, rag.Source{
        Type: "text",
        Content: "Go is an open source programming language...",
    })
    
    // Query with custom prompt template
    resp, err := engine.Query(ctx, "What is Go?", rag.QueryOptions{
        TopK: 5,
        PromptTemplate: "You are a helpful assistant. Based on the following context:\n\n{context}\n\nAnswer the question: {question}",
    })
    
    log.Println(resp.Answer)
}

Installation

go get github.com/DotNetAge/gorag

Architecture

┌─────────────────────────────────────────────────────────┐
│                    GoRAG                                 │
├─────────────────────────────────────────────────────────┤
│  ┌─────────────┐  ┌─────────────┐  ┌─────────────┐     │
│  │  Document   │  │   Vector    │  │    LLM      │     │
│  │   Parser    │  │   Store     │  │   Client    │     │
│  └──────┬──────┘  └──────┬──────┘  └──────┬──────┘     │
│         └─────────────────┼─────────────────┘           │
│                           ▼                           │
│                  ┌─────────────────┐                    │
│                  │   RAG Engine    │                    │
│                  └─────────────────┘                    │
└─────────────────────────────────────────────────────────┘

Modules

parser - Document parsers (Text, PDF, DOCX, HTML, JSON, YAML, Excel, PPT, Image, etc.)
vectorstore - Vector storage backends (Memory, Milvus, Qdrant, Pinecone, Weaviate, etc.)
embedding - Embedding providers (OpenAI, Ollama, etc.)
llm - LLM clients (OpenAI, Anthropic, Azure OpenAI, Ollama, Domestic LLMs, etc.)
rag - RAG engine and orchestration
plugins - Plugin system for extending functionality
config - Configuration management system

CLI Tool

GoRAG includes a command-line interface for easy usage:

# Install
go install github.com/DotNetAge/gorag/cmd/gorag@latest

# Index documents
gorag index --api-key $OPENAI_API_KEY "Go is an open source programming language..."

# Index from file
gorag index --api-key $OPENAI_API_KEY --file README.md

# Query the engine
gorag query --api-key $OPENAI_API_KEY "What is Go?"

# Stream responses
gorag query --api-key $OPENAI_API_KEY --stream "What are the key features of Go?"

# Use custom prompt template
gorag query --api-key $OPENAI_API_KEY --prompt "You are a helpful assistant. Answer the question: {question}"

# Export indexed documents
gorag export --api-key $OPENAI_API_KEY --file export.json

# Import documents
gorag import --api-key $OPENAI_API_KEY --file export.json

Configuration

GoRAG supports flexible configuration through YAML files and environment variables:

YAML Configuration

Create a config.yaml file:

rag:
  topK: 5
  chunkSize: 1000
  chunkOverlap: 100

embedding:
  provider: "openai"
  openai:
    apiKey: "your-api-key"
    model: "text-embedding-ada-002"

llm:
  provider: "openai"
  openai:
    apiKey: "your-api-key"
    model: "gpt-4"

vectorstore:
  type: "milvus"
  milvus:
    host: "localhost"
    port: 19530

logging:
  level: "info"
  format: "json"

metrics:
  enabled: true
  port: 9090

Environment Variables

export GORAG_RAG_TOPK=5
export GORAG_EMBEDDING_PROVIDER=openai
export GORAG_LLM_PROVIDER=openai
export GORAG_VECTORSTORE_TYPE=memory
export GORAG_OPENAI_API_KEY=your-api-key
export GORAG_ANTHROPIC_API_KEY=your-api-key
export GORAG_PINECONE_API_KEY=your-api-key

Examples

Basic - Simple RAG usage example
Advanced - Advanced features including streaming and hybrid retrieval
Web - HTTP API server example

Testing

GoRAG has comprehensive test coverage with both unit tests and integration tests:

Test Coverage

Overall Coverage: 85%+ across all modules
Unit Tests: All core modules have comprehensive unit tests
Integration Tests: Real-world testing with actual vector databases using Testcontainers
Performance Benchmarks: Built-in benchmarks for Index and Query operations

Running Tests

# Run all unit tests
go test ./...

# Run integration tests (requires Docker)
go test -v ./integration_test/...

# Run tests with coverage
go test -cover ./...

# Run benchmarks
go test -bench=. ./rag/

Integration Testing

Integration tests use Testcontainers to spin up real instances of:

Milvus - Vector database for production workloads
Qdrant - High-performance vector search engine
Weaviate - Semantic search engine with GraphQL API

This ensures that GoRAG works correctly with actual vector databases in production environments.

Documentation

Roadmap

Completed (v0.5.0)

More vector store integrations (Milvus, Qdrant, Weaviate)
Advanced retrieval strategies (Hybrid, Reranking)
Streaming responses
Multi-modal support (Image, Audio)
Plugin system
CLI tool
Comprehensive test coverage (85%+)
Integration tests with Testcontainers
Excel and PPT parsing
Domestic LLM support
Azure OpenAI client
Configuration management
Custom prompt templates
Performance benchmarks
Production deployment guides

Planned (v1.0.0)

Multi-tenancy support
Advanced security features
Rate limiting
Authentication/Authorization
Graph RAG support

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

License

MIT License - see LICENSE for details.

Changelog

See CHANGELOG.md for version history and changes.

Directories ¶

Path	Synopsis
cmd
gorag command
config
core
embedding
ollama
openai
examples
advanced command
basic command
web command
integration_test
testcontainers
llm
anthropic
azureopenai
compatible
ollama
openai
observability
parser
docx
excel
html
image
json
pdf
ppt
text
yaml
plugins
rag
retrieval
utils
vectorstore
memory
milvus
pinecone
qdrant
weaviate

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL