rag-with-validation

command

v0.10.2 Latest Latest Go to latest Published: Jul 24, 2025 License: MIT Imports: 19 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/sevigo/goframe

Links

Open Source Insights

README ¶

Gemini and Ollama: A Validated RAG Example with MS-MARCO

This example demonstrates an advanced Retrieval-Augmented Generation (RAG) pattern that uses two different LLMs: a fast, local Ollama model for validation, and a powerful Google Gemini model for final answer generation. It uses the MS-MARCO dataset to provide a real-world knowledge base and a diverse set of questions for testing.

The Concept: Validated RAG

Standard RAG pipelines retrieve documents and immediately use them as context. This can be inefficient if the retrieved documents are irrelevant. This example implements a "Validated RAG" pipeline:

Retrieve: Fetch relevant passages from a vector store (Qdrant) populated with the MS-MARCO dataset.
Validate: Use a fast, local LLM (Ollama with gemma3:1b) to check if the retrieved context is actually relevant to the user's question. This acts as a "gatekeeper."
Generate:
- If the context is valid, send it along with the question to a powerful generator LLM (Google Gemini) for a high-quality, context-aware answer.
- If the context is invalid, discard it and send only the question to Gemini, allowing it to use its general knowledge.

This pattern improves accuracy by preventing "context stuffing" with irrelevant information and can reduce costs by avoiding unnecessary work by the larger model.

Features Demonstrated

Using multiple, distinct LLMs for different tasks in a single pipeline.
Implementing a custom chain (ValidatingRetrievalChain) with conditional logic.
Ingesting a real-world dataset (MS-MARCO) into a vector store.
Testing the pipeline against a random sample of 10 questions from the dataset.

Prerequisites

Go 1.21+
Ollama: Must be running locally.
Qdrant: Must be running locally (e.g., via Docker).
Gemini API Key: Your key must be set as an environment variable.
MS-MARCO Dataset: The example expects the dataset to be at testdata/rag_dataset/msmarco/ms_marco_1000.csv. Make sure this file exists.

Setup

Start Services:

# Start Qdrant
docker run -p 6333:6333 -p 6334:6334 qdrant/qdrant

# Ensure Ollama is running

Pull Ollama Models: This example uses two models from Ollama.
```
ollama pull nomic-embed-text
ollama pull gemma3:1b
```
Set Gemini API Key:
```
export GEMINI_API_KEY="YOUR_API_KEY"
```

How to Run

Execute the main.go file from the root of the project:

go run ./examples/rag-with-validation/main.go

Documentation ¶

There is no documentation for this package.

Source Files ¶

View all Source files

main.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL