# VectorSearch

VectorSearch is a high-performance vector search engine that indexes AI-generated embeddings from uploaded documents, files, and text. It features a fast search interface powered by an IVF Flat index, with centroids generated using a divide-and-conquer strategy for efficient multi-core processing. It supports AVX-accelerated CPU execution, with GPU support planned. This library is meant to be imported as a vector search engine in other projects; its UI is not intended for production use.
## Design

```mermaid
graph TD
    subgraph Client
        A[Upload Document]
        B[Search Query]
    end
    subgraph Server
        C[Centroids]
    end
    subgraph Database
        D[SQLite / PostgreSQL]
    end
    subgraph AI
        E[OpenAI / Ollama]
    end
    A -->|Document| E
    B -->|Query| E
    E -->|Vector| C
    C -->|Vector| D
```
## Technologies Used
- **Cosine Similarity**

  Cosine similarity measures the similarity between two vectors by calculating the cosine of the angle between them. This is used to produce a percentage match between documents and user search queries.
- **IVF Flat Index**

  An IVF Flat index samples a set of vectors from the dataset to act as centroids, allowing search queries to be narrowed down to smaller subsets on which the vector search can occur. This indexing method enables data to be inserted without rebuilding the index (a very expensive operation in vector databases).
- **Divide and Conquer**

  The divide-and-conquer strategy subdivides the main problem into subproblems that can be solved more easily and in parallel. This addresses the scalability problem of the IVF Flat index.
- **Quantization**

  Quantization reduces the memory footprint of vector embeddings without significantly impacting result accuracy. This project scales all float64 (8-byte) and float32 (4-byte) vectors down to 1 byte per component, with weights targeting 99.8% accuracy.
- **Gonum**

  Enables AVX, AVX2, and AVX-512 CPU acceleration of cosine similarity, making it roughly 10× faster.
- **Gorgonia**

  Provides an alternative to Gonum for AVX CPU acceleration and will power GPU acceleration in the future.
- **Gorm**

  A Golang Object-Relational Mapper enabling strongly typed relational database queries.
## Development

This project has five main components:
- **Compute**

  The compute/ directory implements the cosine similarity and quantization methods.
- **Database**

  The database/ directory implements the PostgreSQL / SQLite client connection and auto-migration schema.
- **Divide and Conquer**

  The dnc/ directory implements the IVF Flat vector indexing, executed with a custom divide-and-conquer strategy.
- **Server**

  The server/ directory implements the library methods and API methods used to access the vector search engine.
- **Website**

  The ui/ directory implements the vector search engine website where users can upload and search documents.
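The divide-and-conquer execution used by dnc/ can be sketched as a fan-out/fan-in over goroutines: the dataset is divided into chunks, each chunk is searched concurrently, and the per-chunk winners are combined. This is a simplified illustration under assumed names (`parallelSearch`, `bestInChunk`), using a toy dot-product score in place of the real similarity kernel.

```go
package main

import (
	"fmt"
	"sync"
)

// result pairs an index in the dataset with its score for the query.
type result struct {
	index int
	score float64
}

// bestInChunk scans one slice of the dataset and reports its best match.
// A toy dot product stands in for the real cosine-similarity kernel.
func bestInChunk(query []float64, chunk [][]float64, offset int) result {
	best := result{index: -1, score: -1e18}
	for i, v := range chunk {
		var dot float64
		for j := range query {
			dot += query[j] * v[j]
		}
		if dot > best.score {
			best = result{index: offset + i, score: dot}
		}
	}
	return best
}

// parallelSearch divides the dataset into roughly nWorkers chunks,
// conquers each chunk in its own goroutine, then combines the winners.
func parallelSearch(query []float64, data [][]float64, nWorkers int) result {
	results := make(chan result, nWorkers)
	var wg sync.WaitGroup
	chunk := (len(data) + nWorkers - 1) / nWorkers
	for start := 0; start < len(data); start += chunk {
		end := start + chunk
		if end > len(data) {
			end = len(data)
		}
		wg.Add(1)
		go func(start, end int) {
			defer wg.Done()
			results <- bestInChunk(query, data[start:end], start)
		}(start, end)
	}
	wg.Wait()
	close(results)
	best := result{index: -1, score: -1e18}
	for r := range results {
		if r.score > best.score {
			best = r
		}
	}
	return best
}

func main() {
	data := [][]float64{{1, 0}, {0, 1}, {0.9, 0.4}, {0.2, 0.8}}
	fmt.Println(parallelSearch([]float64{1, 0.1}, data, 2))
}
```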
### Dependencies

- Linux (or Windows with WSL); native Windows works but is not fun to get working
- Golang >= 1.24.4
- GCC >= 8.0
- Node >= 22.0
- OpenAI / Ollama API URL
### Run development

```sh
go run .
./build.sh
```
## Operations

Running the executable will produce a config.json file in the current directory. Modify this file to configure your database settings and other parameters.
### Run App

The configuration file is auto-generated if it does not exist.

#### Linux

```sh
./build/vectorsearch ./config.json
```

#### Windows

```sh
.\build\vectorsearch.exe .\config.json
```
## Configuration

The config.json file contains all necessary configuration for the application, including:

- Database type (SQLite or PostgreSQL)
- Connection strings for each database type
- Ollama and/or OpenAI configuration

Not all configuration is required; the autogenerated configuration is sufficient for an MVP installation.
```json
{
  "server": {
    "http_address": ":7500",
    "https_address": ":7501"
  },
  "tls": {
    "dns": ["computer001.localdomain"],
    "ip": ["192.168.1.100"],
    "certificates": [
      {
        "cert_path": "/etc/ssl/certs/server.pem",
        "key_path": "/etc/ssl/private/server.pem"
      }
    ]
  },
  "database": {
    "sqlite": "./vectors.db",
    "postgres": ["host=localhost user=vectorsearch password=1234 dbname=vectordb port=9920 sslmode=disable"],
    "postgres_readonly": ["host=localhost user=vectorsearch password=1234 dbname=vectordb port=9920 sslmode=disable"]
  },
  "ollama": {
    "embed": {
      "model": "nomic-embed-text",
      "api_base": "http://localhost:11434",
      "num_ctx": 8192
    },
    "generate": {
      "model": "llama3.2",
      "api_base": "http://localhost:11434",
      "num_ctx": 128000
    },
    "chat": {
      "model": "llama3.2",
      "api_base": "http://localhost:11434",
      "num_ctx": 128000
    }
  },
  "openai": {
    "embed": {
      "model": "o4-mini",
      "api_base": "https://api.openai.com",
      "api_key": "sk-1234",
      "num_ctx": 200000
    },
    "generate": {
      "model": "o4-mini",
      "api_base": "https://api.openai.com",
      "api_key": "sk-1234",
      "num_ctx": 100000
    },
    "chat": {
      "model": "o4-mini",
      "api_base": "https://api.openai.com",
      "api_key": "sk-1234",
      "num_ctx": 100000
    }
  },
  "cache": "./cache/",
  "log_level": "error"
}
```