zerfoo

package module

v0.3.0 Latest Latest Go to latest Published: Aug 25, 2025 License: Apache-2.0 Imports: 10 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/zerfoo/zerfoo

Links

Open Source Insights

README ¶

Zerfoo: A High-Performance, Scalable, and Idiomatic Go Framework for Machine Learning

Zerfoo is a machine learning framework built from the ground up in Go. It is designed for performance, scalability, and developer experience, enabling everything from practical deep learning tasks to large-scale AGI experimentation.

By leveraging Go's strengths—simplicity, strong typing, and best-in-class concurrency—Zerfoo provides a robust and maintainable foundation for building production-ready ML systems.

Status: Pre-release — actively in development.

Quick Start

Define, train, and run a simple model in just a few lines of idiomatic Go.

package main

import (
	"fmt"

	"github.com/zerfoo/zerfoo/compute"
	"github.com/zerfoo/zerfoo/graph"
	"github.com/zerfoo/zerfoo/layers/activations"
	"github.com/zerfoo/zerfoo/layers/core"
	"github.com/zerfoo/zerfoo/tensor"
	"github.com/zerfoo/zerfoo/training"
)

func main() {
	// 1. Create a compute engine
	engine := compute.NewCPUEngine()

	// 2. Define the model architecture using a graph builder
	builder := graph.NewBuilder[float32](engine)
	input := builder.Input([]int{1, 10})
	dense1 := builder.AddNode(core.NewDense(10, 32), input)
	act1 := builder.AddNode(activations.NewReLU(), dense1)
	output := builder.AddNode(core.NewDense(32, 1), act1)

	// 3. Build the computational graph
	forward, backward, err := builder.Build(output)
	if err != nil {
		panic(err)
	}

	// 4. Create an optimizer
	optimizer := training.NewAdamOptimizer[float32](0.01)

	// 5. Generate dummy data
	inputTensor, _ := tensor.NewTensor(engine, []int{1, 10})
	targetTensor, _ := tensor.NewTensor(engine, []int{1, 1})

	// 6. Run the training loop
	for i := 0; i < 100; i++ {
		// Forward pass
		predTensor := forward(map[graph.NodeHandle]*tensor.Tensor[float32]{input: inputTensor})

		// Compute loss (dummy loss for this example)
		loss := predTensor.Data()[0] - targetTensor.Data()[0]
		grad := tensor.NewScalar(engine, 2*loss)

		// Backward pass to compute gradients
		backward(grad, map[graph.NodeHandle]*tensor.Tensor[float32]{input: inputTensor})

		// Update weights
		optimizer.Step(builder.Parameters())
	}

	fmt.Println("Training complete!")
}

Why Zerfoo?

Zerfoo is designed to address the limitations of existing ML frameworks by embracing Go's philosophy.

✅ Idiomatic and Simple: Build models using clean, readable Go. We favor composition over inheritance and explicit interfaces over magic.
🚀 High-Performance by Design: A static graph execution model, pluggable compute engines (CPU, GPU planned), and minimal Cgo overhead ensure your code runs fast.
⛓️ Robust and Type-Safe: Leverage Go's strong type system to catch errors at compile time, not runtime. Shape mismatches and configuration issues are caught before training even begins.
🌐 Scalable from the Start: With first-class support for distributed training, Zerfoo is architected to scale from a single laptop to massive compute clusters.
🧩 Modular and Extensible: A clean, layered architecture allows you to extend any part of the framework—from custom layers to new hardware backends—by implementing well-defined interfaces.

Core Features

Declarative Graph Construction: Define models programmatically with a Builder API or declaratively using Go structs and tags.
Static Execution Graph: The graph is built and validated once, resulting in error-free forward/backward passes and significant performance optimizations.
Pluggable Compute Engines: A hardware abstraction layer allows Zerfoo to target different backends. The default is a pure Go engine, with BLAS and GPU (CUDA) engines planned.
Automatic Differentiation: Gradients are computed efficiently using reverse-mode AD (backpropagation).
First-Class Distributed Training: A DistributedStrategy interface abstracts away the complexity of multi-node training, with support for patterns like All-Reduce and Parameter Server.
Multi-Precision Support: Native support for float32 and float64, with float16 and float8 for cutting-edge, low-precision training.
ONNX Interoperability: Export models to the Open Neural Network Exchange (ONNX) format for deployment in any compatible environment.

Architectural Vision

Zerfoo is built on a clean, layered architecture that separates concerns, ensuring the framework is both powerful and maintainable. A core tenet of this architecture is the use of the Zerfoo Model Format (ZMF) as the universal intermediate representation for models. This enables a strict decoupling from model converters like zonnx, ensuring that zerfoo focuses solely on efficient model execution without any ONNX-specific dependencies.

High-Level Architecture

Composition Layer: Define models as a Directed Acyclic Graph (DAG) of nodes (layers, activations). This layer is hardware-agnostic.
Execution Layer: A pluggable Engine performs the actual tensor computations on specific hardware (CPU, GPU). This allows the same model to run on different devices without code changes.

For a deep dive into the design philosophy, core interfaces, and technical roadmap, please read our Architectural Design Document.

Getting Started

This project is in a pre-release state. The API is not yet stable.

To install Zerfoo, use go get:

go get github.com/zerfoo/zerfoo

Contributing

Zerfoo is an ambitious project, and we welcome contributions from the community! Whether you're an expert in machine learning, compilers, or distributed systems, or a Go developer passionate about AI, there are many ways to get involved.

Please read our Architectural Design Document to understand the project's vision and technical foundations.

License

Zerfoo is licensed under the Apache 2.0 License.

Documentation ¶

Overview ¶

Package zerfoo provides the core building blocks for creating and training neural networks. It offers a prelude of commonly used types to simplify development and enhance readability of model construction code.

Index ¶

func BuildFromZMF[T tensor.Numeric](engine compute.Engine[T], ops numeric.Arithmetic[T], m *zmf.Model) (*graph.Graph[T], error)
func NewAdamW[T tensor.Numeric](learningRate, beta1, beta2, epsilon, weightDecay T) *optimizer.AdamW[T]
func NewCPUEngine[T tensor.Numeric]() compute.Engine[T]
func NewDefaultTrainer[T tensor.Numeric](graph *graph.Graph[T], loss graph.Node[T], opt optimizer.Optimizer[T], ...) *training.DefaultTrainer[T]
func NewFloat32Ops() numeric.Arithmetic[float32]
func NewGraph[T tensor.Numeric](engine compute.Engine[T]) *graph.Builder[T]
func NewMSE[T tensor.Numeric](engine compute.Engine[T]) *loss.MSE[T]
func NewRMSNorm[T tensor.Numeric](name string, engine compute.Engine[T], ops numeric.Arithmetic[T], modelDim int, ...) (*normalization.RMSNorm[T], error)
func NewTensor[T tensor.Numeric](shape []int, data []T) (*tensor.TensorNumeric[T], error)
func RegisterLayer[T tensor.Numeric](opType string, builder model.LayerBuilder[T])
func UnregisterLayer(opType string)
type Batch
type Engine
type Graph
type LayerBuilder
type Node
type Numeric
type Parameter
type Tensor
type ZMFModel

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func BuildFromZMF ¶

func BuildFromZMF[T tensor.Numeric](
	engine compute.Engine[T],
	ops numeric.Arithmetic[T],
	m *zmf.Model,
) (*graph.Graph[T], error)

BuildFromZMF builds a graph from a ZMF model.

func NewAdamW ¶

func NewAdamW[T tensor.Numeric](learningRate, beta1, beta2, epsilon, weightDecay T) *optimizer.AdamW[T]

NewAdamW creates a new AdamW optimizer.

func NewCPUEngine ¶

func NewCPUEngine[T tensor.Numeric]() compute.Engine[T]

NewCPUEngine creates a new CPU engine for the given numeric type.

func NewDefaultTrainer ¶

func NewDefaultTrainer[T tensor.Numeric](
	graph *graph.Graph[T],
	loss graph.Node[T],
	opt optimizer.Optimizer[T],
	strategy training.GradientStrategy[T],
) *training.DefaultTrainer[T]

NewDefaultTrainer creates a new default trainer.

func NewFloat32Ops ¶

func NewFloat32Ops() numeric.Arithmetic[float32]

NewFloat32Ops returns the float32 arithmetic operations.

func NewGraph ¶

func NewGraph[T tensor.Numeric](engine compute.Engine[T]) *graph.Builder[T]

NewGraph creates a new computation graph.

func NewMSE ¶

func NewMSE[T tensor.Numeric](engine compute.Engine[T]) *loss.MSE[T]

NewMSE creates a new Mean Squared Error loss function.

func NewRMSNorm ¶

func NewRMSNorm[T tensor.Numeric](name string, engine compute.Engine[T], ops numeric.Arithmetic[T], modelDim int, options ...normalization.RMSNormOption[T]) (*normalization.RMSNorm[T], error)

NewRMSNorm is a factory function for creating RMSNorm layers.

func NewTensor ¶

func NewTensor[T tensor.Numeric](shape []int, data []T) (*tensor.TensorNumeric[T], error)

NewTensor creates a new tensor with the given shape and data.

func RegisterLayer ¶

func RegisterLayer[T tensor.Numeric](opType string, builder model.LayerBuilder[T])

RegisterLayer registers a new layer builder.

func UnregisterLayer ¶

func UnregisterLayer(opType string)

UnregisterLayer unregisters a layer builder.

Types ¶

type Batch ¶

type Batch[T tensor.Numeric] struct {
	Inputs  map[graph.Node[T]]*tensor.TensorNumeric[T]
	Targets *tensor.TensorNumeric[T]
}

Batch represents a training batch.

type Engine ¶

type Engine[T tensor.Numeric] interface {
	compute.Engine[T]
}

Engine represents a computation engine (e.g., CPU).

type Graph ¶

type Graph[T tensor.Numeric] struct {
	*graph.Graph[T]
}

Graph represents a computation graph.

type LayerBuilder ¶

type LayerBuilder[T tensor.Numeric] func(
	engine compute.Engine[T],
	ops numeric.Arithmetic[T],
	name string,
	params map[string]*graph.Parameter[T],
	attributes map[string]interface{},
) (graph.Node[T], error)

LayerBuilder is a function that builds a layer.

type Node ¶

type Node[T tensor.Numeric] interface {
	graph.Node[T]
}

Node represents a node in the computation graph.

type Numeric ¶

type Numeric tensor.Numeric

Numeric represents a numeric type constraint.

type Parameter ¶

type Parameter[T tensor.Numeric] struct {
	*graph.Parameter[T]
}

Parameter represents a trainable parameter in the model.

type Tensor ¶

type Tensor[T tensor.Numeric] struct {
	*tensor.TensorNumeric[T]
}

Tensor represents a multi-dimensional array.

type ZMFModel ¶

type ZMFModel = zmf.Model

Model is a ZMF model.

Source Files ¶

View all Source files

zerfoo.go

Directories ¶

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

Path	Synopsis
compute Package compute implements tensor computation engines and operations.	Package compute implements tensor computation engines and operations.
data
device Package device provides device abstraction and memory allocation interfaces.	Package device provides device abstraction and memory allocation interfaces.
distributed Package distributed provides distributed training strategies and coordination mechanisms for multi-node machine learning workloads in the Zerfoo framework.	Package distributed provides distributed training strategies and coordination mechanisms for multi-node machine learning workloads in the Zerfoo framework.
coordinator Package coordinator provides a distributed training coordinator.	Package coordinator provides a distributed training coordinator.
pb
features
graph Package graph provides a computational graph abstraction.	Package graph provides a computational graph abstraction.
internal
xblas
layers
activations Package activations provides activation function layers.	Package activations provides activation function layers.
attention Package attention provides attention mechanisms for neural networks.	Package attention provides attention mechanisms for neural networks.
components Package components provides reusable components for neural network layers.	Package components provides reusable components for neural network layers.
core Package core provides core neural network layer implementations.	Package core provides core neural network layer implementations.
embeddings Package embeddings provides neural network embedding layers for the Zerfoo ML framework.	Package embeddings provides neural network embedding layers for the Zerfoo ML framework.
features
gather Package gather provides the Gather layer for the Zerfoo ML framework.	Package gather provides the Gather layer for the Zerfoo ML framework.
hrm Package hrm implements the Hierarchical Reasoning Model.	Package hrm implements the Hierarchical Reasoning Model.
normalization Package normalization provides various normalization layers for neural networks.	Package normalization provides various normalization layers for neural networks.
recurrent
reducesum Package reducesum provides the ReduceSum layer for the Zerfoo ML framework.	Package reducesum provides the ReduceSum layer for the Zerfoo ML framework.
registry Package registry provides a central registration point for all layer builders.	Package registry provides a central registration point for all layer builders.
tokenizers
transformer Package transformer provides transformer building blocks such as the Transformer `Block` used in encoder/decoder stacks.	Package transformer provides transformer building blocks such as the Transformer `Block` used in encoder/decoder stacks.
transpose Package transpose provides the Transpose layer for the Zerfoo ML framework.	Package transpose provides the Transpose layer for the Zerfoo ML framework.
model Package model provides the core structures and loading mechanisms for Zerfoo models.	Package model provides the core structures and loading mechanisms for Zerfoo models.
hrm Package hrm provides experimental Hierarchical Reasoning Model types.	Package hrm provides experimental Hierarchical Reasoning Model types.
numeric Package numeric provides precision types, arithmetic operations, and generic constraints for the Zerfoo ML framework.	Package numeric provides precision types, arithmetic operations, and generic constraints for the Zerfoo ML framework.
pkg
prelude
tokenizer Package tokenizer provides basic text tokenization functionality.	Package tokenizer provides basic text tokenization functionality.
tensor Package tensor provides a multi-dimensional array (tensor) implementation.	Package tensor provides a multi-dimensional array (tensor) implementation.
testing
testutils Package testutils provides testing utilities and mock implementations for the Zerfoo ML framework.	Package testutils provides testing utilities and mock implementations for the Zerfoo ML framework.
training Package training provides tools for training neural networks.	Package training provides tools for training neural networks.
loss Package loss provides various loss functions for neural networks.	Package loss provides various loss functions for neural networks.
optimizer Package optimizer provides various optimization algorithms for neural networks.	Package optimizer provides various optimization algorithms for neural networks.
types Package types contains shared, fundamental types for the Zerfoo framework.	Package types contains shared, fundamental types for the Zerfoo framework.