manta

module
v0.0.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Oct 28, 2024 License: Apache-2.0

README

llmaz

A lightweight P2P-based cache system for model distributions.

stability-alpha GoReport Widget Latest Release

Name Story: the inspiration of the name Manta is coming from Dota2, called Manta Style, which will create 2 images of your hero just like peers in the P2P network.

Architecture

architecture

Note: llmaz is just one kind of integrations, Manta can be deployed and used independently.

Features Overview

  • Preheat Models: Models could be preloaded to the cluster, or even specified nodes to accelerate the model serving.
  • Model Caching: Once models are downloaded, origin access is no longer necessary, but from other node peers.
  • Plug Framework: Filter and Score extension points could be customized with plugins to pick the right peers.
  • Model LCM: Manage the model lifecycles automatically with different configurations.
  • Memory Management(WIP): Specify the maximum reserved memory for use, and GC with LRU algorithm.

Quick Start

Installation

Read the Installation for guidance.

Preload Models

A toy sample to preload the Qwen/Qwen2-7B-Instruct model:

apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  replicas: 1
  hub:
    repoID: Qwen/Qwen2-7B-Instruct

If you want to preload the model to specified nodes, use the NodeSelector:

apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  replicas: 1
  hub:
    repoID: Qwen/Qwen2-7B-Instruct
  nodeSelector:
    zone: zone-a

If you want to remove the model weights once Torrent is deleted, set the ReclaimPolicy=Delete, default to Retain:

apiVersion: manta.io/v1alpha1
kind: Torrent
metadata:
  name: torrent-sample
spec:
  replicas: 1
  hub:
    repoID: Qwen/Qwen2-7B-Instruct
  nodeSelector:
    zone: zone-a
  reclaimPolicy: Delete

More details refer to the APIs.

Roadmap

  • GC policy with LRU algorithm
  • More integrations with serving projects
  • Support file chunking

Contributions

🚀 All kinds of contributions are welcomed ! Please follow CONTRIBUTING.md.

🎉 Thanks to all these contributors !

Directories

Path Synopsis
agent
cmd command
api
v1alpha1
Package v1alpha1 contains API Schema definitions for the v1alpha1 API group +kubebuilder:object:generate=true +groupName=manta.io
Package v1alpha1 contains API Schema definitions for the v1alpha1 API group +kubebuilder:object:generate=true +groupName=manta.io
hack
pkg
test

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL