rtp-agent

module
v0.0.2 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 14, 2026 License: MIT

README

RTP Agent

Go Version Docs Architecture License Status

Golang-first effort to build a practical alternative to the LiveKit Agent SDK, focused on lower end-to-end latency and reduced hardware requirements.

Documentation: https://cavos-io.github.io/rtp-agent/

Why This Project

rtp-agent explores a lean, production-friendly agent runtime in Go for real-time voice and multimodal workflows.

Core goals:

  • Reduce latency across STT -> LLM -> TTS turn loops.
  • Minimize CPU and memory requirements so agents can run on smaller instances.
  • Keep architecture modular through ports and adapters, so providers can be swapped quickly.

Table of Contents

Features

  • Go-based worker runtime and CLI entrypoint.
  • Hexagonal architecture (adapter, core, interface, library) for clear separation of concerns.
  • Broad provider coverage with 60+ adapter packages (LLM, STT, TTS, avatar, utilities).
  • LiveKit-compatible worker transport and job lifecycle handling.
  • Docusaurus documentation site for adapter references.

Built With

Getting Started

Prerequisites
  • Go 1.25+ (as defined in go.mod)
  • Access to a LiveKit deployment (URL, API key, API secret)
Installation
git clone https://github.com/cavos-io/rtp-agent.git
cd rtp-agent
go mod tidy
Configure Worker Options

Current bootstrap is code-first. Update worker options in cmd/main.go before running:

opts := worker.WorkerOptions{
  AgentName:  "example-agent",
  WorkerType: worker.WorkerTypeRoom,
  WSRL:       "wss://<your-livekit-host>",
  APIKey:     "<your-api-key>",
  APISecret:  "<your-api-secret>",
}

Usage

Run the worker:

go run ./cmd/main.go start

Development mode with autoreload:

go run ./cmd/main.go dev

Connect mode for local room testing:

go run ./cmd/main.go connect <room_name> [participant_identity]

Roadmap

  • End-to-end latency profiling and optimization pass.
  • Lower-memory execution mode for constrained environments.
  • Compatibility layer and integrations for Pipecat.
  • Compatibility layer and integrations for VisionAgent.
  • Additional open-source integration target: Vocode.

Documentation

Contributing

Contributions are welcome. Open an issue for bug reports or feature proposals, and submit a PR for improvements.

Repository: https://github.com/cavos-io/rtp-agent

License

This project is licensed under the MIT License.

See LICENSE for details.

Acknowledgments

  • LiveKit for the reference agent model and ecosystem.
  • Pion for core WebRTC building blocks in Go.
  • The broader open-source realtime AI community.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL