rtc-transcribe

command

v0.0.8 Latest Latest Go to latest Published: Jun 20, 2025 License: MIT Imports: 15 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/go-go-golems/go-go-labs

Links

Open Source Insights

README ¶

RTC Transcribe

A real-time audio transcription server using WebRTC and OpenAI's Whisper API.

Features

Browser-based audio capture using WebRTC
Real-time audio streaming to Go server
Robust Opus audio decoding with libopus support
Detailed structured logging for debugging and monitoring
Audio transcription using OpenAI's Whisper API
Real-time text streaming back to the browser using Server-Sent Events (SSE)
Clean, simple user interface

Architecture

RTC Transcribe is built with the following components:

Frontend: HTML/CSS/JavaScript web client that:
- Captures microphone audio
- Establishes WebRTC connection with the server
- Streams the audio in real-time
- Receives and displays transcription results
Backend Server: Go application that:
- Handles WebRTC signaling and establishes peer connections
- Receives and decodes Opus audio streams
- Buffers audio for optimal transcription
- Transcribes audio using Whisper API
- Streams results back to the client using SSE

Requirements

Go 1.23+
OpenAI API key for the Whisper API

Production Requirements

For production use with real Opus decoding (recommended), you'll need:

libopus and libopusfile development libraries
- Ubuntu/Debian: sudo apt-get install libopus-dev libopusfile-dev
- macOS: brew install opus opusfile
- Windows: Use MSYS2/MinGW or vcpkg

Building

Build with real Opus support (recommended for production)

# First install the required dependencies
sudo apt-get install libopus-dev libopusfile-dev

# Then build
go build -o rtc-transcribe ./cmd/apps/rtc-transcribe/

Build with mock Opus decoder (no dependencies, reduced audio quality)

go build -tags noopus -o rtc-transcribe ./cmd/apps/rtc-transcribe/

Usage

Run the application:

# Set your OpenAI API key
export OPENAI_API_KEY=your_api_key_here

# Run with default settings
./rtc-transcribe

# Run with debug logging
./rtc-transcribe --log-level debug

# Run on a different port
./rtc-transcribe --port 9000

# Specify OpenAI API key directly
./rtc-transcribe --api-key sk-xxxxxxxxxxxxx