Directories
ΒΆ
| Path | Synopsis |
|---|---|
|
cmd
|
|
|
sk-browser-smoke
command
Command sk-browser-smoke drives the public smoke page (`/`) of a running speechkit-server instance from a real headless Chrome process and asserts that every mode tile reports OK.
|
Command sk-browser-smoke drives the public smoke page (`/`) of a running speechkit-server instance from a real headless Chrome process and asserts that every mode tile reports OK. |
|
sk-e2e
command
Command sk-e2e is a thin end-to-end smoke client for a running speechkit-server instance.
|
Command sk-e2e is a thin end-to-end smoke client for a running speechkit-server instance. |
|
sk-localprobe
command
Command sk-localprobe verifies that the SpeechKit kernel libraries produce working Dictation, Assist, and Voice Agent results against LOCAL models only (Whisper.cpp + Gemma via llama-server).
|
Command sk-localprobe verifies that the SpeechKit kernel libraries produce working Dictation, Assist, and Voice Agent results against LOCAL models only (Whisper.cpp + Gemma via llama-server). |
|
speechkit
command
|
|
|
speechkit-cli
command
|
|
|
speechkit-mcp
command
|
|
|
speechkit-openwakeword
command
speechkit-openwakeword hosts the openWakeWord-compatible ONNX frontend in a sibling process.
|
speechkit-openwakeword hosts the openWakeWord-compatible ONNX frontend in a sibling process. |
|
speechkit-server
command
Package main is the canonical kombify SpeechKit Linux container server.
|
Package main is the canonical kombify SpeechKit Linux container server. |
|
speechkit-wakeword
command
speechkit-wakeword is the sidecar binary that hosts SpeechKit's on-device keyword spotter outside the main desktop process.
|
speechkit-wakeword is the sidecar binary that hosts SpeechKit's on-device keyword spotter outside the main desktop process. |
|
speechkit/internal/profiles
Package profiles provides pure model-profile selection helpers β the part of the legacy cmd/speechkit model_selection_helpers.go that depends only on config and the model catalog (no *appState, no network, no Wails surface).
|
Package profiles provides pure model-profile selection helpers β the part of the legacy cmd/speechkit model_selection_helpers.go that depends only on config and the model catalog (no *appState, no network, no Wails surface). |
|
speechkit/internal/transcription
Package transcription provides STT model-selection helpers and the vocabulary-dictionary primitives that the desktop adapters consume.
|
Package transcription provides STT model-selection helpers and the vocabulary-dictionary primitives that the desktop adapters consume. |
|
examples
|
|
|
library
command
Example: Using SpeechKit as a Go library for speech-to-text.
|
Example: Using SpeechKit as a Go library for speech-to-text. |
|
provider-catalog
command
Example: reading SpeechKit's public mode and provider catalog.
|
Example: reading SpeechKit's public mode and provider catalog. |
|
voice-agent/game-instructor
command
Example: 15-minute Voice-Agent game instructor.
|
Example: 15-minute Voice-Agent game instructor. |
|
internal
|
|
|
ai
Package ai wires the Genkit runtime and the SpeechKit model catalog into a single LLM/embedding/reranker surface used by Assist and the Voice Agent pipeline-fallback path.
|
Package ai wires the Genkit runtime and the SpeechKit model catalog into a single LLM/embedding/reranker surface used by Assist and the Voice Agent pipeline-fallback path. |
|
assist
Package assist implements the Assist Mode pipeline: STT transcript β Codeword check β LLM β TTS β Result with both text and audio.
|
Package assist implements the Assist Mode pipeline: STT transcript β Codeword check β LLM β TTS β Result with both text and audio. |
|
assist/skills/voice_companion
Package voice_companion provides ToolExecutor-compatible skill plugins for SpeechKit's Voice-Companion pattern.
|
Package voice_companion provides ToolExecutor-compatible skill plugins for SpeechKit's Voice-Companion pattern. |
|
audio
Audio playback via ebitengine/oto only requires cgo on Linux (ALSA/PulseAudio); the Windows and Darwin backends are pure-Go via purego.
|
Audio playback via ebitengine/oto only requires cgo on Linux (ALSA/PulseAudio); the Windows and Darwin backends are pure-Go via purego. |
|
auditlog
Package auditlog provides the dedicated audit-event stream for SpeechKit.
|
Package auditlog provides the dedicated audit-event stream for SpeechKit. |
|
auditlogtest
Package auditlogtest provides test-only helpers for resetting the audit log package state between test cases.
|
Package auditlogtest provides test-only helpers for resetting the audit log package state between test cases. |
|
auth
Package auth provides the authentication abstraction for SpeechKit.
|
Package auth provides the authentication abstraction for SpeechKit. |
|
dictation
Package dictation implements pause-based segmentation for Dictation Mode: it consumes VAD speech-probability frames and emits one transcription request per natural pause.
|
Package dictation implements pause-based segmentation for Dictation Mode: it consumes VAD speech-probability frames and emits one transcription request per natural pause. |
|
downloads
Package downloads manages model downloads for SpeechKit β HTTP file downloads and Ollama model pulls with progress tracking.
|
Package downloads manages model downloads for SpeechKit β HTTP file downloads and Ollama model pulls with progress tracking. |
|
features
Package features provides runtime feature detection for UI gating.
|
Package features provides runtime feature detection for UI gating. |
|
kombify
Package kombify is the build-tag seam between OSS and kombify builds.
|
Package kombify is the build-tag seam between OSS and kombify builds. |
|
models
Package models defines the SpeechKit model catalog: provider IDs, model identifiers, modality (STT, TTS, Realtime Voice, Assist, Utility, Embedding, Reranker), execution mode (local/cloud/direct), and the readiness metadata that setup UIs and the readiness endpoint consume.
|
Package models defines the SpeechKit model catalog: provider IDs, model identifiers, modality (STT, TTS, Realtime Voice, Assist, Utility, Embedding, Reranker), execution mode (local/cloud/direct), and the readiness metadata that setup UIs and the readiness endpoint consume. |
|
netsec
Package netsec provides centralized network security primitives used by every HTTP-based provider in SpeechKit (STT, TTS, LLM, downloads).
|
Package netsec provides centralized network security primitives used by every HTTP-based provider in SpeechKit (STT, TTS, LLM, downloads). |
|
router
Package router implements the STT routing layer.
|
Package router implements the STT routing layer. |
|
scaffold
Package scaffold renders embedded starter templates into a target directory so callers can bootstrap a SpeechKit integration without hand-copying boilerplate.
|
Package scaffold renders embedded starter templates into a target directory so callers can bootstrap a SpeechKit integration without hand-copying boilerplate. |
|
server/assist
Package assist implements the POST /v1/assist/process handler.
|
Package assist implements the POST /v1/assist/process handler. |
|
server/audio
Package audio normalizes inbound audio payloads to the Framework kernel's canonical PCM format (16 kHz, signed 16-bit little-endian, mono) before they enter the STT router.
|
Package audio normalizes inbound audio payloads to the Framework kernel's canonical PCM format (16 kHz, signed 16-bit little-endian, mono) before they enter the STT router. |
|
server/cli
Package cli holds the small amount of CLI-level glue for the Linux SpeechKit Server entry point.
|
Package cli holds the small amount of CLI-level glue for the Linux SpeechKit Server entry point. |
|
server/core
Package core is the SpeechKit server bootstrap layer.
|
Package core is the SpeechKit server bootstrap layer. |
|
server/dictation
Package dictation implements the POST /v1/dictation/transcribe handler.
|
Package dictation implements the POST /v1/dictation/transcribe handler. |
|
server/httpx
Package httpx contains tiny cross-handler helpers for JSON error envelopes and status mapping.
|
Package httpx contains tiny cross-handler helpers for JSON error envelopes and status mapping. |
|
server/middleware
Package middleware provides HTTP middleware primitives for the SpeechKit server adapter.
|
Package middleware provides HTTP middleware primitives for the SpeechKit server adapter. |
|
server/persona
Package persona provides the Voice Agent persona / role / sequence catalog for the Server-Target.
|
Package persona provides the Voice Agent persona / role / sequence catalog for the Server-Target. |
|
server/voiceagent
Package voiceagent implements the Voice Agent WebSocket surface on the Server-Target.
|
Package voiceagent implements the Voice Agent WebSocket surface on the Server-Target. |
|
server/wakewordtraining
Package wakewordtraining mounts the v0.37.5 REST endpoints that accept wake-word activation training-data uploads from device clients.
|
Package wakewordtraining mounts the v0.37.5 REST endpoints that accept wake-word activation training-data uploads from device clients. |
|
serverclient
Package serverclient is the client-side transport adapter that lets a device-target (cmd/speechkit) or a local-target binary delegate one or more modes (Dictation, Assist, Voice Agent) to a remote SpeechKit Server-Target instead of running the Framework kernel in-process.
|
Package serverclient is the client-side transport adapter that lets a device-target (cmd/speechkit) or a local-target binary delegate one or more modes (Dictation, Assist, Voice Agent) to a remote SpeechKit Server-Target instead of running the Framework kernel in-process. |
|
shortcuts
Package shortcuts implements pattern-matched intent shortcuts used by Assist Mode.
|
Package shortcuts implements pattern-matched intent shortcuts used by Assist Mode. |
|
stt
Package stt defines the SpeechKit speech-to-text provider interface and houses the concrete provider implementations: whisper.cpp (local built-in), HuggingFace, OpenAI, Groq, Google, an OpenAI-compatible adapter (covers Ollama and other compatible servers), and the self-hosted VPS adapter.
|
Package stt defines the SpeechKit speech-to-text provider interface and houses the concrete provider implementations: whisper.cpp (local built-in), HuggingFace, OpenAI, Groq, Google, an OpenAI-compatible adapter (covers Ollama and other compatible servers), and the self-hosted VPS adapter. |
|
tts
Package tts implements the SpeechKit text-to-speech surface: a small provider interface plus concrete adapters for OpenAI, Google, and Hugging Face.
|
Package tts implements the SpeechKit text-to-speech surface: a small provider interface plus concrete adapters for OpenAI, Google, and Hugging Face. |
|
voiceagent/cascaded
Package cascaded implements a turn-based STT -> LLM -> TTS voice agent provider.
|
Package cascaded implements a turn-based STT -> LLM -> TTS voice agent provider. |
|
voicebehavior
Package voicebehavior contains the shared Voice Agent behavior catalog used by both the local desktop runtime and the Linux server target.
|
Package voicebehavior contains the shared Voice Agent behavior catalog used by both the local desktop runtime and the Linux server target. |
|
voiceeval
Package voiceeval contains deterministic dialogue checks for Voice Agent workflow tests.
|
Package voiceeval contains deterministic dialogue checks for Voice Agent workflow tests. |
|
wakeword
Package wakeword implements an always-on, on-device keyword spotter that any of the three SpeechKit modes (Dictation, Assist, Voice Agent) can opt into.
|
Package wakeword implements an always-on, on-device keyword spotter that any of the three SpeechKit modes (Dictation, Assist, Voice Agent) can opt into. |
|
winapi
Package winapi provides shared Windows DLL proc references used by multiple packages.
|
Package winapi provides shared Windows DLL proc references used by multiple packages. |
|
pkg
|
|
|
speechkit
Package speechkit provides the public SDK for embedding SpeechKit voice capture, transcription, and assist/voice-agent pipelines into host applications.
|
Package speechkit provides the public SDK for embedding SpeechKit voice capture, transcription, and assist/voice-agent pipelines into host applications. |
|
speechkit/agentkit
Package agentkit provides a small Go harness for building SpeechKit Voice Agent hosts.
|
Package agentkit provides a small Go harness for building SpeechKit Voice Agent hosts. |
|
speechkit/assist
Package assist provides an embeddable Assist Mode service.
|
Package assist provides an embeddable Assist Mode service. |
|
speechkit/client
Package client provides a typed HTTP client for talking to a remote SpeechKit Server (the `cmd/speechkit-server` Linux container or any compatible deployment).
|
Package client provides a typed HTTP client for talking to a remote SpeechKit Server (the `cmd/speechkit-server` Linux container or any compatible deployment). |
|
speechkit/dictation
Package dictation provides an embeddable strict Dictation runtime.
|
Package dictation provides an embeddable strict Dictation runtime. |
|
speechkit/voiceagent
Package voiceagent provides an embeddable Voice Agent service.
|
Package voiceagent provides an embeddable Voice Agent service. |
|
speechkit/voiceagent/live
Package live exposes the low-level Voice Agent realtime-protocol types.
|
Package live exposes the low-level Voice Agent realtime-protocol types. |
Click to show internal directories.
Click to hide internal directories.