Skip to content

Glossary

This glossary defines technical and domain terms used in Voice2Machine.

General Terms

Local-First

Design philosophy where data (audio, text) is processed and stored exclusively on the user's device, without relying on the cloud.

Daemon

Background process (written in Python) that manages recording, transcription, and communication with the frontend.

REST API

Communication mechanism between the Daemon (Python) and clients (scripts, frontends). We use FastAPI with standard HTTP endpoints and WebSocket for real-time events.

Technical Components

Whisper

Speech recognition model (ASR) developed by OpenAI. Voice2Machine uses faster-whisper, an optimized implementation with CTranslate2.

Workflows

Specialized coordination components that manage the complete lifecycle of a specific task (e.g., RecordingWorkflow, LLMWorkflow). They replace the old monolithic "Orchestrator" for better traceability and maintainability.

Features

Self-contained modules that group domain logic and its infrastructure adapters (audio, llm, transcription). They represent the system's core capabilities.

BackendProvider

Frontend component (React Context) that manages connection with the Daemon and distributes state to the UI.

TelemetryContext

Sub-context in React optimized for high-frequency updates (GPU metrics, audio levels) to avoid unnecessary re-renders of the main UI.

Modular Architecture

Evolution of Hexagonal Architecture that organizes code around business modules (Features) and execution flows (Workflows), minimizing coupling and maximizing clarity.