Installation and Setup
Prerequisite
This project is optimized for Linux (Debian/Ubuntu). State of the art as of 2026: it uses hardware acceleration (CUDA) and a modular design to ensure privacy and performance.
This guide will take you from zero to a fully functional dictation system on your local machine.
Method 1: Automatic Installation (Recommended)
We've created a script that handles the "dirty work" for you: it verifies your system, installs system dependencies (apt), creates the virtual environment (venv), and configures credentials.
What this script does:
- Installs system libraries (ffmpeg, xclip, pulseaudio-utils).
- Creates an isolated Python environment (venv).
- Installs project dependencies (faster-whisper, torch).
- Helps you configure your Gemini API Key (optional, for generative AI).
- Verifies whether you have a compatible NVIDIA GPU.
Method 2: Manual Installation
If you prefer full control or the automatic script fails, follow these steps.
1. System Dependencies (System Level)
We need tools to manipulate audio and clipboard at the OS level.
sudo apt update
sudo apt install ffmpeg xclip pulseaudio-utils python3-venv build-essential python3-dev
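After the apt step, a quick check that the tools are actually on your PATH can save debugging later. The sketch below is an illustration, not part of the project: the function name is made up, and `pactl` is assumed as a representative executable from pulseaudio-utils.

```python
import shutil

# Executables expected after the apt step. `pactl` is assumed here as a
# representative binary shipped by pulseaudio-utils.
REQUIRED_TOOLS = ["ffmpeg", "xclip", "pactl"]


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the names from `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All system tools found.")
```

If anything is reported missing, re-run the apt install command above before continuing.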
2. Python Environment
We isolate libraries to avoid conflicts.
# Navigate to the backend directory
cd apps/daemon/backend
# Create virtual environment
python3 -m venv venv
# Activate environment (Do this every time you work on the project!)
source venv/bin/activate
# Install dependencies
pip install -e .
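To confirm that the environment is active and the dependencies landed in it, something like the following can be run with the venv's Python. It is a minimal sketch with hypothetical helper names; `faster_whisper` and `torch` are the import names of the packages mentioned in this guide.

```python
import importlib.util
import sys


def in_virtualenv():
    """True when the interpreter runs inside a venv (prefix differs from base)."""
    return sys.prefix != sys.base_prefix


def is_installed(module_name):
    """True when a top-level module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    print("Inside a virtual environment:", in_virtualenv())
    for name in ("faster_whisper", "torch"):
        print(f"{name} installed:", is_installed(name))
```

If the first line prints False, you most likely skipped the activate step in the current shell.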
3. AI Configuration (Optional)
To use "Text Refinement" features (rewriting with LLM), you need a Google Gemini API Key.
- Get your key at Google AI Studio.
- Create a .env file at the root and add your key to it.
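Once the file exists, you can check that the key is readable with a small parser like the one below. This is a sketch under assumptions: the variable name `GEMINI_API_KEY` is hypothetical (use whatever name the project expects), and the parser only handles plain KEY=VALUE lines.

```python
from pathlib import Path


def read_env(path=".env"):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    env_file = Path(path)
    if not env_file.is_file():
        return values
    for line in env_file.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values


def has_key(values, name="GEMINI_API_KEY"):  # hypothetical variable name
    """True when `name` is present in `values` with a non-empty value."""
    return bool(values.get(name))


if __name__ == "__main__":
    print("API key configured:", has_key(read_env()))
```

The script prints only whether the key is set, never the key itself, so its output is safe to paste into a bug report.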
Verification
Make sure everything works before continuing.
1. Verify GPU Acceleration
This confirms that Whisper can use your graphics card (essential for speed).
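One way to run this check from the activated environment is to ask PyTorch whether it sees a CUDA device. Treat the sketch below as a first signal rather than the project's own verification command: faster-whisper runs inference through CTranslate2, which has its own CUDA library requirements, so a positive result here is necessary but may not be sufficient. The function name is hypothetical.

```python
def cuda_report():
    """Describe whether PyTorch can see a CUDA device."""
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed in this environment."
    if not torch.cuda.is_available():
        return "PyTorch is installed, but no CUDA device is available."
    # Device index 0 is the first GPU visible to this process.
    return f"CUDA available: {torch.cuda.get_device_name(0)}"


if __name__ == "__main__":
    print(cuda_report())
```

If no CUDA device is reported on a machine with an NVIDIA card, check the driver installation first (for example with nvidia-smi).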
2. System Diagnostics
Verify that the daemon and audio services are ready.
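For the audio half of this check, a minimal sketch is to ask the sound server for its status through `pactl info` (part of pulseaudio-utils). The function name is hypothetical, and the daemon check is project-specific and not covered here.

```python
import subprocess


def audio_server_ready(timeout=5):
    """True when `pactl info` exits successfully, i.e. a sound server responds."""
    try:
        result = subprocess.run(
            ["pactl", "info"],
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired):
        # pactl is missing, not executable, or the server did not answer in time.
        return False
    return result.returncode == 0


if __name__ == "__main__":
    print("Audio server reachable:", audio_server_ready())
```

A False result usually means either pulseaudio-utils is not installed or no sound server is running in the current session.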
Next Steps
Once installed, it's time to configure how you interact with the tool.
- Detailed Configuration - Adjust models and sensitivity.
- Keyboard Shortcuts - Configure your magic keys.