Installation¶
Prerequisites¶
- Claude Code (CLI) installed and working
- Python 3.11+ (for hook scripts)
- Audio output — speakers, headphones, or system audio
No external Python packages are required. cc-vox uses only the standard library.
Install the Plugin¶
Pick a Backend¶
You need at least one TTS backend running. cc-vox supports three, in priority order:
| Backend | Type | Setup | Quality |
|---|---|---|---|
| Fish Speech | GPU (Docker) | docker run + NVIDIA GPU |
Best |
| Kokoro | CPU (Docker) | docker run |
Great |
| pocket-tts | CPU (uvx) | Zero setup — auto-starts | Good |
Option A: Zero Setup (pocket-tts)¶
No action needed. pocket-tts auto-starts via uvx when cc-vox first tries to speak. The pocket-tts model downloads automatically on first use.
Tip
If you have uv installed, pocket-tts will download and start automatically on first use. You can pre-download the model with hf download kyutai/pocket-tts.
Option B: Kokoro (Recommended)¶
Tip
Kokoro offers the best balance of quality and simplicity. One command, CPU-only, great results.
Option C: Fish Speech (Best Quality)¶
# Download the model
hf download fishaudio/openaudio-s1-mini --local-dir checkpoints/openaudio-s1-mini
# Start the container
docker run -d --name fish-speech \
--gpus all \
-p 32611:7860 \
-v ./checkpoints:/app/checkpoints \
fishaudio/fish-speech:latest
Important
Requires an NVIDIA GPU with Docker GPU support configured. The openaudio-s1-mini model is licensed CC-BY-NC-SA-4.0.
Verify¶
# Start Claude Code
claude
# In the session, type anything — you should hear audio after the response
# Or test the TTS script directly:
./scripts/say "Hello from cc-vox"