yap is a free, open-source voice-to-text CLI that converts speech to text at your cursor. It runs on Linux, macOS, and Windows with ~5 MB RAM.
Lightweight voice-to-text for your desktop.
Hold a key. Speak. Text appears at your cursor. Zero idle footprint.
yap is a free, open-source voice-to-text CLI built with Go. It runs as a single static binary on Linux, macOS, and Windows with no runtime dependencies. Audio is transcribed via the Groq Whisper API in ~1–2 seconds, and yap uses just ~5–10 MB of RAM when idle — a fraction of typical desktop dictation software.
Demo GIF placeholder — hold-to-talk workflow
Features
Hold-to-Talk Workflow
Hold
Press hotkey
Speak
Record audio
Release
Send to API
Text
Appears at cursor
Memory
~5-10MB
yap uses just ~5–10 MB RAM when idle with negligible CPU when not recording.
Speed
~1-2s
yap transcribes in ~1–2s after you release the hotkey.
Powered by Groq Whisper API
Portability
yap ships as a single static Go binary with no runtime dependencies.
Configuration
yap includes a first-class NixOS module and uses standard TOML config.
[transcription] provider = "groq" [hotkey] key = "Super_R"
Trust & Transparency
Zero Telemetry.
Local Environment Variables.
100% Open Source.
No analytics. No phone-home. Your API keys stay in your environment. Audit every line of code.
Bring Your Own Key
$0
recurring software costs.
Use your own Groq API key. Pay only for what you use. No subscriptions, no tiers, no vendor lock-in.