Skip to main content

What is InkSpoke?

InkSpoke is voice-first writing for every app you already use. You hold a hotkey, speak, and InkSpoke transcribes your words on your own device, optionally polishes them with AI, and types the result straight into whatever app has your cursor — your editor, your email, a chat box, a terminal. No window-switching, no copy-paste.

The tagline says it best: you spoke, it's inked.

The core loop

Everything in InkSpoke is built around one fast loop. On the desktop it looks like this:

  1. Press the activation hotkey (default Alt + Space, or + Space on macOS). A small listening overlay appears.
  2. Speak. A live waveform (and, if you like, a live transcript) shows InkSpoke is hearing you.
  3. Press the hotkey again (or click Send). InkSpoke transcribes your speech — by default with an on-device model, so your audio never has to leave your computer.
  4. AI refinement (optional) cleans up filler and grammar and matches the tone of the app you're writing in.
  5. The finished text is injected wherever your cursor was.

The whole round trip is designed to feel instant.

What you can do with it

InkSpoke is much more than a dictation box. Its main capabilities:

CapabilityWhat it means for you
Dictate anywherePush-to-talk into any application, with live streaming transcription and multi-language support.
Refine with AITurn rambling speech into polished, app-aware writing — casual in chat, precise in an IDE.
Command ModeSelect text in any app and speak an instruction ("make this formal", "translate to Spanish") to transform it.
Meetings & filesRecord a meeting (your mic + the other participants' audio) or import an audio/video file, and get a speaker-labeled transcript you can export.
Voice controlA spoken wake word starts dictation hands-free; Pro users can map voice commands to actions.
WorkspacesTeach InkSpoke your vocabulary, tone, and domain knowledge, applied automatically based on the app you're in.
Go mobileDictate from the InkSpoke keyboard on iOS and Android, paired with your desktop.

Private by design

InkSpoke is offline-first. Speech recognition and (on desktop) AI refinement can run entirely on-device using downloadable models, and your dictation history, recordings, and workspaces live locally unless you explicitly turn on cloud sync. When you do sync, workspace and dictionary content is end-to-end encrypted — the servers can't read it.

You can also bring your own AI provider keys (BYOK) or use the built-in InkSpoke Platform models. You're always in control of where your words are processed.

On-device vs. cloud

New users start on the built-in Whisper Small on-device speech model — fully offline and free. You can switch to larger on-device models or cloud providers any time. See On-device vs. cloud.

The InkSpoke family

InkSpoke runs across your devices, each tuned to how you work there:

┌────────────────────┐   ┌────────────────────┐   ┌────────────────────┐
│ Desktop app │ │ Mobile apps │ │ Web account │
│ Windows·macOS·Linux│ │ iOS · Android │ │ docs & billing │
├────────────────────┤ ├────────────────────┤ ├────────────────────┤
│The main experience:│ │The InkSpoke │ │Manage your plan, │
│dictation, refine- │ │keyboard dictates │ │API keys, active │
│ment, meetings, │ │into any app; │ │models, and your │
│voice control, │ │syncs with desktop │ │encrypted synced │
│workspaces. │ │over your network. │ │data. │
└────────────────────┘ └────────────────────┘ └────────────────────┘
  • 🖥 Desktop app — the flagship. Everything above lives here first. (Windows, macOS, Linux.)
  • 📱 Mobile apps — the InkSpoke keyboard lets you dictate into any app on your phone, and pairs with your desktop over the local network. (iOS is the most complete; Android is voice-keyboard-first, with more features on the way.)
  • 🌐 Web account — sign in at the InkSpoke site to manage your subscription, personal API keys, active models, and to view or delete your end-to-end-encrypted synced data.

Who it's for

InkSpoke fits anyone who would rather talk than type: developers dictating into code and terminals, writers and creators drafting at the speed of speech, business professionals clearing their inbox by voice, students capturing notes, and non-native English speakers who find speaking faster than typing. There are tailored guides by persona for each.

Next steps