OpenSpeak

OpenSpeak is a local-first desktop dictation app built with Tauri, Rust, and whisper.cpp.

It is designed for fast global dictation workflows:

1. Trigger recording from a global hotkey
2. Speak naturally
3. Stop and insert text where you are working

Features

- Local transcription via whisper.cpp (whisper-rs)
- Global hotkey toggle for start/stop dictation
- Menu bar (tray-first) app flow on macOS
- Recording overlay HUD for background visual feedback
- Two output modes:
  - clipboard: copy text for manual paste
  - auto-paste: copy then trigger paste automatically
- Basic spoken formatting commands:
  - comma, period, question mark
  - new line, new paragraph
- Model download and local model management:
  - Tiny (75 MB, fastest)
  - Base (142 MB)
  - Small (466 MB, recommended default)
  - Medium (1.5 GB)
  - Large-v3 (3.1 GB, highest accuracy)
  - Turbo (fast large model)
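
The spoken formatting commands listed above amount to a phrase-to-text substitution applied to the transcript. The sketch below illustrates one way this could work; it is not taken from the OpenSpeak source. The function name `apply_spoken_commands` and the `COMMANDS` table are hypothetical, and the sketch assumes the transcript arrives as plain words without punctuation already attached.

```rust
/// Hypothetical table: spoken phrase (as a word sequence) and the text it inserts.
const COMMANDS: &[(&[&str], &str)] = &[
    (&["question", "mark"], "?"),
    (&["new", "paragraph"], "\n\n"),
    (&["new", "line"], "\n"),
    (&["comma"], ","),
    (&["period"], "."),
];

/// Replaces spoken commands in `transcript` with punctuation or line breaks.
pub fn apply_spoken_commands(transcript: &str) -> String {
    let words: Vec<&str> = transcript.split_whitespace().collect();
    let mut out = String::new();
    let mut i = 0;
    while i < words.len() {
        // Find the first command whose phrase matches the words at position i.
        let matched = COMMANDS.iter().find(|(phrase, _)| {
            words.len() - i >= phrase.len()
                && phrase
                    .iter()
                    .zip(&words[i..])
                    .all(|(p, w)| w.eq_ignore_ascii_case(p))
        });
        match matched {
            Some((phrase, replacement)) => {
                // Punctuation and line breaks attach directly to the previous word.
                out.push_str(replacement);
                i += phrase.len();
            }
            None => {
                // Ordinary words are separated by a space, except at a line start.
                if !out.is_empty() && !out.ends_with('\n') {
                    out.push(' ');
                }
                out.push_str(words[i]);
                i += 1;
            }
        }
    }
    out
}
```

With this sketch, `apply_spoken_commands("hello comma world period")` yields `hello, world.`. A literal use of a word such as "period" is also replaced, so a real implementation would likely need more context than this.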

Architecture

Frontend: React + Vite
Desktop shell: Tauri v2
Core runtime: Rust
Audio capture: cpal
Inference: whisper-rs / whisper.cpp

Requirements

macOS (current focus)
Node.js 20+
Rust toolchain (rustup, cargo)
Xcode Command Line Tools
CMake (required to build whisper.cpp)

Install CMake with Homebrew:

brew install cmake

Getting Started

Install dependencies:

npm install

Run in development:

cargo tauri dev

Production build:

cargo tauri build

Build outputs:

.app: src-tauri/target/release/bundle/macos/

Install (GitHub Releases, macOS)

Current release artifacts are unsigned macOS app bundles (.app.tar.gz).

1. Download the latest OpenSpeak_*.app.tar.gz from GitHub Releases.
2. Extract it and move OpenSpeak.app to /Applications.
3. Remove the quarantine flag once:

   xattr -rd com.apple.quarantine /Applications/OpenSpeak.app

4. Launch OpenSpeak from Applications.

If macOS still blocks the launch, right-click OpenSpeak.app in Finder, choose Open, and confirm once.

Permissions (macOS)

For full functionality, OpenSpeak needs:

Microphone access (record speech)
Accessibility access (auto-paste automation)

If auto-paste is disabled, Accessibility permission is optional.
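
The rule above (Microphone always, Accessibility only for auto-paste) can be written down as a small function. This is an illustrative sketch; the `OutputMode` and `Permission` enums and the `required_permissions` function are hypothetical names, not OpenSpeak's actual types.

```rust
/// The two output modes described under Features (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum OutputMode {
    Clipboard,
    AutoPaste,
}

/// The macOS permissions named in this section (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Permission {
    Microphone,
    Accessibility,
}

/// Returns the permissions needed for the given output mode.
pub fn required_permissions(mode: OutputMode) -> Vec<Permission> {
    // Recording speech always needs the microphone.
    let mut needed = vec![Permission::Microphone];
    // Only auto-paste triggers a paste on the user's behalf.
    if mode == OutputMode::AutoPaste {
        needed.push(Permission::Accessibility);
    }
    needed
}
```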

Model Storage

By default, model files are stored under:

~/Library/Application Support/openspeak/models/

OpenSpeak also checks legacy paths left over from the project's earlier names and continues to read existing local data found there.
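
A minimal sketch of how a lookup with legacy fallback might be structured. It is an assumption, not the app's actual code: `resolve_models_dir` and `models_dir` are hypothetical names, the legacy directory names are passed in by the caller because this README does not list them, and preferring the current path over legacy ones is a guess at the intended order.

```rust
use std::path::{Path, PathBuf};

/// Directory name taken from the default path documented above.
const APP_DIR: &str = "openspeak";

/// Builds `<home>/Library/Application Support/<app_dir>/models`.
fn models_dir(home: &Path, app_dir: &str) -> PathBuf {
    home.join("Library")
        .join("Application Support")
        .join(app_dir)
        .join("models")
}

/// Returns the current models directory if it exists, otherwise the first
/// existing legacy directory, otherwise the current one (to be created).
/// `exists` is injected so the logic can be exercised without touching disk.
pub fn resolve_models_dir(
    home: &Path,
    legacy_app_dirs: &[&str],
    exists: impl Fn(&Path) -> bool,
) -> PathBuf {
    let current = models_dir(home, APP_DIR);
    if exists(current.as_path()) {
        return current;
    }
    legacy_app_dirs
        .iter()
        .map(|name| models_dir(home, name))
        .find(|dir| exists(dir.as_path()))
        .unwrap_or(current)
}
```

In real use the `exists` argument could be `|p| p.is_dir()`; the placeholder legacy name in any call site would need to be replaced with the project's actual earlier names.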

Configuration

OpenSpeak persists settings in local app data, including:

Global hotkey
Default model
Paste mode
Privacy flags

Default hotkey:

CommandOrControl+Shift+Space
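
The default hotkey string follows the common accelerator format: modifiers joined by `+`, with the key last. The sketch below shows how such a string breaks down and how `CommandOrControl` resolves per platform (Command on macOS, Control elsewhere). It is for illustration only; `Hotkey`, `parse_hotkey`, and `resolve_modifier` are hypothetical names, and the app presumably relies on its shortcut library for the real parsing.

```rust
/// A hotkey split into its modifiers and final key (hypothetical type).
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Hotkey {
    pub modifiers: Vec<String>,
    pub key: String,
}

/// Splits an accelerator such as "CommandOrControl+Shift+Space".
/// Returns `None` if any `+`-separated part is empty.
pub fn parse_hotkey(accelerator: &str) -> Option<Hotkey> {
    let mut parts: Vec<&str> = accelerator.split('+').map(str::trim).collect();
    if parts.iter().any(|p| p.is_empty()) {
        return None;
    }
    // The last part is the key; everything before it is a modifier.
    let key = parts.pop()?;
    Some(Hotkey {
        modifiers: parts.iter().map(|m| m.to_string()).collect(),
        key: key.to_string(),
    })
}

/// Maps the cross-platform modifier to a concrete one; other names pass through.
pub fn resolve_modifier(name: &str, is_macos: bool) -> &str {
    match (name, is_macos) {
        ("CommandOrControl", true) => "Command",
        ("CommandOrControl", false) => "Control",
        _ => name,
    }
}
```

Passing `cfg!(target_os = "macos")` as `is_macos` would select the modifier for the platform the code is compiled for.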

Development Notes

The app runs tray-first by default; open settings from the tray menu.
Closing the settings window hides it instead of quitting.
The overlay is a separate transparent always-on-top window.

GitHub Releases

This repository includes a GitHub Actions release workflow:

File: .github/workflows/release.yml
Trigger: push a semantic version tag (v*.*.*)
Runner: macos-latest
Artifacts uploaded to GitHub Releases via tauri-action

Create a release

1. Bump versions if needed (package.json, src-tauri/Cargo.toml, src-tauri/tauri.conf.json).
2. Commit to main.
3. Create and push a tag:
   - git tag v0.1.1
   - git push origin v0.1.1
4. Wait for the Release workflow to complete.
5. Verify assets in the GitHub Releases tab.
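
Since the workflow only fires on tags shaped like v*.*.*, it can help to check a tag before pushing it. The sketch below is a hypothetical helper, not part of the repository, and it is deliberately stricter than the workflow's glob pattern: it accepts only `vMAJOR.MINOR.PATCH` with numeric components.

```rust
/// Parses a release tag such as "v0.1.1" into (major, minor, patch).
/// Returns `None` for anything that is not `v` followed by three
/// dot-separated non-negative integers.
pub fn parse_release_tag(tag: &str) -> Option<(u64, u64, u64)> {
    let version = tag.strip_prefix('v')?;
    let mut parts = version.split('.').map(|p| {
        // Reject signs and other non-digit characters that `parse` might accept.
        if p.bytes().all(|b| b.is_ascii_digit()) {
            p.parse::<u64>().ok()
        } else {
            None
        }
    });
    let major = parts.next()??;
    let minor = parts.next()??;
    let patch = parts.next()??;
    // A fourth component means the tag is not MAJOR.MINOR.PATCH.
    if parts.next().is_some() {
        return None;
    }
    Some((major, minor, patch))
}
```

The parsed triple can also be compared against the version in package.json, src-tauri/Cargo.toml, and src-tauri/tauri.conf.json to catch a forgotten bump.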

Unsigned build note

Because release builds are currently unsigned, users may need to run:

xattr -rd com.apple.quarantine /Applications/OpenSpeak.app

Optional: signing and notarization (recommended)

The workflow supports macOS signing/notarization if these repository secrets are set:

APPLE_SIGNING_IDENTITY
APPLE_CERTIFICATE (base64-encoded .p12)
APPLE_CERTIFICATE_PASSWORD
APPLE_ID
APPLE_PASSWORD (app-specific password)
APPLE_TEAM_ID

If these secrets are not set, builds still complete, but the artifacts are unsigned and users must clear the quarantine flag as described above. Signed and notarized artifacts give a smoother install experience.
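
Whether a build will be signed depends on all six secrets being present. A small check like the following could report which ones are missing. This is a sketch under assumptions: `missing_signing_secrets` is a hypothetical helper, and it treats blank values as missing because unset GitHub Actions secrets expand to empty strings.

```rust
/// The secret names listed in this section.
const SIGNING_SECRETS: [&str; 6] = [
    "APPLE_SIGNING_IDENTITY",
    "APPLE_CERTIFICATE",
    "APPLE_CERTIFICATE_PASSWORD",
    "APPLE_ID",
    "APPLE_PASSWORD",
    "APPLE_TEAM_ID",
];

/// Returns the names of secrets that are unset or blank.
/// `lookup` is injected so the check works with any source of values.
pub fn missing_signing_secrets(
    lookup: impl Fn(&str) -> Option<String>,
) -> Vec<&'static str> {
    SIGNING_SECRETS
        .iter()
        .copied()
        .filter(|name| lookup(*name).map_or(true, |value| value.trim().is_empty()))
        .collect()
}
```

To check the current environment, the lookup can be `|name| std::env::var(name).ok()`; an empty result means all six values are available.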

License

Add your preferred license in LICENSE.
