Cadence

Cadence is an immersive reading pipeline: book file -> chapter text -> audiobook audio -> word-level synced reader data.

Overview

Imports EPUB/MOBI/AZW3 files (MOBI/AZW3 are converted to EPUB via Calibre) and extracts ordered chapter text.
Synthesizes chapter audio with Supertonic TTS.
Aligns audio to text with WhisperX for word timestamps.
Plays back in a synced reader/player UI.
Uses a streaming chapter pipeline so reading can start before full import finishes.

Pipeline

Source normalization: MOBI/AZW3 -> EPUB (Calibre, when needed)
EPUB extraction (Calibre) -> library/<book>/content/ch_XXX.txt
Per chapter: TTS synthesis (Supertonic) -> library/<book>/audio/ch_XXX.wav
Per chapter: Alignment (WhisperX) -> library/<book>/content/ch_XXX.json
Player can read chapters as soon as each chapter has audio + alignment.

Import Behavior

Cadence now runs chapter-by-chapter interleaved processing:
- If a chapter already has .wav, Cadence skips synthesis and aligns it.
- If .wav is missing, Cadence synthesizes first, then aligns.
- If .wav and .json both exist, Cadence skips that chapter.
This makes resume robust after interruptions and enables immediate reading while import is still running.
Library cards update live with ready counts (Audio x/y, Alignment x/y) during import.

Read While Importing

Cadence processes books one chapter at a time, not as one long batch.

As soon as a chapter finishes synthesis + alignment, it is immediately readable.
You can open the reader and start from available chapters while the rest of the book continues importing.
If import is interrupted, re-import resumes from existing chapter outputs instead of starting over.

UI Preview

Requirements

Windows 10/11
Python 3.12
Calibre (ebook-convert.exe) installed at:
- C:\Program Files\Calibre2\ebook-convert.exe
FFmpeg (ffmpeg.exe) available in PATH (required for Qt player speed control)
NVIDIA GPU recommended for faster TTS/ASR

Install

python -m venv venv
.\venv\Scripts\Activate.ps1
python -m pip install --upgrade pip

Install one runtime profile (fresh venv recommended):

GPU profile (default):

pip install -r requirements-gpu.txt

CPU profile:

pip install -r requirements-cpu.txt

Backward-compatible default (requirements.txt) points to GPU profile. Do not install both CPU and GPU ONNX Runtime packages in the same environment.

If you use a separate WhisperX venv, point Cadence to it:

$env:CADENCE_WHISPERX_PYTHON="C:\Users\mateo\Desktop\Cadence\venv_whisperx\Scripts\python.exe"

Run

.\venv\Scripts\Activate.ps1
python main.py

Configuration

Cadence uses cadence_settings.json (managed from the UI settings cog next to Import Book).

Settings are persisted automatically when you click Apply.
Settings are applied immediately to the current app process.
CADENCE_* environment variables are still usable for one-off CLI/script runs.

Useful keys:

CADENCE_EXTRACT_WORKERS
CADENCE_SYNTH_WORKERS
CADENCE_TTS_MAX_CHARS
CADENCE_FORCE_CPU
CADENCE_CUDA_ONLY
CADENCE_WHISPERX_MODEL
CADENCE_WHISPERX_BATCH_SIZE
CADENCE_WHISPERX_COMPUTE_TYPE
CADENCE_WHISPERX_DEVICE
CADENCE_WHISPERX_PYTHON
CADENCE_CALIBRE_PATH

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.github/workflows		.github/workflows
adapters		adapters
assets/branding		assets/branding
core		core
docs		docs
library		library
qt		qt
scripts		scripts
system		system
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
forge.spec		forge.spec
main.py		main.py
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-common.txt		requirements-common.txt
requirements-cpu.txt		requirements-cpu.txt
requirements-dev.txt		requirements-dev.txt
requirements-gpu.txt		requirements-gpu.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cadence

Overview

Pipeline

Import Behavior

Read While Importing

UI Preview

Requirements

Install

Run

Configuration

Project Docs

About

Uh oh!

Releases

Packages

Languages

mateogon/Cadence

Folders and files

Latest commit

History

Repository files navigation

Cadence

Overview

Pipeline

Import Behavior

Read While Importing

UI Preview

Requirements

Install

Run

Configuration

Project Docs

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages