Audio transcription and text translation CLI. Transcribes audio files and translates text files, automatically selecting the right provider based on the source language.
uv pip install -e .| Variable | Required for |
|---|---|
SARVAM_API_KEY |
Indian language transcription and translation |
OPENAI_API_KEY |
Non-Indian language transcription and translation |
babel-scribe recording.mp3 --from hibabel-scribe recording.mp3 --from es --to frbabel-scribe recording.mp3 --from hi --timestampsbabel-scribe essay.txt --from hi --to enbabel-scribe file1.mp3 file2.mp3 --from ta --to enbabel-scribe recording.mp3 --from hi --output-format jsonFor all options and examples, run babel-scribe --help.
- Indian languages → Sarvam AI (
saaras:v3). When the target is English, Sarvam translates in a single step. - All other languages → OpenAI Whisper (
whisper-1). When the target is English, the Whisper translations endpoint is used.
- Indian↔English → Sarvam AI (
sarvam-translate:v1) - Indian↔non-English → chained via English (Sarvam + OpenAI)
- All other pairs → OpenAI (
gpt-5-mini)