TranscribeAI turns audio files and image scans into text quickly and accurately.
Built for a simple setup and high-volume transcription workflows.
⭐️ Star our repository to stay up-to-date with exciting new features and improvements! Get instant notifications for new releases!
- Transcribe audio files to text.
- Transcribe images and PDFs to text.
- Run a Quality Check to flag possible issues.
- Use Remediate for safe cleanups (intro/outro chatter, markdown artifacts, encoded entities).
- Get clear warnings for subtitle timestamp issues so you can review those sections manually.
- Track everything in real-time with progress and logs.
You need at least one API key:
- Mistral key (recommended; very fast, great for batch jobs): Mistral quickstart
- Gemini key (optional; slightly better image accuracy on many files): Get Gemini API key
- Mistral: Very fast, especially when processing large folders in batches. Audio results are also very good.
- Gemini: Slightly better image accuracy in many cases, so it is a strong OCR choice when precision matters most.
- Many users add both keys: use Mistral when speed matters and Gemini when image accuracy matters most.
More AI model support is coming soon.
- Open the latest release.
- Download your installer:
- macOS:
.dmg - Windows:
.exe - Linux:
.AppImageor.tar.gz
- macOS:
- Run the installer.
If your system shows a first-run security warning:
- macOS: open Privacy/Security and choose Open Anyway
- Windows: choose More info then Run anyway
- Open TranscribeAI.
- Click the gear icon (top-right) to open Settings.
- Paste your API key(s).
- Click Save.
- Choose mode: Audio or Image.
- Select input file/folder.
- Select output folder.
- Click Transcribe.
- Watch progress in the logs.
- Click Check Quality to score your transcript/subtitle files.
- Lower scores mean the app found more potential issues.
- Hover a score to see the penalty breakdown (why it scored lower).
- Red warning dots mean “review needed.”
For subtitle files (.srt):
- The app flags timestamp format/order/range issues.
- It shows cue numbers so you can quickly find where to listen and fix.
- Timestamp fixes are manual review (not auto-adjusted).
- Use the latest release.
- Install over your current version.
- Your existing settings and data remain.
This project uses the MIT License. See LICENSE.
|
