Intelligent Dictation Engine with real-time speech-to-text, filler removal, and dual tone transformation (Professional & Casual).
- Python 3.10+
- Intel Core i5 / AMD Ryzen 5 or better
- Windows 10/11
# Clone the repository
git clone https://github.com/SharadhNaidu/CleanDictate.git
cd CleanDictate
# Create virtual environment
python -m venv venv
venv\Scripts\activate
# Install PyTorch with CUDA 12.1 support
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
# Install dependencies
pip install faster-whisper transformers spacy pyaudio pynput scipy
# Download spaCy model
python -m spacy download en_core_web_sm
# Run the application
python cleandictate.py

| Key | Action |
|---|---|
| F9 | Start Recording |
| F10 | Stop Recording |
| F11 | Copy Output |
| ESC | Quit |
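
The key bindings above can be read as a small state machine. The following is a minimal sketch of that idea, not CleanDictate's actual code: `DictationState`, `handle_key`, and the list standing in for the system clipboard are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DictationState:
    """Hypothetical container for the app state the hotkeys act on."""
    recording: bool = False
    running: bool = True
    output: str = ""
    # Stand-in for the system clipboard so the sketch has no OS dependencies.
    clipboard: List[str] = field(default_factory=list)


def handle_key(state: DictationState, key: str) -> DictationState:
    """Apply one hotkey press (given by its name) to the state."""
    if key == "F9":        # Start Recording
        state.recording = True
    elif key == "F10":     # Stop Recording
        state.recording = False
    elif key == "F11":     # Copy Output
        state.clipboard.append(state.output)
    elif key == "ESC":     # Quit: also stops any recording in progress
        state.recording = False
        state.running = False
    # Any other key is ignored.
    return state


state = DictationState(output="Hello world.")
for pressed in ["F9", "F10", "F11", "ESC"]:
    handle_key(state, pressed)
```

In a real application, a `pynput` `keyboard.Listener` callback could translate `keyboard.Key.f9` and the other keys into these names before calling `handle_key`. Keeping the dispatch logic free of `pynput` makes it easy to test without a keyboard.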
- Live: Real-time VAD, instant typing
- Document: Record all, process on stop
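
The difference between the two modes can be sketched as follows. This is an illustration under stated assumptions, not the project's implementation: `run_live`, `run_document`, and the `transcribe` and `is_speech` callables are hypothetical, and the fake versions below treat byte strings as if they were audio.

```python
from typing import Callable, Iterable, Iterator


def run_live(
    chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
    is_speech: Callable[[bytes], bool],
) -> Iterator[str]:
    """Live mode: gate each chunk with VAD and emit text as soon as it is ready."""
    for chunk in chunks:
        if not is_speech(chunk):   # VAD says silence: skip this chunk
            continue
        text = transcribe(chunk)
        if text:
            yield text             # a real app would type this out immediately


def run_document(
    chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
) -> str:
    """Document mode: buffer everything, then transcribe once on stop."""
    return transcribe(b"".join(chunks))


# Fake stand-ins (assumptions): decode bytes instead of running a speech model.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8").strip()


def fake_is_speech(audio: bytes) -> bool:
    return bool(audio.strip())


live_text = list(run_live([b"hello ", b"   ", b"world"], fake_transcribe, fake_is_speech))
doc_text = run_document([b"hello ", b"world"], fake_transcribe)
```

Live mode trades context for latency, since each chunk is transcribed on its own. Document mode gives the recognizer the full recording at the cost of waiting until recording stops.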
- neutral: No style change
- formal: Professional business tone
- casual: Friendly conversational tone
- concise: Brief and direct
- dual: Both formal & casual output
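
One way to picture how the five styles relate is a small dispatcher. This is a sketch only: `apply_style` and the `rewrite` callable are hypothetical, and `fake_rewrite` is a placeholder for whatever model-based transformation the project actually performs.

```python
from typing import Callable, Dict

STYLES = ("neutral", "formal", "casual", "concise", "dual")


def apply_style(
    text: str,
    style: str,
    rewrite: Callable[[str, str], str],
) -> Dict[str, str]:
    """Return a mapping from tone name to the text rewritten in that tone."""
    if style not in STYLES:
        raise ValueError(f"unknown style: {style!r}")
    if style == "neutral":
        # No style change: pass the text through untouched.
        return {"neutral": text}
    if style == "dual":
        # Dual produces both a formal and a casual version of the same input.
        return {
            "formal": rewrite(text, "formal"),
            "casual": rewrite(text, "casual"),
        }
    return {style: rewrite(text, style)}


# Placeholder rewrite (assumption): tags the text instead of calling a model.
def fake_rewrite(text: str, tone: str) -> str:
    return f"[{tone}] {text}"


result = apply_style("see you at noon", "dual", fake_rewrite)
```

Returning a dictionary in every case keeps the caller's code the same whether one tone or two were requested.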