Automatic speech recognition with speaker diarisation.
Based on:
- NVIDIA NeMo Parakeet TDT 0.6b V3: Multilingual Speech-to-Text Model for automatic speech recognition
- NVIDIA NeMo Sortformer Diarizer 4spk v1 for speaker diarisation
Linux:
sudo apt install ffmpegpip install git+https://github.com/HanBnrd/NeMoASR.git
MacOS:
brew install ffmpegpip install git+https://github.com/HanBnrd/NeMoASR.gitnemoasr myfile.mp3pip install --upgrade git+https://github.com/HanBnrd/NeMoASR.git