NeMoASR

Automatic speech recognition with speaker diarisation.

Based on:

NVIDIA NeMo Parakeet TDT 0.6b V3: Multilingual Speech-to-Text Model for automatic speech recognition
NVIDIA NeMo Sortformer Diarizer 4spk v1 for speaker diarisation

Requirements

Linux:

sudo apt install ffmpeg

pip install git+https://github.com/HanBnrd/NeMoASR.git

MacOS:

brew install ffmpeg

pip install git+https://github.com/HanBnrd/NeMoASR.git

nemoasr myfile.mp3

pip install --upgrade git+https://github.com/HanBnrd/NeMoASR.git

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
nemoasr		nemoasr
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml