Echolancer is a multi-speaker, transformer decoder-only English TTS model. We use NeuCodec as the audio tokenizer.
We (my cat and I) release pretrained checkpoints, notebooks, and a technical report.
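To make the architecture concrete, here is a minimal sketch of what decoder-only TTS generation looks like: text tokens go in as the prompt, and the model autoregressively emits NeuCodec audio tokens that are then decoded back to a waveform. All names here (`model`, `codec.decode`, `eos_token_id`) are hypothetical placeholders, not the actual Echolancer API; see the Colab demos for real inference code.

```python
import torch

@torch.no_grad()
def synthesize(model, text_ids: torch.Tensor, codec, max_steps: int = 2048):
    """Greedy sketch: the decoder consumes text tokens as the prompt,
    then emits NeuCodec audio tokens one at a time until EOS."""
    seq = text_ids  # [1, T_text]
    for _ in range(max_steps):
        logits = model(seq)                               # [1, T, vocab]
        next_tok = logits[:, -1].argmax(dim=-1, keepdim=True)
        if next_tok.item() == model.eos_token_id:         # placeholder attr
            break
        seq = torch.cat([seq, next_tok], dim=1)
    audio_codes = seq[:, text_ids.shape[1]:]              # strip text prompt
    return codec.decode(audio_codes)                      # hypothetical call
```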
| Name | Params | Training Data | Speaker Control | Download | Demo |
|---|---|---|---|---|---|
| Echolancer Stage 3 ZS | ~1.3B | Base+7k hours multi-speaker | ✔️ Zero-shot (ECAPA-TDNN) | HuggingFace | |
| Echolancer Stage 3 Base | ~1.3B | 30K+ hours multi-speaker | ❌ None (random) | HuggingFace | N/A |
| Echolancer Stage 2 ZS | ~550M | Base+7k hours multi-speaker | ✔️ Zero-shot (ECAPA-TDNN) | HuggingFace | |
| Echolancer Stage 2 Base | ~550M | 30K+ hours multi-speaker | ❌ None (random) | HuggingFace | N/A |
| Echolancer Stage 1 ZS | ~177M | Base+7k hours multi-speaker | ✔️ Zero-shot (ECAPA-TDNN) | HuggingFace | |
| Echolancer Stage 1 Base | ~177M | 30K+ hours multi-speaker | ❌ None (random) | HuggingFace | N/A |
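The ZS checkpoints in the table above are conditioned on a speaker embedding extracted from a short reference clip. As a rough illustration, this is how an ECAPA-TDNN embedding can be extracted with the public SpeechBrain checkpoint; whether this is the exact encoder and checkpoint Echolancer was trained against is an assumption.

```python
import torchaudio
from speechbrain.inference import EncoderClassifier

# Public ECAPA-TDNN speaker encoder trained on VoxCeleb.
encoder = EncoderClassifier.from_hparams(
    source="speechbrain/spkrec-ecapa-voxceleb",
    savedir="pretrained/ecapa",
)

# Load a short mono reference clip (16 kHz expected by this checkpoint).
signal, sr = torchaudio.load("reference_speaker.wav")

# Fixed-size speaker embedding used to condition generation: [1, 1, 192].
embedding = encoder.encode_batch(signal)
```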
For inference code, see the Colab demos.
Items marked with ❌ are not yet available but are high priority.
- ✔️ Base model without speaker conditioning
- ✔️ Inference notebook
- ✔️ Zero-shot
- ✔️ Multi-GPU training
- 🟡 LoRA finetuning (already supported; guide still to be written; see the sketch after this list)
- ❌ Inference with KV cache
- ❌ ONNX export
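Until the LoRA guide lands, here is a minimal sketch of the idea: freeze a pretrained linear layer and learn a low-rank update `W + (alpha/r) * B @ A` on top of it. The rank, scaling, and which layers to wrap are illustrative choices, not the repo's actual configuration.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen nn.Linear with a trainable low-rank update."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pretrained weight stays frozen
        # B starts at zero so the wrapped layer initially matches the base.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)
```

In practice only the attention projection layers are usually wrapped, and only the `A`/`B` matrices are passed to the optimizer.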
The base model can be finetuned to adapt it to a new voice (or several). You can do either full finetuning or LoRA. For LoRA we recommend at least 10 minutes of audio; for full finetuning, substantially more.
To finetune on a single GPU:

```bash
python train.py --train_config config/train_config.yaml --model_config config/model_config.yaml --shards_dir /path/to/shards --out_dir output
```

For multi-GPU training:

```bash
torchrun --nproc_per_node=NUM_GPUS train.py --train_config config/train_config.yaml --model_config config/model_config.yaml --shards_dir /path/to/shards --out_dir output
```

TODO: expand this
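`torchrun --nproc_per_node=NUM_GPUS` launches one training process per GPU on the node; set `NUM_GPUS` to the number of devices you want to use.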
This codebase and model weights are released under the MIT license; basically, do what you want.
For business or other formal inquiries, please e-mail nika109021@gmail.com.