use chatterbox MTLTokenizer for multilingual. #362

litmudoc · 2026-01-04T17:41:08Z

Context

The Chatterbox TTS model supports multilingual text-to-speech, but the tokenizer selection code did not configure MTLTokenizer for multilingual models. When a model with "multilingual": true in its config.json was loaded, it incorrectly used the English tokenizer (EnTokenizer) instead of the multilingual tokenizer.

Description

Modified the _init_tokenizers(), from_pretrained(), and post_load_hook() methods in chatterbox.py to:

Check for the existence of config.json in the model directory
Read the "multilingual" configuration flag from JSON config
Select and instantiate MTLTokenizer when multilingual is enabled, or EnTokenizer for English-only models
Log a console message indicating which tokenizer was loaded
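
The steps above can be sketched roughly as follows. This is a minimal illustration, not the actual code in chatterbox.py: the two tokenizer classes here are hypothetical stand-ins with an assumed constructor, and the helper names (`is_multilingual`, `select_tokenizer`) and the `tokenizer.json` vocab file name are assumptions made for the example.

```python
import json
from pathlib import Path


class EnTokenizer:
    """Hypothetical stand-in for the English tokenizer (constructor assumed)."""

    def __init__(self, vocab_path):
        self.vocab_path = vocab_path


class MTLTokenizer:
    """Hypothetical stand-in for the multilingual tokenizer (constructor assumed)."""

    def __init__(self, vocab_path):
        self.vocab_path = vocab_path


def is_multilingual(model_dir):
    """Return True only if config.json exists and sets "multilingual" to a truthy value."""
    config_path = Path(model_dir) / "config.json"
    if not config_path.exists():
        # No config file: fall back to English-only behavior.
        return False
    with open(config_path, encoding="utf-8") as f:
        config = json.load(f)
    return bool(config.get("multilingual", False))


def select_tokenizer(model_dir, vocab_file="tokenizer.json"):
    """Instantiate the tokenizer that matches the model's configuration."""
    vocab_path = Path(model_dir) / vocab_file  # file name is an assumption
    if is_multilingual(model_dir):
        tokenizer = MTLTokenizer(vocab_path)
        print("Loaded multilingual tokenizer (MTLTokenizer)")
    else:
        tokenizer = EnTokenizer(vocab_path)
        print("Loaded English tokenizer (EnTokenizer)")
    return tokenizer
```

Defaulting to EnTokenizer when config.json or the flag is missing keeps existing English-only models working unchanged, which matches the "no breaking changes" item in the checklist.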

Changes in the codebase

Tokenizer Selection Logic: Added conditional logic to check config["multilingual"] and instantiate the appropriate tokenizer class
Import Updates: Added MTLTokenizer to imports alongside existing EnTokenizer
User Feedback: Added print statements showing which tokenizer was loaded ("Loaded multilingual tokenizer (MTLTokenizer)" or "Loaded English tokenizer (EnTokenizer)")
Three Locations Modified: _init_tokenizers(), load() method for S3 checkpoints, and load() method for model loading

Additional information

This change enables Chatterbox multilingual TTS models to properly tokenize input text, which is required for text-to-speech generation in multiple languages. The implementation follows the existing pattern of checking model configuration files.

Checklist

Code tested with multilingual Chatterbox models
Documentation updated for multilingual model usage
No breaking changes to existing English-only models

litmudoc

I tried to use MTLTokenizer for Chatterbox Multilingual support. Thank you.

Blaizzy · 2026-01-04T18:24:19Z

Hey @litmudoc

Awesome, do you have any audio samples?

litmudoc · 2026-01-05T08:07:46Z

mlx_audio.tts.generate \
    --model litmudoc/Chatterbox-Multilingual-MLX-v2-fp16 \
    --exaggeration 0.6 \
    --cfg_scale 0.35 \
    --temperature 0.8 \
    --lang_code ko \
    --text ", 한국말이 너무 자연스러워요\! 감격하고, 또 감격 했습니다." \
    --ref_audio ko.wav \
    --ref_text "우리는 정말로 허름한 호텔에 묵었지만, 그래도 행복했다." \
    --verbose --play

This is a sample created with the Chatterbox multilingual model on a local Mac.
audio_000.wav

I made a sample using the Korean wav obtained from the Chatterbox Multilingual Demo. I'm very pleased!
Chatterbox Multilingual Demo
Thanks to your amazing mlx_audio, the Chatterbox Multilingual model is running smoothly on my local Mac. Thank you!
Chatterbox-Multilingual-MLX-v2-fp16

Blaizzy

LGTM, thanks!

use MTLTokenizer for multilingual.

aabd744

litmudoc changed the title ~~use MTLTokenizer for multilingual.~~ use chatterbox MTLTokenizer for multilingual. Jan 4, 2026

litmudoc marked this pull request as draft January 5, 2026 14:34

Apply pre-commit fixes

f3c387d

litmudoc marked this pull request as ready for review January 5, 2026 15:13

Blaizzy approved these changes Jan 5, 2026

Blaizzy force-pushed the main branch from 4242976 to 558546c Compare January 5, 2026 17:48

Blaizzy force-pushed the Edit-chatterbox-can-use-MTLTokenizer branch from 6b92122 to f3c387d Compare January 5, 2026 17:53

Blaizzy added 5 commits January 5, 2026 19:09

Fix MTLTokenizer language_id

6dbcc6b

set default language to en

fc1d769

Refactor tokenizer initialization and error handling

42aac02

remove debug statements

0c02b68

Make model argument required in parse_args function

aed96b9

Blaizzy merged commit 9220f2c into Blaizzy:main Jan 5, 2026
10 checks passed

litmudoc deleted the Edit-chatterbox-can-use-MTLTokenizer branch January 6, 2026 02:18
