This repository contains the official implementation of the paper "Categorical Schrödinger Bridge Matching", accepted at ICML 2025.
The paper extends the Schrödinger Bridge problem to discrete time and discrete state spaces.
Create the Anaconda environment using the following command:

```bash
conda env create -f environment.yml
```

- Use this link to obtain the CelebA dataset;
- Follow these instructions to obtain the AFHQv2 dataset.
Additionally, for the CelebA dataset, rename the main folder to `celeba`, then rename `celeba/img_align_celeba/img_align_celeba` to `celeba/img_align_celeba/raw`.
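A minimal shell sketch of these renames (the name of the downloaded folder is an assumption; adjust it to whatever your download produced):

```bash
# Assumed starting point: the CelebA archive was extracted into ./CelebA (adjust as needed)
mv CelebA celeba
# Move the nested image folder to the expected "raw" location
mv celeba/img_align_celeba/img_align_celeba celeba/img_align_celeba/raw
```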
- Configure the appropriate configuration file `configs/vqgan_*.yaml`;
- Run the corresponding `quantize_*.sh` script to save quantized images as `.npy` files in `celeba/img_align_celeba/quantized/` or `afhq/*/*/`.
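For example, a hypothetical invocation for CelebA (the exact script name is an assumption; use whichever `quantize_*.sh` script matches your dataset):

```bash
# Quantize CelebA images after configuring configs/vqgan_celeba_f8_1024.yaml
# (script name assumed; see the repository's quantize_*.sh scripts)
bash quantize_celeba.sh
```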
> [!TIP]
> For more details on training VQ-GAN, refer to the official repository.
- Set `tokenizer.path` in the main config file `configs/amazon.yaml` or `configs/yelp.yaml`;
- Run `train_tokenizer_*.sh` to train the tokenizer.
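As an illustration, a hypothetical run for the Amazon Reviews setup (the exact script name is an assumption; pick the `train_tokenizer_*.sh` variant matching your dataset):

```bash
# Train the tokenizer for the Amazon Reviews experiment
# (assumes tokenizer.path has already been set in configs/amazon.yaml)
bash train_tokenizer_amazon.sh
```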
- Set the corresponding configuration file;
- Run the appropriate script or notebook (see the table below; a minimal launch example is sketched after it).
| Experiment name | Script/Notebook | Configs (`configs/`) | Weights (W&B link) |
|---|---|---|---|
| Convergence of D-IMF on Discrete Spaces | `notebooks/convergence_d_imf.ipynb` | N/A | N/A |
| Illustrative 2D Experiments | `train_swiss_roll.sh` | `swiss_roll.yaml` | N/A |
| Unpaired Translation on Colored MNIST | `train_cmnist.sh` | `cmnist.yaml` | CSBM |
| Unpaired Translation of CelebA Faces | `train_celeba.sh` | `celeba.yaml`, `vqgan_celeba_f8_1024.yaml` | CSBM, VQ-GAN |
| Unpaired Translation of AFHQ Faces | `train_afhq.sh` | `afhq.yaml`, `vqgan_afhq_f32_1024.yaml` | N/A |
| Unpaired Text Style Transfer of Amazon Reviews | `train_amazon.sh` | `amazon.yaml` | CSBM, Tokenizer |
| Unpaired Text Style Transfer of Yelp Reviews | `train_yelp.sh` | `yelp.yaml` | N/A |
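For example, a minimal launch of the illustrative 2D experiment (any extra environment variables or arguments your setup needs are not shown):

```bash
# Train CSBM on the Swiss Roll example; configuration is read from configs/swiss_roll.yaml
bash train_swiss_roll.sh
```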
> [!TIP]
> Set the `exp_dir` parameter in any `train_*.sh` script to define a custom path for saving experiment results, following the structure below:
```
data.type                       # e.g., toy, images, etc.
`-- data.dataset                # e.g., swiss_roll, cmnist, etc.
    `-- prior.type              # e.g., gaussian, uniform, etc.
        |-- checkpoints
        |   |-- forward_*
        |   |   `-- model.safetensors
        |   |-- ...
        |   `-- backward_*
        |       `-- ...
        |-- samples             # images of samples
        |-- trajectories        # images of trajectories
        `-- config.yaml         # copied config
```

- Specify the `exp_path` parameter, pointing to the saved experiment folder;
- Run `eval_*.sh` with the appropriate `iteration` argument.
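For instance, a hypothetical evaluation call for the CelebA experiment (the script name and the way the iteration is passed are assumptions; check the corresponding `eval_*.sh` script):

```bash
# Evaluate the CelebA experiment at a chosen training iteration
# (exp_path should already point to the saved experiment folder)
bash eval_celeba.sh 100000   # placeholder iteration value
```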
> [!IMPORTANT]
> Reusing an earlier evaluation pipeline for the CelebA dataset may yield different results. In the paper, images were generated first (see `scripts/generate.py`) and then evaluated with the following metrics (see `notebooks/eval.ipynb`):
- FID from pytorch-fid
- CMMD from cmmd-pytorch
- LPIPS from torchmetrics
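As an illustration, FID between two image folders can be computed with the `pytorch-fid` command-line tool (folder paths below are placeholders; CMMD and LPIPS are computed through their respective packages):

```bash
# Compute FID between generated samples and reference images (placeholder paths)
python -m pytorch_fid path/to/generated_images path/to/reference_images
```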
```bibtex
@inproceedings{
    ksenofontov2025categorical,
    title={Categorical {Schr\"odinger} Bridge Matching},
    author={Grigoriy Ksenofontov and Alexander Korotin},
    booktitle={Forty-second International Conference on Machine Learning},
    year={2025},
    url={https://openreview.net/forum?id=RBly0nOr2h}
}
```

- Weights & Biases — experiment-tracking and visualization toolkit;
- Hugging Face — Tokenizers and Accelerate libraries for tokenizer implementation, parallel training, and checkpoint hosting on the Hub;
- D3PM — reference implementation of discrete-diffusion models;
- Taming Transformers — original VQ-GAN codebase;
- VQ-Diffusion — vector-quantized diffusion architecture;
- MDLM — diffusion architecture for text-generation experiments;
- ASBM — evaluation metrics and baseline models for CelebA face transfer;
- Balancing the Style-Content Trade-Off in Sentiment Transfer Using Polarity-Aware Denoising — processed Amazon Reviews dataset and sentiment-transfer baselines;
- Inkscape — an excellent open-source editor for vector graphics.