First of all, great paper and great that you guys make your code publish available. :-) For both I need CUDA: train_self_distill.sh and train_mutual_distill.sh. Does the repo/training also works without CUDA available?