Jitian Zhao*, Changho Shin*, Tzu-Heng Huang, Satya Sai Srinath Namburi GNVV, Frederic Sala
Paper Link: TBD
pip install -r requirements.txt
python scripts/save_judge_outputs.py \
    --datasets asset_ratings civilcomments_binary allenai_preference_test_sets/pku_better_binary \
    --mode gaussian_mixture
Output path example: judge_outputs/fully_gaussian/asset/Qwen3-8B.csv
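Before launching an experiment, it can help to confirm that the judge outputs it reads are already on disk. The sketch below is a minimal, hypothetical helper, not part of the repository: it assumes the layout `judge_outputs/<mode>/<dataset>/<model>.csv`, inferred only from the single example path above, and the function names `judge_output_path` and `missing_outputs` are invented for illustration.

```python
from pathlib import Path


def judge_output_path(mode, dataset, model, root="judge_outputs"):
    """Build <root>/<mode>/<dataset>/<model>.csv.

    The layout is an assumption taken from the one example path in this README.
    """
    return Path(root) / mode / dataset / f"{model}.csv"


def missing_outputs(mode, datasets, models, root="judge_outputs"):
    """Return the expected CSV paths that do not exist on disk yet."""
    expected = [
        judge_output_path(mode, dataset, model, root)
        for dataset in datasets
        for model in models
    ]
    return [path for path in expected if not path.is_file()]


if __name__ == "__main__":
    # Dataset and model names are taken from the example path; adjust as needed.
    for path in missing_outputs("fully_gaussian", ["asset"], ["Qwen3-8B"]):
        print(f"missing: {path}")
```

If the script prints nothing, every expected file was found under the assumed layout. If the actual directory structure differs, only `judge_output_path` needs to change.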
Fully Gaussian (table 1 experiment):
python scripts/fully_gaussian_main.py --seed 2024
Gaussian mixture (table 2 experiment):
python scripts/gaussian_mixture_main.py --seed 42 --datasets civilcomments pku_better
TBD