[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward (a minimal loss sketch follows this list)
[ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
Video Generation Benchmark
DPO-Shift: Shifting the Distribution of Direct Preference Optimization
Code for "ReSpace: Text-Driven 3D Indoor Scene Synthesis and Editing with Preference Alignment"
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
[ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"
[ICML 25] "Preference Optimization for Combinatorial Optimization Problems"
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
[ICML 2025] TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
[NeurIPS 2025] Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback
LLM-Driven Preference Data Synthesis for Proactive Prediction of the User’s Next Utterance in Human–Machine Dialogue
Survey of preference alignment algorithms
Generate synthetic datasets for instruction tuning and preference alignment using tools like `distilabel` for efficient and scalable data creation.
Creating a GPT-2-Based Chatbot with Human Preferences
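For orientation, here is a minimal sketch of the reference-free objective named in the SimPO entry above: the reward for a response is its length-normalized log-likelihood under the policy itself (no reference model), and a Bradley-Terry loss pushes the chosen response's reward above the rejected one's by a target margin. The function name and the default values of `beta` and `gamma` are illustrative assumptions, not taken from the repository's code.

```python
import torch
import torch.nn.functional as F

def simpo_loss(chosen_logps: torch.Tensor,
               rejected_logps: torch.Tensor,
               chosen_lens: torch.Tensor,
               rejected_lens: torch.Tensor,
               beta: float = 2.0,
               gamma: float = 1.0) -> torch.Tensor:
    """Reference-free SimPO loss over a batch of preference pairs.

    chosen_logps / rejected_logps: summed token log-probs of each response under the policy.
    chosen_lens / rejected_lens: response lengths in tokens, used for length normalization.
    beta, gamma: reward scale and target reward margin (illustrative defaults).
    """
    # Reward = beta * average per-token log-likelihood (length-normalized, no reference model).
    chosen_reward = beta * chosen_logps / chosen_lens
    rejected_reward = beta * rejected_logps / rejected_lens
    # Bradley-Terry preference loss with a target margin gamma between chosen and rejected.
    return -F.logsigmoid(chosen_reward - rejected_reward - gamma).mean()
```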