NVIDIA · terrykong · Jan 23, 2025
diff --git a/docs/user-guide/index.rst b/docs/user-guide/index.rst
@@ -5,6 +5,7 @@
 .. toctree::
    :maxdepth: 2
 
+   reinforce.rst
    sft.rst
    knowledge-distillation.rst
    dpo.rst
@@ -19,6 +20,9 @@
 :ref:`Prerequisite Obtaining a Pre-Trained Model <prerequisite>`
    This section provides instructions on how to download pre-trained LLMs in .nemo format. The following section will use these base LLMs for further fine-tuning and alignment. 
 
+:ref:`Model Alignment by REINFORCE <nemo-aligner-reinforce>`
+   In this tutorial, we will guide you through the process of aligning a NeMo Framework model using REINFORCE. This method can be applied to various models, including LLaMa2 and Mistral, with our scripts functioning consistently across different models.
+
 :ref:`Model Alignment by Supervised Fine-Tuning (SFT) <nemo-aligner-sft>`
    In this section, we walk you through the most straightforward alignment method. We use a supervised dataset in the prompt-response pairs format to fine-tune the base model according to the desired behavior.
 
@@ -59,6 +63,14 @@
      - Mistral
      - Nemotron-4
      - Mixtral
+   * - :ref:`REINFORCE <nemo-aligner-reinforce>`
+     - Yes
+     - Yes
+     - Yes
+     - Yes (✓)
+     - Yes
+     - Yes
+     - 
    * - :ref:`SFT <nemo-aligner-sft>`
      - 
      - Yes (✓)

diff --git a/docs/user-guide/reinforce.rst b/docs/user-guide/reinforce.rst
@@ -1,6 +1,6 @@
 .. include:: /content/nemo.rsts
 
-.. _model-aligner-reinforce:
+.. _nemo-aligner-reinforce:
 
 Model Alignment by REINFORCE
 @@@@@@@@@@@@@@@@@@@@@@@@@@@@