Add new example to fine-tune Llama-2-70B with LoRA #80
base: master
Conversation
es94129 left a comment:
Thanks for adding this, looks very cool!
Wondering why deepspeed is required, is it for the memory optimization?
# MAGIC
# MAGIC # Fine tune llama-2-70b with deepspeed
# MAGIC
# MAGIC [Llama 2](https://huggingface.co/meta-llama) is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. It is trained with 2T tokens and supports context length window upto 4K tokens. [Llama-2-70b-hf](https://huggingface.co/meta-llama/Llama-2-70b-hf) is the 7B pretrained model, converted for the Hugging Face Transformers format.
nit
Suggested change:
- # MAGIC [Llama 2](https://huggingface.co/meta-llama) is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. It is trained with 2T tokens and supports context length window upto 4K tokens. [Llama-2-70b-hf](https://huggingface.co/meta-llama/Llama-2-70b-hf) is the 7B pretrained model, converted for the Hugging Face Transformers format.
+ # MAGIC [Llama 2](https://huggingface.co/meta-llama) is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. It is trained with 2T tokens and supports context length window upto 4K tokens. [Llama-2-70b-hf](https://huggingface.co/meta-llama/Llama-2-70b-hf) is the 70B pretrained model, converted for the Hugging Face Transformers format.
deepspeed is used for multi-GPU training with LoRA.
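To make the deepspeed + LoRA interplay concrete, here is a minimal sketch of how a ZeRO config typically plugs into a Hugging Face Trainer run with a PEFT LoRA model. Only the model path, config path, and output dir are taken from this PR; the LoRA hyperparameters and training arguments below are illustrative assumptions, not the actual values in scripts/fine_tune_lora.py.

```python
# Illustrative sketch only -- not the PR's fine_tune_lora.py. DeepSpeed is enabled by
# pointing TrainingArguments at a ZeRO config; the deepspeed launcher then shards
# optimizer state and gradients across GPUs, while PEFT keeps only the small LoRA
# adapter matrices trainable on top of the frozen 70B base model.
from transformers import AutoModelForCausalLM, AutoTokenizer, Trainer, TrainingArguments
from peft import LoraConfig, get_peft_model

MODEL_PATH = "meta-llama/Llama-2-70b-hf"
CONFIG_PATH = "../../config/a10_config_zero2.json"   # ZeRO stage-2 config from the PR

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH)

# Hypothetical LoRA settings; the script's actual values may differ.
lora_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)

training_args = TrainingArguments(
    output_dir="/local_disk0/output",
    per_device_train_batch_size=1,
    num_train_epochs=1,
    deepspeed=CONFIG_PATH,   # this flag is what hands the run over to deepspeed
)
# trainer = Trainer(model=model, args=training_args, train_dataset=..., tokenizer=tokenizer)
# trainer.train()
```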
Since 07 is used for AI gateway, maybe use other indices?
Sure. Let's design a proper ordering afterwards.
# MAGIC %sh
# MAGIC deepspeed \
# MAGIC --num_gpus 2 \
--num_gpus is probably not needed because deepspeed can use all the GPUs on the machine
Good point. Let me remove it.
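With the flag dropped, the launch cell would presumably reduce to something like this (the deepspeed launcher falls back to all GPUs visible on the node; the script path and output dir are copied from the hunk further down in this diff):

```
# MAGIC %sh
# MAGIC deepspeed \
# MAGIC   scripts/fine_tune_lora.py \
# MAGIC   --output_dir="/local_disk0/output"
```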
MODEL_PATH = 'meta-llama/Llama-2-70b-hf'
TOKENIZER_PATH = 'meta-llama/Llama-2-70b-hf'
DEFAULT_TRAINING_DATASET = "mosaicml/dolly_hhrlhf"
CONFIG_PATH = "../../config/a10_config_zero2.json"
Maybe rename the file to a100_...?
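For readers who have not opened the config file: a ZeRO stage-2 config of the kind the a10_config_zero2.json (or a100_..., per the comment above) name suggests usually looks roughly like the sketch below. The values are illustrative assumptions, not the actual contents of the file in this PR.

```python
# Rough shape of a typical DeepSpeed ZeRO stage-2 config when used through the
# Hugging Face Trainer ("auto" values are filled in from TrainingArguments).
# Illustrative only; the PR's actual JSON may differ.
import json

zero2_config = {
    "bf16": {"enabled": "auto"},
    "zero_optimization": {
        "stage": 2,                                  # shard optimizer state + gradients
        "offload_optimizer": {"device": "cpu"},      # spill optimizer state to CPU RAM
        "overlap_comm": True,
        "contiguous_gradients": True,
    },
    "gradient_accumulation_steps": "auto",
    "train_micro_batch_size_per_gpu": "auto",
}

with open("a100_config_zero2.json", "w") as f:       # renamed per the review comment
    json.dump(zero2_config, f, indent=2)
```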
# MAGIC deepspeed \
# MAGIC --num_gpus 2 \
# MAGIC scripts/fine_tune_lora.py \
# MAGIC --output_dir="/local_disk0/output"
Q: What is the difference between --output_dir and /local_disk0/final_model, is the latter just the LoRA weights?
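If the script follows the common PEFT pattern, --output_dir would hold the Trainer's checkpoints and training state, while /local_disk0/final_model would hold only the saved LoRA adapter. A sketch of that split, assuming a `trainer` wrapping a PEFT model and a `tokenizer` as in the earlier sketch:

```python
# Assumed pattern (not the actual script): the Trainer writes checkpoints, optimizer
# state and logs under --output_dir, while save_pretrained on a PEFT-wrapped model
# writes only the adapter weights plus adapter_config.json -- a small artifact even
# for a 70B base model.
trainer.train()                                              # checkpoints -> /local_disk0/output
trainer.model.save_pretrained("/local_disk0/final_model")    # LoRA adapter only
tokenizer.save_pretrained("/local_disk0/final_model")
```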
# COMMAND ----------

# MAGIC %sh
# MAGIC ls /local_disk0/final_model
Could you also add instructions or code for how to load this for inference?
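Not to answer for the author, but if the adapter was saved with PEFT's save_pretrained, loading it for inference would typically look something like the sketch below (paths, dtype, and generation settings are illustrative assumptions):

```python
# Load the frozen base model, then attach the saved LoRA adapter on top of it.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

BASE_MODEL = "meta-llama/Llama-2-70b-hf"
ADAPTER_PATH = "/local_disk0/final_model"    # assumed location of the saved adapter

tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
base_model = AutoModelForCausalLM.from_pretrained(
    BASE_MODEL,
    torch_dtype=torch.bfloat16,
    device_map="auto",                       # spread the 70B weights across available GPUs
)
model = PeftModel.from_pretrained(base_model, ADAPTER_PATH)
model.eval()

inputs = tokenizer("What is machine learning?", return_tensors="pt").to(model.device)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If serving latency matters, calling model.merge_and_unload() folds the adapter into the base weights so inference runs on a plain transformers model.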
Co-authored-by: Ying Chen <ying.chen@databricks.com>
Tested on: https://adb-7064161269814046.2.staging.azuredatabricks.net/?o=7064161269814046#notebook/94670986903573/command/94670986903574