Added llm-type argument by PrateekHumane · Pull Request #39 · CERC-AAI/Robin

PrateekHumane · 2024-02-08T00:07:32Z

Added an argument llm_type to choose the llm model class that will be used in both training and inference.

The valid values for llm_type are taken from the existing 'model_type' variable from the corresponding models config class. More specifically the choices are: ['llava', 'llava_mistral','llava_neox']

When adding a new llm class, as long as the class is defined, the method init_subclass will be invoked in the LlavaMetaModel base class and the Enum class variable ModelType will automatically be updated.

Added a registry and Enum of all models to the LlavaMetaModel. All subclass config model type name is saved in the base class. Added additional param for llm_type in train and builder scripts, as well is in most calls of "load_pretrained_model()"

Now the name is the class name and value is the config shorthand

Prateek Humane added 2 commits February 5, 2024 19:15

Added LLM-type argument

705bd53

Added a registry and Enum of all models to the LlavaMetaModel. All subclass config model type name is saved in the base class. Added additional param for llm_type in train and builder scripts, as well is in most calls of "load_pretrained_model()"

Swapped the name and value of LlavaMetaModel.ModelType Enum

a217dce

Now the name is the class name and value is the config shorthand

PrateekHumane force-pushed the feature/model-type-argument branch from 52399ed to a217dce Compare February 8, 2024 00:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Added llm-type argument#39

Added llm-type argument#39
PrateekHumane wants to merge 2 commits intoCERC-AAI:frontier/devfrom
PrateekHumane:feature/model-type-argument

PrateekHumane commented Feb 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

PrateekHumane commented Feb 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant