Skip to content

Comments

Added llm-type argument#39

Draft
PrateekHumane wants to merge 2 commits intoCERC-AAI:frontier/devfrom
PrateekHumane:feature/model-type-argument
Draft

Added llm-type argument#39
PrateekHumane wants to merge 2 commits intoCERC-AAI:frontier/devfrom
PrateekHumane:feature/model-type-argument

Conversation

@PrateekHumane
Copy link
Collaborator

Added an argument llm_type to choose the llm model class that will be used in both training and inference.

The valid values for llm_type are taken from the existing 'model_type' variable from the corresponding models config class. More specifically the choices are: ['llava', 'llava_mistral','llava_neox']

When adding a new llm class, as long as the class is defined, the method init_subclass will be invoked in the LlavaMetaModel base class and the Enum class variable ModelType will automatically be updated.

Prateek Humane added 2 commits February 5, 2024 19:15
Added a registry and Enum of all models to the LlavaMetaModel. All subclass config model type name is saved in the base class.
Added additional param for llm_type in train and builder scripts, as well is in most calls of "load_pretrained_model()"
Now the name is the class name and value is the config shorthand
@PrateekHumane PrateekHumane force-pushed the feature/model-type-argument branch from 52399ed to a217dce Compare February 8, 2024 00:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant