Adding gpt_oss_20b and gpt_oss_120b to config_registry.py #2432

Open
chelsea0x3b wants to merge 2 commits into pytorch:main from chelsea0x3b:gpt-oss-configs

@chelsea0x3b chelsea0x3b commented Feb 24, 2026

Tested with

torchrun --nproc-per-node gpu -m torchtitan.train --module gpt_oss --config gpt_oss_20b --training.local-batch-size 4

@meta-cla meta-cla bot added the CLA Signed label Feb 24, 2026
Comment on lines 85 to 88
checkpoint=CheckpointManager.Config(
    interval=10,
    last_save_model_only=False,
),
Contributor

The configs for checkpoint, validator, log_freq for these two production models seem toy. Could you maybe refer to DeepSeek and update them to be slightly more realistic? Thanks!
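
To illustrate the distinction the comment draws, here is a minimal sketch that contrasts smoke-test values with longer-run values for the three settings it names. It is not code from this PR: `CheckpointSettings`, `RunSettings`, and `validation_freq` are hypothetical stand-ins rather than torchtitan classes or fields, and every number except `interval=10` and `last_save_model_only=False` (taken from the reviewed lines) is a placeholder, not a value from the DeepSeek configs.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class CheckpointSettings:
    """Hypothetical stand-in for the two fields shown in the reviewed lines."""
    interval: int                # save a checkpoint every N training steps
    last_save_model_only: bool   # value copied from the reviewed snippet


@dataclass(frozen=True)
class RunSettings:
    """Hypothetical container for the three knobs named in the comment."""
    checkpoint: CheckpointSettings
    validation_freq: int         # assumed name: run validation every N steps
    log_freq: int                # log metrics every N steps


# Smoke-test style values. interval=10 matches the reviewed lines;
# validation_freq and log_freq here are assumed for illustration.
toy = RunSettings(
    checkpoint=CheckpointSettings(interval=10, last_save_model_only=False),
    validation_freq=5,
    log_freq=1,
)

# Placeholder "long run" values: illustrative only, not from any real config.
realistic = replace(
    toy,
    checkpoint=replace(toy.checkpoint, interval=500),
    validation_freq=500,
    log_freq=10,
)

print(realistic)
```

The point of the sketch is only the order of magnitude: a checkpoint every 10 steps suits a quick debug run, while a long training job would normally save, validate, and log far less often.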

Author

Ty! ready for re-review

@chelsea0x3b chelsea0x3b requested a review from tianyu-l February 25, 2026 14:01
Contributor

@tianyu-l tianyu-l left a comment

LGTM

@tianyu-l
Contributor

please fix linting

2 participants