[rl] refactor torchtitan model registry in vllm #2194

Open
wwwjn wants to merge 23 commits into gh/wwwjn/5/base from gh/wwwjn/5/head

Conversation

[ghstack-poisoned]
wwwjn added a commit that referenced this pull request Jan 2, 2026
ghstack-source-id: 557ecd0
Pull Request resolved: #2194
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 2, 2026
@wwwjn wwwjn closed this Jan 2, 2026
wwwjn added a commit that referenced this pull request Jan 2, 2026
ghstack-source-id: 557ecd0
Pull Request resolved: #2194
@wwwjn wwwjn reopened this Jan 2, 2026
@wwwjn wwwjn changed the title from "refactor model registry" to "[rl] refactor model registry" on Jan 2, 2026
wwwjn added a commit that referenced this pull request Jan 13, 2026
ghstack-source-id: 557ecd0
Pull Request resolved: #2194
@wwwjn wwwjn changed the title from "[rl] refactor model registry" to "[rl] refactor torchtitan model registry in vllm" on Feb 19, 2026
```python
# model_flavor during registration, because we cannot pass the torchtitan
# job_config through the LLM() API
model_flavor="0.6B",

from torchtitan.experiments.rl.unified.infra.parallelism_utils import (
    create_parallel_dims_from_vllm_config,
)
```
Contributor


I hope we can put all torchtitan-vllm glue code in one file or folder, and carefully document why we need each class/method. This one sounds like one of them.

Contributor Author


Nice catch, refactored this part

wwwjn added a commit that referenced this pull request Feb 22, 2026
…ight tying (#2410)

Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.13.0)
(oldest at bottom):
* #2395
* #2244
* #2221
* #2194
* #2191
* __->__ #2410


This is an alternative fix to
#2402 (comment).

Weight updating between the trainer and the generator is completely broken because we call `reload_weights` when updating the weights. `reload_weights` has the following steps:

- `initialize_layerwise_reload(model)`: saves the current real GPU tensors as `info.kernel_tensors` and replaces all parameters with meta tensors.
- `model.load_weights(weights_iter)`: this function is written by us and calls `set_model_state_dict`. Internally, `set_model_state_dict` tries to do `param.data.copy_(loaded_weight)` for each parameter. When the parameters are meta tensors, the copy is a no-op, so the weights never get updated.
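The silent no-op can be reproduced in isolation (a minimal sketch, assuming standard PyTorch meta-device semantics; not code from this PR):

```python
import torch

# A parameter that has been swapped to the meta device, as the broken
# reload path does before loading.
param = torch.nn.Parameter(torch.empty(2, 2, device="meta"))

# Trying to load real weights into it: copy_ into a meta tensor raises no
# error but writes nothing, because meta tensors carry no storage.
param.data.copy_(torch.ones(2, 2))

# The parameter is still a meta tensor; the "loaded" values went nowhere.
print(param.is_meta)
```

This is why the load path appears to succeed while the generator keeps serving stale weights.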


In this PR:

- Completely bypass `reload_weights`, and don't load from a file when updating the weights.
- Get the model via `self.engine.model_executor.driver_worker.get_model()`.
- Iterate over `model.named_parameters()` to find the matching parameter by name.
- Do `param.data.copy_(new_tensor)` directly.
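The direct-copy path above can be sketched as follows. This is a hedged illustration, not the PR's actual code: `update_weights_inplace` and the `new_weights` dict are hypothetical names; only the attribute chain `engine.model_executor.driver_worker.get_model()` comes from the PR description.

```python
import torch


def update_weights_inplace(
    model: torch.nn.Module, new_weights: dict[str, torch.Tensor]
) -> int:
    """Copy new tensors into matching parameters by name; return the count updated."""
    updated = 0
    # Build a name -> parameter lookup so each weight is matched by name.
    params = dict(model.named_parameters())
    with torch.no_grad():
        for name, new_tensor in new_weights.items():
            param = params.get(name)
            if param is None:
                continue  # no matching parameter for this weight
            # Real (non-meta) parameters, so this copy actually lands.
            param.data.copy_(new_tensor)
            updated += 1
    return updated


# Hypothetical usage with a vLLM engine handle (attribute chain per the PR text):
# model = engine.model_executor.driver_worker.get_model()
# update_weights_inplace(model, new_weights)
```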

Labels

ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.


3 participants