keshavb96 (Contributor) commented Feb 2, 2026

  1. Decouples the processes that handle weight updates/training, prompt dispatching, and rollout generation (vLLM). See the example in examples/decoupled_synchronous.
  2. Adds async RL support. See the example in examples/decoupled_asynchronous.
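
For reviewers who want a mental model of the two modes, here is a minimal sketch of how the three roles could be decoupled. It is not the code in this PR: `WeightStore`, `dispatcher`, `generator`, `trainer`, and `max_staleness` are hypothetical names, the roles are asyncio tasks rather than separate processes, and rollout generation is a string transform standing in for vLLM. It also assumes that "synchronous" means rollouts are always generated with the latest weights and "asynchronous" means generation may run a bounded number of weight versions ahead of training.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Rollout:
    prompt: str
    completion: str
    policy_version: int  # weight version that produced this rollout


class WeightStore:
    """Hypothetical stand-in for the trainer -> generator weight-sync channel."""

    def __init__(self) -> None:
        self.version = 0
        self._changed = asyncio.Condition()

    async def publish(self) -> None:
        # Called by the trainer after each update step.
        async with self._changed:
            self.version += 1
            self._changed.notify_all()

    async def wait_until(self, min_version: int) -> None:
        # Called by the generator to bound how stale its weights may be.
        async with self._changed:
            await self._changed.wait_for(lambda: self.version >= min_version)


async def dispatcher(prompts, prompt_q):
    for p in prompts:
        await prompt_q.put(p)
    await prompt_q.put(None)  # sentinel: no more prompts


async def generator(prompt_q, rollout_q, weights, batch_size, max_staleness):
    produced = 0
    while (prompt := await prompt_q.get()) is not None:
        step = produced // batch_size  # trainer step that will consume this rollout
        # max_staleness=0 -> wait for the latest weights (synchronous mode);
        # max_staleness>0 -> may run ahead of the trainer (asynchronous mode).
        await weights.wait_until(step - max_staleness)
        # Placeholder generation; a real worker would call an inference engine here.
        await rollout_q.put(Rollout(prompt, prompt.upper(), weights.version))
        produced += 1
    await rollout_q.put(None)


async def trainer(rollout_q, weights, batch_size):
    lags, batch = [], []
    while (r := await rollout_q.get()) is not None:
        batch.append(r)
        if len(batch) == batch_size:
            # Record how many versions behind the oldest rollout in the batch is.
            lags.append(max(weights.version - x.policy_version for x in batch))
            await weights.publish()  # stand-in for an optimizer step + weight push
            batch.clear()
    return lags


async def run(prompts, batch_size=2, max_staleness=0):
    prompt_q, rollout_q = asyncio.Queue(), asyncio.Queue()
    weights = WeightStore()
    _, _, lags = await asyncio.gather(
        dispatcher(prompts, prompt_q),
        generator(prompt_q, rollout_q, weights, batch_size, max_staleness),
        trainer(rollout_q, weights, batch_size),
    )
    return lags
```

Calling `asyncio.run(run(prompts, max_staleness=0))` gives a per-batch lag of 0, since the generator blocks until the trainer has published the weights for that step. With `max_staleness=1` the generator can work on the next batch while the trainer is still updating, and the recorded lag never exceeds 1.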
