Stasis

Stasis

This library aims to demonstrate how to create a dead-simple multi-node RL stack.

Some focuses of this library includes:

RL specific observability tools to help debug capabilities
Support for GRPO, DrGRPO -- no value function models.
Focus on clean code -- zero feature bloat.

Development speed

vLLM makes many calls to pynvml during initialization, This brings up the intialization time to around 17s. Turning on persistence mode on the CUDA driver brings this time down to around 5s.

To turn on CUDA driver persistence mode, run:

sudo nvidia-smi -pm 1

This enables CUDA driver persistence mode. The size of the driver will be ~75MB of kernel memory [TODO check this], but this is well worth it for development velocity.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
__pycache__		__pycache__
configs		configs
data-formatter		data-formatter
data/countdown		data/countdown
rlperf		rlperf
stasis-viz		stasis-viz
stasis.egg-info		stasis.egg-info
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Stasis

Development speed

About

Uh oh!

Releases

Packages

Languages

punwai/jolt

Folders and files

Latest commit

History

Repository files navigation

Stasis

Development speed

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages