diff --git a/README.md b/README.md index b9224b0..0d4cb54 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@
- Logo + Logo
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts @@ -15,9 +15,9 @@ -**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang](), ¹ [Zijun Min](),_** +**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), ¹ Ante Wang, ¹ Zijun Min,_** -**_² [Liang Yao](), ² [Haibo Zhang](), ² [Anxiang Zeng](), ¹ *[Jinsong Su]()_** +**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_** diff --git a/requirements-npu.txt b/requirements-npu.txt index 7d03869..ba31476 100644 --- a/requirements-npu.txt +++ b/requirements-npu.txt @@ -4,7 +4,7 @@ codetiming datasets dill hydra-core -numpy<2.0.0 +numpy<3.0.0 pandas peft pyarrow>=15.0.0 diff --git a/requirements.txt b/requirements.txt index 31459e6..bff56eb 100644 --- a/requirements.txt +++ b/requirements.txt @@ -6,7 +6,7 @@ dill flash-attn hydra-core liger-kernel -numpy<2.0.0 +numpy<3.0.0 pandas peft pyarrow>=19.0.0 diff --git a/requirements_sglang.txt b/requirements_sglang.txt index ce9e7d5..88ac998 100644 --- a/requirements_sglang.txt +++ b/requirements_sglang.txt @@ -5,7 +5,7 @@ datasets dill flash-attn hydra-core -numpy<2.0.0 +numpy<3.0.0 pandas peft pyarrow>=19.0.0