From b52f1397b230dde201879bcb7746ee8696b53553 Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:35:29 +0800 Subject: [PATCH 1/7] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index b9224b0..9a2646d 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@
- Logo + Logo
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts From 402d11a28ed7f91a7faedc9497ee7af16bfe1b78 Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:35:38 +0800 Subject: [PATCH 2/7] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 9a2646d..35d964e 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@
- Logo + Logo
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts From f8417e17447bf3af11818886ff533ff4c480e1d0 Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:35:47 +0800 Subject: [PATCH 3/7] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 35d964e..da78145 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@
- Logo + Logo
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts From 12c8475a810188e1a21081bb0c48000fc920bf10 Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:36:53 +0800 Subject: [PATCH 4/7] Update README.md --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index da78145..032dbc4 100644 --- a/README.md +++ b/README.md @@ -15,9 +15,9 @@ -**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang](), ¹ [Zijun Min](),_** +**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_** -**_² [Liang Yao](), ² [Haibo Zhang](), ² [Anxiang Zeng](), ¹ *[Jinsong Su]()_** +**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_** From 2d603f237f33c3c65a095ee2c863d1b27ea62410 Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:37:28 +0800 Subject: [PATCH 5/7] Update README.md --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 032dbc4..acc4b87 100644 --- a/README.md +++ b/README.md @@ -15,9 +15,9 @@ -**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_** +**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_** -**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_** +**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_** From 187c814e762dd2dd8617e84cbecb9f45176b019c Mon Sep 17 00:00:00 2001 From: "zijun.min" <137787597+zijunmin@users.noreply.github.com> Date: Wed, 17 Sep 2025 13:37:50 +0800 Subject: [PATCH 6/7] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index acc4b87..0d4cb54 100644 --- a/README.md +++ b/README.md @@ -15,7 +15,7 @@ -**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_** +**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), ¹ Ante Wang, ¹ Zijun Min,_** **_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_** From e9ff2d99b18e7a8b204b8eefd9f5d6efff73d555 Mon Sep 17 00:00:00 2001 From: "dependabot[bot]" <49699333+dependabot[bot]@users.noreply.github.com> Date: Fri, 19 Sep 2025 02:33:49 +0000 Subject: [PATCH 7/7] Update tensordict requirement Updates the requirements on [tensordict](https://github.com/pytorch/tensordict) to permit the latest version. - [Release notes](https://github.com/pytorch/tensordict/releases) - [Commits](https://github.com/pytorch/tensordict/compare/v0.8.0...v0.10.0) --- updated-dependencies: - dependency-name: tensordict dependency-version: 0.10.0 dependency-type: direct:production ... Signed-off-by: dependabot[bot] --- requirements-npu.txt | 2 +- requirements.txt | 2 +- requirements_sglang.txt | 2 +- 3 files changed, 3 insertions(+), 3 deletions(-) diff --git a/requirements-npu.txt b/requirements-npu.txt index 7d03869..78d204c 100644 --- a/requirements-npu.txt +++ b/requirements-npu.txt @@ -10,7 +10,7 @@ peft pyarrow>=15.0.0 pybind11 pylatexenc -tensordict>=0.8.0,<=0.9.1,!=0.9.0 +tensordict>=0.8.0,!=0.9.0,<=0.10.0 transformers==4.52.4 ray==2.46.0 wandb diff --git a/requirements.txt b/requirements.txt index 31459e6..0b7d2d2 100644 --- a/requirements.txt +++ b/requirements.txt @@ -14,7 +14,7 @@ pybind11 pylatexenc pre-commit ray[default] -tensordict>=0.8.0,<=0.9.1,!=0.9.0 +tensordict>=0.8.0,!=0.9.0,<=0.10.0 torchdata transformers # vllm==0.8.4 diff --git a/requirements_sglang.txt b/requirements_sglang.txt index ce9e7d5..9b8749c 100644 --- a/requirements_sglang.txt +++ b/requirements_sglang.txt @@ -12,7 +12,7 @@ pyarrow>=19.0.0 pybind11 pylatexenc ray[default]>=2.10 -tensordict>=0.8.0,<=0.9.1,!=0.9.0 +tensordict>=0.8.0,!=0.9.0,<=0.10.0 torchdata torchvision transformers