From b52f1397b230dde201879bcb7746ee8696b53553 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:29 +0800
Subject: [PATCH 1/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index b9224b0..9a2646d 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From 402d11a28ed7f91a7faedc9497ee7af16bfe1b78 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:38 +0800
Subject: [PATCH 2/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 9a2646d..35d964e 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From f8417e17447bf3af11818886ff533ff4c480e1d0 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:47 +0800
Subject: [PATCH 3/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 35d964e..da78145 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From 12c8475a810188e1a21081bb0c48000fc920bf10 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:36:53 +0800
Subject: [PATCH 4/7] Update README.md
---
README.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/README.md b/README.md
index da78145..032dbc4 100644
--- a/README.md
+++ b/README.md
@@ -15,9 +15,9 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang](), ¹ [Zijun Min](),_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_**
-**_² [Liang Yao](), ² [Haibo Zhang](), ² [Anxiang Zeng](), ¹ *[Jinsong Su]()_**
+**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_**
From 2d603f237f33c3c65a095ee2c863d1b27ea62410 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:37:28 +0800
Subject: [PATCH 5/7] Update README.md
---
README.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/README.md b/README.md
index 032dbc4..acc4b87 100644
--- a/README.md
+++ b/README.md
@@ -15,9 +15,9 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_**
-**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_**
+**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_**
From 187c814e762dd2dd8617e84cbecb9f45176b019c Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:37:50 +0800
Subject: [PATCH 6/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index acc4b87..0d4cb54 100644
--- a/README.md
+++ b/README.md
@@ -15,7 +15,7 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), ¹ Ante Wang, ¹ Zijun Min,_**
**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_**
From e9ff2d99b18e7a8b204b8eefd9f5d6efff73d555 Mon Sep 17 00:00:00 2001
From: "dependabot[bot]" <49699333+dependabot[bot]@users.noreply.github.com>
Date: Fri, 19 Sep 2025 02:33:49 +0000
Subject: [PATCH 7/7] Update tensordict requirement
Updates the requirements on [tensordict](https://github.com/pytorch/tensordict) to permit the latest version.
- [Release notes](https://github.com/pytorch/tensordict/releases)
- [Commits](https://github.com/pytorch/tensordict/compare/v0.8.0...v0.10.0)
---
updated-dependencies:
- dependency-name: tensordict
dependency-version: 0.10.0
dependency-type: direct:production
...
Signed-off-by: dependabot[bot]
---
requirements-npu.txt | 2 +-
requirements.txt | 2 +-
requirements_sglang.txt | 2 +-
3 files changed, 3 insertions(+), 3 deletions(-)
diff --git a/requirements-npu.txt b/requirements-npu.txt
index 7d03869..78d204c 100644
--- a/requirements-npu.txt
+++ b/requirements-npu.txt
@@ -10,7 +10,7 @@ peft
pyarrow>=15.0.0
pybind11
pylatexenc
-tensordict>=0.8.0,<=0.9.1,!=0.9.0
+tensordict>=0.8.0,!=0.9.0,<=0.10.0
transformers==4.52.4
ray==2.46.0
wandb
diff --git a/requirements.txt b/requirements.txt
index 31459e6..0b7d2d2 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -14,7 +14,7 @@ pybind11
pylatexenc
pre-commit
ray[default]
-tensordict>=0.8.0,<=0.9.1,!=0.9.0
+tensordict>=0.8.0,!=0.9.0,<=0.10.0
torchdata
transformers
# vllm==0.8.4
diff --git a/requirements_sglang.txt b/requirements_sglang.txt
index ce9e7d5..9b8749c 100644
--- a/requirements_sglang.txt
+++ b/requirements_sglang.txt
@@ -12,7 +12,7 @@ pyarrow>=19.0.0
pybind11
pylatexenc
ray[default]>=2.10
-tensordict>=0.8.0,<=0.9.1,!=0.9.0
+tensordict>=0.8.0,!=0.9.0,<=0.10.0
torchdata
torchvision
transformers