From b52f1397b230dde201879bcb7746ee8696b53553 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:29 +0800
Subject: [PATCH 1/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index b9224b0..9a2646d 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From 402d11a28ed7f91a7faedc9497ee7af16bfe1b78 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:38 +0800
Subject: [PATCH 2/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 9a2646d..35d964e 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From f8417e17447bf3af11818886ff533ff4c480e1d0 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:35:47 +0800
Subject: [PATCH 3/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 35d964e..da78145 100644
--- a/README.md
+++ b/README.md
@@ -1,5 +1,5 @@
-

+
# SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts
From 12c8475a810188e1a21081bb0c48000fc920bf10 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:36:53 +0800
Subject: [PATCH 4/7] Update README.md
---
README.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/README.md b/README.md
index da78145..032dbc4 100644
--- a/README.md
+++ b/README.md
@@ -15,9 +15,9 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang](), ¹ [Zijun Min](),_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_**
-**_² [Liang Yao](), ² [Haibo Zhang](), ² [Anxiang Zeng](), ¹ *[Jinsong Su]()_**
+**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_**
From 2d603f237f33c3c65a095ee2c863d1b27ea62410 Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:37:28 +0800
Subject: [PATCH 5/7] Update README.md
---
README.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/README.md b/README.md
index 032dbc4..acc4b87 100644
--- a/README.md
+++ b/README.md
@@ -15,9 +15,9 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), [Ante Wang], ¹ [Zijun Min],_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_**
-**_² [Liang Yao], ² [Haibo Zhang], ² [Anxiang Zeng], ¹ *[Jinsong Su]_**
+**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_**
From 187c814e762dd2dd8617e84cbecb9f45176b019c Mon Sep 17 00:00:00 2001
From: "zijun.min" <137787597+zijunmin@users.noreply.github.com>
Date: Wed, 17 Sep 2025 13:37:50 +0800
Subject: [PATCH 6/7] Update README.md
---
README.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/README.md b/README.md
index acc4b87..0d4cb54 100644
--- a/README.md
+++ b/README.md
@@ -15,7 +15,7 @@
-**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), Ante Wang, ¹ Zijun Min,_**
+**_¹ [Bingshuai Liu](https://bingshuailiu.github.io), ¹ Ante Wang, ¹ Zijun Min,_**
**_² Liang Yao, ² Haibo Zhang, ² Anxiang Zeng, ¹ *Jinsong Su_**
From bda963feb7b94e64651c993a7b5ed419b093901f Mon Sep 17 00:00:00 2001
From: "dependabot[bot]" <49699333+dependabot[bot]@users.noreply.github.com>
Date: Fri, 19 Sep 2025 02:33:40 +0000
Subject: [PATCH 7/7] Bump transformers from 4.52.4 to 4.56.1
Bumps [transformers](https://github.com/huggingface/transformers) from 4.52.4 to 4.56.1.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](https://github.com/huggingface/transformers/compare/v4.52.4...v4.56.1)
---
updated-dependencies:
- dependency-name: transformers
dependency-version: 4.56.1
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot]
---
requirements-npu.txt | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/requirements-npu.txt b/requirements-npu.txt
index 7d03869..f08c738 100644
--- a/requirements-npu.txt
+++ b/requirements-npu.txt
@@ -11,7 +11,7 @@ pyarrow>=15.0.0
pybind11
pylatexenc
tensordict>=0.8.0,<=0.9.1,!=0.9.0
-transformers==4.52.4
+transformers==4.56.1
ray==2.46.0
wandb
mathruler