Optimize `fetch_entries_to` function to avoid stucking `write` too long. #382

LykxSassinator · 2025-07-02T05:56:59Z

Background

PR #370 fixed a panic issue caused by concurrent updates to Memtable followed by reads on stale indexes. However, we observed that this fix introduced performance regressions under concurrent read/write workloads.

Issue

When fetch_entries_to retrieves a large batch of entries from disk, write operations are blocked until the read completes, delaying Memtable updates and degrading throughput.

Solution

This PR optimizes fetch_entries_to to reduce contention and improve performance under mixed workloads.

Results

Branch	Status
Master
This PR

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

ti-chi-bot · 2025-07-02T05:57:02Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

glorv · 2025-07-03T04:18:12Z

src/engine.rs

            for i in ents_idx.iter() {
-                vec.push(read_entry_from_file::<M, _>(self.pipe_log.as_ref(), i)?);
+                vec.push({
+                    match read_entry_from_file::<M, _>(self.pipe_log.as_ref(), i) {


Can we ensure the safety to read entry without holding the lock? Is it possible that some entries are truncated and target wal files are Gced(or reused) before this read, then the result of this read is undefined behavior

Yep. It's ensured.

Is it possible that some entries are truncated and target wal files are Gced(or reused) before this read

File Access Atomicity in Raft-Engine
In raft-engine, file access (.raftlog) is atomic. The GC operation acquires a mutex and deletes the entire file before allowing other accesses. If a read operation successfully returns the target bytes, the result is guaranteed valid.

Index Consistency Handling
This PR further ensures consistency by returning Error(e) if the Memtable index is updated (by either background rewrite or foreground write threads) during a read. Then it will automatically retry with the latest index to fetch the correct entry bytes.

Index Consistency Handling

Say if an entry is rewritten and the old wal file is purged before the real access, seems we still can read it with the old entry info as the file handle is changed then seems we still returns an error even if the entay is actually vaild.

If it uses a stale file handle to get the entry, it will get the Error(OutOfSeq), ref:
https://github.com/LykxSassinator/raft-engine/blob/392f5e66f8286dc1b6d7cf69f2bc20ed72d40123/src/file_pipe_log/pipe.rs#L238

So, if the first fetch uses a stale index to get the entry, it will return an Error and trigger the second retry, where it will use the latest index to access the entry.

glorv

LGTM

overvenus

Rest LGTM

src/engine.rs

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

ti-chi-bot · 2025-07-04T08:22:18Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: glorv, overvenus

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [glorv,overvenus]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ti-chi-bot · 2025-07-04T08:22:19Z

[LGTM Timeline notifier]

Timeline:

2025-07-03 07:36:16.133028097 +0000 UTC m=+1553228.856207081: ☑️ agreed by glorv.
2025-07-04 08:22:18.919925679 +0000 UTC m=+1642391.643104660: ☑️ agreed by overvenus.

…ng. (tikv#382) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

…e` too long. (#382) (#384) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

…e` too long. (#382) (#383) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

…ng. (#382) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

Optimize fetch_entries_to function to avoid stucking write too long.

961d779

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

ti-chi-bot bot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. dco-signoff: yes Indicates the PR's author has signed the dco. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Jul 2, 2025

Make format.

9d8e798

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

LykxSassinator marked this pull request as ready for review July 2, 2025 11:29

ti-chi-bot bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 2, 2025

LykxSassinator requested review from Connor1996, glorv and overvenus July 2, 2025 11:30

glorv reviewed Jul 3, 2025

View reviewed changes

glorv approved these changes Jul 3, 2025

View reviewed changes

ti-chi-bot bot added needs-1-more-lgtm Indicates a PR needs 1 more LGTM. approved labels Jul 3, 2025

LykxSassinator added needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. needs-cherry-pick-release-8.5 Should cherry pick this PR to release-8.5 branch. labels Jul 4, 2025

overvenus reviewed Jul 4, 2025

View reviewed changes

src/engine.rs Outdated Show resolved Hide resolved

Remove useless codes.

c7930cf

Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

LykxSassinator requested a review from overvenus July 4, 2025 07:07

overvenus approved these changes Jul 4, 2025

View reviewed changes

ti-chi-bot bot added the lgtm label Jul 4, 2025

ti-chi-bot bot removed the needs-1-more-lgtm Indicates a PR needs 1 more LGTM. label Jul 4, 2025

ti-chi-bot bot merged commit 03f77d9 into tikv:master Jul 4, 2025
7 checks passed

LykxSassinator added a commit to LykxSassinator/raft-engine that referenced this pull request Jul 4, 2025

Optimize fetch_entries_to function to avoid stucking write too lo…

8e1db07

…ng. (tikv#382) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

LykxSassinator added a commit to LykxSassinator/raft-engine that referenced this pull request Jul 4, 2025

Optimize fetch_entries_to function to avoid stucking write too lo…

dc413c7

…ng. (tikv#382) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

This was referenced Jul 4, 2025

[cp-8.5] Optimize fetch_entries_to function to avoid stucking write too long. (#382) #383

Merged

[cp-7.5] Optimize fetch_entries_to function to avoid stucking write too long. (#382) #384

Merged

ti-chi-bot bot pushed a commit that referenced this pull request Jul 4, 2025

[cp-7.5] Optimize fetch_entries_to function to avoid stucking `writ…

ddf3c90

…e` too long. (#382) (#384) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

ti-chi-bot bot pushed a commit that referenced this pull request Jul 4, 2025

[cp-8.5] Optimize fetch_entries_to function to avoid stucking `writ…

2f9f688

…e` too long. (#382) (#383) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

LykxSassinator mentioned this pull request Jul 6, 2025

*: update raft-engine to optimize fetch_entries_to. tikv/tikv#18617

Merged

9 tasks

This was referenced Jul 6, 2025

*: update raft-engine to optimize fetch_entries_to. (#18617) tikv/tikv#18672

Merged

*: update raft-engine to optimize fetch_entries_to. (#18617) tikv/tikv#18673

Merged

*: update raft-engine to optimize fetch_entries_to. (#18617) tikv/tikv#18674

Merged

LykxSassinator mentioned this pull request Jul 7, 2025

Add missing uts for the optimization on fetch_entries_to. #385

Merged

LykxSassinator added a commit that referenced this pull request Jul 12, 2025

Optimize fetch_entries_to function to avoid stucking write too lo…

ca1cd6b

…ng. (#382) Signed-off-by: lucasliang <nkcs_lykx@hotmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize `fetch_entries_to` function to avoid stucking `write` too long. #382

Optimize `fetch_entries_to` function to avoid stucking `write` too long. #382

Uh oh!

LykxSassinator commented Jul 2, 2025 •

edited

Loading

Uh oh!

ti-chi-bot bot commented Jul 2, 2025

Uh oh!

glorv Jul 3, 2025

Uh oh!

LykxSassinator Jul 3, 2025

Uh oh!

glorv Jul 3, 2025 •

edited

Loading

Uh oh!

LykxSassinator Jul 3, 2025

Uh oh!

glorv left a comment

Uh oh!

overvenus left a comment

Uh oh!

Uh oh!

ti-chi-bot bot commented Jul 4, 2025

Uh oh!

ti-chi-bot bot commented Jul 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimize fetch_entries_to function to avoid stucking write too long. #382

Optimize fetch_entries_to function to avoid stucking write too long. #382

Uh oh!

Conversation

LykxSassinator commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Issue

Solution

Results

Uh oh!

ti-chi-bot bot commented Jul 2, 2025

Uh oh!

glorv Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

LykxSassinator Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

glorv Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LykxSassinator Jul 3, 2025

Choose a reason for hiding this comment

Uh oh!

glorv left a comment

Choose a reason for hiding this comment

Uh oh!

overvenus left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ti-chi-bot bot commented Jul 4, 2025

Uh oh!

ti-chi-bot bot commented Jul 4, 2025

[LGTM Timeline notifier]

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimize `fetch_entries_to` function to avoid stucking `write` too long. #382

Optimize `fetch_entries_to` function to avoid stucking `write` too long. #382

LykxSassinator commented Jul 2, 2025 •

edited

Loading

glorv Jul 3, 2025 •

edited

Loading