HDDS-14246. Change fsync boundary for FilePerBlockStrategy to block level #9570
What changes were proposed in this pull request?
Currently, the datanode has an option to fsync writes on chunk boundaries (hdds.container.chunk.write.sync), which is disabled by default since it might affect datanode write throughput and latency. However, with it disabled, if the datanode machine suddenly goes down (e.g. power failure, or the process is killed by the OOM killer), a file may end up with incomplete data even though PutBlock (the write commit) succeeded, which violates our durability guarantee. Although PutBlock triggers FilePerBlockStrategy#finishWriteChunks, which closes the file (RandomAccessFile#close), the buffer cache might not have been flushed yet, since closing a file does not imply that the cached data for the file is flushed to stable storage (see https://man7.org/linux/man-pages/man2/close.2.html). So there is a chance that the user's key is committed but the data does not exist on the datanodes.
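To illustrate the distinction, here is a minimal standalone sketch (not Ozone code; the file path and class name are arbitrary): closing a channel only releases the file descriptor, while FileChannel#force is what actually asks the OS to flush the data.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;

public class CloseVsForce {
  public static void main(String[] args) throws IOException {
    try (RandomAccessFile file = new RandomAccessFile("/tmp/chunk.data", "rw");
         FileChannel channel = file.getChannel()) {
      channel.write(ByteBuffer.wrap("chunk data".getBytes(StandardCharsets.UTF_8)));
      // close() (implicit via try-with-resources) only releases the file
      // descriptor; the bytes may still sit in the OS page cache and can be
      // lost if the machine crashes before the kernel writes them back.
      // force(false) is the Java equivalent of fdatasync: it blocks until
      // the file's data (not its metadata) reaches stable storage.
      channel.force(false);
    }
  }
}
```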
However, flushing on every WriteChunk might cause unnecessary overhead. We should instead consider calling FileChannel#force on PutBlock rather than on WriteChunk, since the data only becomes visible to users when PutBlock returns successfully (the data is committed); on failure, the client will try to replace the block (allocate another block). This way, we can guarantee that after the user has successfully uploaded a key, the data has been persistently stored on the leader, and at least one follower has promised to flush the data (MAJORITY_COMMITTED).
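A rough sketch of the idea follows. This is not the actual diff in this PR: it assumes finishWriteChunks can reach the block file's open FileChannel, and the real signatures and state in FilePerBlockStrategy differ.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;

// Hypothetical simplification of the PutBlock-side handling in
// FilePerBlockStrategy; the real class manages channels via an
// open-file cache rather than taking one as a parameter.
class BlockFlushSketch {
  // Invoked when PutBlock commits the block (finishWriteChunks in the PR).
  void finishWriteChunks(FileChannel blockChannel) throws IOException {
    // Force dirty pages of the block file to stable storage before closing,
    // so that a successful PutBlock implies the data survives a sudden
    // datanode crash (power failure, OOM kill, etc.).
    blockChannel.force(false);
    blockChannel.close();
  }
}
```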
This might still affect write throughput and latency, since writes must wait for the buffer cache to be flushed to persistent storage (SSD or disk), but it strengthens our data durability guarantee (which should be our priority). Flushing the buffer cache might also reduce the datanode's memory usage.
In the future, we should consider enabling hdds.container.chunk.write.sync by default.
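For reference, operators can already opt in to per-chunk syncing today. A sketch using the standard Hadoop-style configuration setter (OzoneConfiguration extends the Hadoop Configuration API; the class name here is an assumption about where the setting would be applied):

```java
import org.apache.hadoop.hdds.conf.OzoneConfiguration;

public class EnableChunkWriteSync {
  public static void main(String[] args) {
    OzoneConfiguration conf = new OzoneConfiguration();
    // Fsync on every chunk write; disabled by default because of the
    // throughput/latency cost discussed above.
    conf.setBoolean("hdds.container.chunk.write.sync", true);
  }
}
```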
What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-14246
How was this patch tested?
CI run with sync enabled: https://github.com/ivandika3/ozone/actions/runs/20535392231