CNDB-14687: Consistent error messages in the event of a network timeout. #2194

scottfines · 2026-01-13T15:53:27Z

What is the issue

Network timeouts are detected in two different ways during Repair operations: a configured count-down latch fails if the requests take too long, and the underlying messaging system returns its own, separate timeout. Unfortunately, when the underlying messaging system timed out, the failure was treated as a general network error instead of a timeout. As a result, on the rare occasions when the messaging system times out before the count-down latch does, some tests in the CI build fail with an incorrect error message.

What does this PR fix and why was it fixed

This resolves https://github.com/riptano/cndb/issues/14687.

The main motivation is flaky unit tests, but the end user will also see a more consistent error message in the event of a network timeout that is detected at a lower level than the latch timeout.

…age as a latch timeout. This creates a more consistent behavior in the event of different network timeouts, and has a side benefit of fixing three or four different flaky tests
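For readers unfamiliar with the two timeout paths, the idea can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `RepairTimeoutSketch`, `ResponseTracker`, `FailureReason`, and `RepairTimeoutException` are hypothetical names invented for the example, and the message text is made up. It only assumes that the messaging layer can report a failure reason that distinguishes a timeout from other errors.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical sketch: none of these names come from the actual patch.
final class RepairTimeoutSketch {

    /** Assumed failure reasons reported by the messaging layer. */
    enum FailureReason { TIMEOUT, UNKNOWN }

    /** The single exception type surfaced for every kind of timeout. */
    static final class RepairTimeoutException extends RuntimeException {
        RepairTimeoutException(String operation) {
            super("Timed out waiting for responses to " + operation);
        }
    }

    /** Tracks the outstanding responses for one repair request. */
    static final class ResponseTracker {
        private final String operation;
        private final CountDownLatch latch;
        private final AtomicReference<RuntimeException> failure = new AtomicReference<>();

        ResponseTracker(String operation, int expectedResponses) {
            this.operation = operation;
            this.latch = new CountDownLatch(expectedResponses);
        }

        /** Called once per successful response. */
        void onResponse() {
            latch.countDown();
        }

        /** Called by the messaging layer when a request fails. */
        void onFailure(FailureReason reason) {
            // A messaging-level timeout maps to the same exception as a latch timeout;
            // only non-timeout failures remain general network errors.
            RuntimeException error = reason == FailureReason.TIMEOUT
                    ? new RepairTimeoutException(operation)
                    : new RuntimeException("Network error during " + operation);
            failure.compareAndSet(null, error);
            // Release the waiter immediately instead of letting the latch run out.
            while (latch.getCount() > 0) {
                latch.countDown();
            }
        }

        /** Blocks until all responses arrive, a failure is reported, or the latch times out. */
        void await(long timeout, TimeUnit unit) {
            boolean completed;
            try {
                completed = latch.await(timeout, unit);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new RuntimeException("Interrupted during " + operation, e);
            }
            RuntimeException error = failure.get();
            if (error != null) {
                throw error;
            }
            if (!completed) {
                throw new RepairTimeoutException(operation);
            }
        }
    }
}
```

With this shape, a caller sees the same `RepairTimeoutException` message whether the latch expires first or the messaging layer reports the timeout first, so a test asserting on the timeout message no longer depends on which path wins the race.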

github-actions · 2026-01-13T15:53:55Z

driftx · 2026-01-15T19:48:44Z

Restarted the CI job, hopefully that works: https://jenkins-stargazer.aws.dsinternal.org/job/ds-cassandra-pr-gate/job/PR-2194/

sonarqubecloud · 2026-01-15T20:08:44Z

Quality Gate passed

Issues
1 New issue
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

driftx

Everything looks good. Butler didn't fire from the CI restart, but there weren't many failures, and they were all timeouts that don't reproduce, so they aren't related to this change.

CNDB-14687: Ensures that a network timeout causes the same error mess…

5a1faa8

…age as a latch timeout. This creates a more consistent behavior in the event of different network timeouts, and has a side benefit of fixing three or four different flaky tests

driftx self-requested a review January 15, 2026 14:26

driftx approved these changes Jan 16, 2026

scottfines merged commit a87ea67 into main Jan 16, 2026
486 of 501 checks passed

scottfines deleted the c14687 branch January 16, 2026 14:52
