
Conversation

@pkolaczk

@pkolaczk pkolaczk commented Jan 7, 2026

What is the issue

When a memory index contains very few rows and is split into
many shards, we can expect a lot of variance in the number of rows
between the shards. Hence, if we took only one
shard to estimate the number of matched rows and
extrapolated that to all shards to compute the estimated number of
matching rows in the whole index, we would risk a huge estimation error.

This turned out to be a problem when testing the query planner metrics
in #2130, when the planner estimated 0 rows and didn't bump up
the estimated row count metric.

What does this PR fix and why was it fixed

This commit changes the algorithm to take as many shards as needed
to collect enough returned or indexed rows. For very
tiny datasets it is likely to use all shards for estimation.
For big datasets, one shard will likely be enough, which speeds up
estimation.

This change also lets us remove one estimation method:
we no longer need to manually choose between the estimate
from the first shard and the estimate from all shards.
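The adaptive sampling idea described above can be illustrated with a short sketch. Note that this is a hypothetical illustration, not code from the actual patch: the class, record, and method names (`ShardEstimate`, `ShardSample`, `estimateMatchingRows`) and the threshold parameter are invented for this example.

```java
import java.util.List;

// Hypothetical sketch of adaptive shard sampling for row-count estimation:
// keep scanning shards until enough rows have been observed, then
// extrapolate the matched-row count to the remaining shards.
public class ShardEstimate
{
    /** Matched and indexed row counts observed in a single shard (illustrative). */
    record ShardSample(long matched, long indexed) {}

    static long estimateMatchingRows(List<ShardSample> shards, long minRowsForEstimate)
    {
        long matched = 0;
        long indexed = 0;
        int used = 0;
        for (ShardSample s : shards)
        {
            matched += s.matched();
            indexed += s.indexed();
            used++;
            // Stop once enough returned or indexed rows were seen;
            // a big dataset typically terminates after a single shard.
            if (matched >= minRowsForEstimate || indexed >= minRowsForEstimate)
                break;
        }
        // Extrapolate from the sampled shards to the whole index.
        return Math.round((double) matched / used * shards.size());
    }

    public static void main(String[] args)
    {
        // Tiny dataset: per-shard variance is high, so all shards get sampled
        // and the estimate degenerates to the exact count (0 + 3 + 1 = 4).
        List<ShardSample> tiny = List.of(new ShardSample(0, 2),
                                         new ShardSample(3, 4),
                                         new ShardSample(1, 1));
        System.out.println(estimateMatchingRows(tiny, 100)); // prints 4

        // Big dataset: the first shard alone crosses the threshold,
        // so only one shard is scanned and its count is extrapolated.
        List<ShardSample> big = List.of(new ShardSample(500, 10000),
                                        new ShardSample(480, 10000),
                                        new ShardSample(520, 10000));
        System.out.println(estimateMatchingRows(big, 100)); // prints 1500
    }
}
```

On a tiny index this avoids the failure mode from the issue, where a single empty shard would extrapolate to an estimate of 0 rows for the whole index.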

@github-actions

github-actions bot commented Jan 7, 2026

Checklist before you submit for review

  • This PR adheres to the Definition of Done
  • Make sure there is a PR in the CNDB project updating the Converged Cassandra version
  • Use NoSpamLogger for log lines that may appear frequently in the logs
  • Verify test results on Butler
  • Test coverage for new/modified code is > 80%
  • Proper code formatting
  • Proper title for each commit starting with the project-issue number, like CNDB-1234
  • Each commit has a meaningful description
  • Each commit is not very long and contains related changes
  • Renames, moves and reformatting are in distinct commits
  • All new files should contain the DataStax copyright header instead of the Apache License one

@pkolaczk pkolaczk requested a review from k-rus January 7, 2026 17:08
@k-rus
Member

k-rus commented Jan 7, 2026

@pkolaczk can you add to the PR description, which issue is going to be fixed by this PR?

@pkolaczk
Author

pkolaczk commented Jan 8, 2026

@pkolaczk can you add to the PR description, which issue is going to be fixed by this PR?

Linked. https://github.com/riptano/cndb/issues/16363

@k-rus
Member

k-rus commented Jan 8, 2026

It would be great to update the PR description with motivation for the work from the issue and that it's blocking for another work.

@pkolaczk pkolaczk force-pushed the c16363-memtable-index-estimates branch from 2dbcb8e to 876dc37 Compare January 9, 2026 11:53
@pkolaczk pkolaczk requested review from adelapena and removed request for k-rus January 9, 2026 11:59
@pkolaczk pkolaczk force-pushed the c16363-memtable-index-estimates branch from 876dc37 to 23c2479 Compare January 9, 2026 12:58
@pkolaczk
Author

test this please

@pkolaczk pkolaczk force-pushed the c16363-memtable-index-estimates branch from 23c2479 to 56197c0 Compare January 12, 2026 11:21

@scottfines scottfines left a comment


I'm new, and just trying to learn. But for what it's worth, the logic looks sound to me.


@adelapena adelapena left a comment


The changes look good to me. I have just left a few nits that can be addressed before merging. I think the CNDB PR will need an update. Maybe IndexQueryMetricsTest.testIndexQueryMetrics will need some adapting in that PR too.

@cassci-bot

❌ Build ds-cassandra-pr-gate/PR-2188 rejected by Butler


3 regressions found
See build details here


Found 3 new test failures

Test Explanation Runs Upstream
o.a.c.index.sai.cql.VectorCompaction100dTest.testOneToManyCompaction[dc true] REGRESSION 🔴 0 / 21
o.a.c.index.sai.cql.VectorSiftSmallTest.testSiftSmall[db false] REGRESSION 🔴 0 / 21
o.a.c.index.sai.plan.PlanTest.prettyPrint (compression) REGRESSION 🔵🔴 0 / 21

Found 3 known test failures

The commit message repeats the PR description above, with the following additions:

Additionally, the accuracy of estimating NOT_EQ row counts has been
improved by letting the planner know that the union generated by NOT_EQ
is disjoint, so the result-set cardinality is the sum of the
cardinalities of the subplans.
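The commit message does not show the formulas involved, but the difference can be sketched as follows. A common planner heuristic for a union of possibly overlapping predicates assumes independence, |A ∪ B| ≈ |A| + |B| − |A|·|B|/N, which underestimates when the branches are actually disjoint; for the two disjoint ranges produced by NOT_EQ (x < v and x > v), the exact cardinality is simply |A| + |B|. The method names below are illustrative, not from the patch.

```java
// Illustrative comparison of union cardinality estimates (not actual patch code).
public class UnionCardinality
{
    // Generic union estimate under an independence assumption,
    // appropriate when the subplans may overlap.
    static double unionEstimate(double a, double b, double totalRows)
    {
        return a + b - (a * b) / totalRows;
    }

    // Disjoint union, e.g. the (x < v) OR (x > v) pair generated by NOT_EQ:
    // cardinalities simply add up.
    static double disjointUnionEstimate(double a, double b)
    {
        return a + b;
    }

    public static void main(String[] args)
    {
        double total = 1000, lt = 400, gt = 550;
        System.out.println(unionEstimate(lt, gt, total));  // 730.0 -- underestimates
        System.out.println(disjointUnionEstimate(lt, gt)); // 950.0 -- exact for disjoint ranges
    }
}
```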

The commit also contains a fix for a bug that caused some
non-hybrid queries to be counted as hybrid by the query metrics.

Unused keyRange parameters have been removed from the methods
for estimating row counts in the index classes.
@pkolaczk pkolaczk force-pushed the c16363-memtable-index-estimates branch from 325bad7 to 2386ac7 Compare January 14, 2026 17:20
@sonarqubecloud

@pkolaczk pkolaczk merged commit c7ae969 into main Jan 16, 2026
485 of 499 checks passed
@pkolaczk pkolaczk deleted the c16363-memtable-index-estimates branch January 16, 2026 13:44

6 participants