Scale the DAG #908

Draft

bitcoin-coder-bob wants to merge 33 commits into master from bob/dag-1

Conversation


@bitcoin-coder-bob bitcoin-coder-bob commented Feb 10, 2026

Issue: #833

Summary by CodeRabbit

  • New Features

    • VTXOs now include depth and marker metadata for chain hierarchy and traversal.
    • Marker-based bulk-sweep and traversal optimizations for faster deep-chain operations.
  • API

    • Public API and indexer responses now include a depth field in VTXO payloads.
  • Database

    • Schema migrations added to persist depth, markers, and swept-marker tracking.
  • Tests

    • Extensive tests added for depth, markers, traversal, and sweep behaviors.

NOTE:

  • ❌ Existing VTXOs won't benefit from bulk sweep optimization
  • ✅ New VTXOs (created after migration) will get proper marker inheritance from application code
  • e2e tests require changes to the SDK to support the new proto field (depth on the vtxo), so they are not expanded in this PR

Here's a breakdown of the efficiency gains and DB query savings:

  The branch introduces a marker system — DAG checkpoints placed every 100 depths in the VTXO chain. These markers enable bulk operations instead of per-VTXO operations.                                  
                                                                                                                                                                                                           
  New structures (sketched in Go below):
  - marker table: checkpoints every 100 depths with parent_markers for traversal                                                                                                                           
  - swept_marker table: replaces the per-VTXO swept boolean column                                                                                                                                         
  - markers JSONB column on vtxo: links each VTXO to its covering markers                                                                                                                                  
  - depth column on vtxo: integer chain depth
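
  A minimal Go sketch of this data model, assuming the names used elsewhere in this PR (Marker, MarkerInterval, Vtxo.Depth, Vtxo.MarkerIDs); the real types in internal/core/domain carry more fields:

package domain

// MarkerInterval is the checkpoint spacing (assumed to be 100, matching the
// "every 100 depths" description above).
const MarkerInterval = 100

// Marker is a DAG checkpoint covering the VTXOs of one depth interval.
type Marker struct {
    ID              string   // assumed: the outpoint of the boundary VTXO
    Depth           uint32   // chain depth at which the checkpoint sits
    ParentMarkerIDs []string // markers covering the previous interval
}

// Vtxo shows only the fields added by this PR.
type Vtxo struct {
    Depth     uint32   // 0 for roots, max(parent depths)+1 otherwise
    MarkerIDs []string // covering markers, persisted in the markers JSON(B) column
}

// IsAtMarkerBoundary reports whether a new marker is created at this depth.
func IsAtMarkerBoundary(depth uint32) bool {
    return depth%MarkerInterval == 0
}

// NextDepth applies the depth rule: new depth = max(spent depths) + 1.
func NextDepth(spent []Vtxo) uint32 {
    if len(spent) == 0 {
        return 0 // root VTXO
    }
    var maxDepth uint32
    for _, v := range spent {
        if v.Depth > maxDepth {
            maxDepth = v.Depth
        }
    }
    return maxDepth + 1
}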

Tests: the tests added in `internal/interface/grpc/handlers/parser_test.go` may be overkill; I can remove them on request.
                                                                                                                                                                                                           
  ---                                                                                                                                                                                                      
  GetVtxoChain — Major Savings                                                                                                                                                                             

  Before (master): BFS loop fetching VTXOs one at a time via individual SELECT queries per iteration.

  After (bob/dag-1): A prefetchVtxosByMarkers phase bulk-loads all relevant VTXOs into an in-memory cache before the loop starts. The loop then hits cache instead of DB.
  ┌────────────────────────────────┬──────────────────────┬─────────────────────────────────────────────────┬──────────────────────────┐
  │             Metric             │   Before (master)    │                After (bob/dag-1)                │         Savings          │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ VTXO SELECT queries (depth N)  │ N individual queries │ 1 bulk query (Postgres) or ceil(N/100) (SQLite) │ ~100x fewer VTXO lookups │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ Marker traversal queries       │ 0                    │ ceil(N/100)                                     │ New cost, but tiny       │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ Total DB round-trips (N=100)   │ ~201                 │ ~103                                            │ ~49% reduction           │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ Total DB round-trips (N=500)   │ ~1001                │ ~508                                            │ ~49% reduction           │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ Total DB round-trips (N=1000)  │ ~2001                │ ~1012                                           │ ~49% reduction           │
  ├────────────────────────────────┼──────────────────────┼─────────────────────────────────────────────────┼──────────────────────────┤
  │ Total DB round-trips (N=10000) │ ~20001               │ ~10102                                          │ ~50% reduction           │
  └────────────────────────────────┴──────────────────────┴─────────────────────────────────────────────────┴──────────────────────────┘
  The remaining ~50% comes from GetOffchainTx calls (1 per preconfirmed VTXO) which are not cached by this change. The VTXO-fetching portion specifically goes from O(N) to O(N/100) — a 100x reduction in
  that category.
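
  A hedged sketch of that cache-first pattern; the real prefetchVtxosByMarkers and getVtxosFromCacheOrDB in internal/core/application/indexer.go have different signatures and error handling, and the repository methods shown here are assumptions based on the summary:

package indexer

import (
    "context"
    "fmt"
)

// Stand-ins for the domain types described in this PR.
type Outpoint struct {
    Txid string
    VOut uint32
}

func (o Outpoint) String() string { return fmt.Sprintf("%s:%d", o.Txid, o.VOut) }

type Vtxo struct {
    Outpoint  Outpoint
    Depth     uint32
    MarkerIDs []string
}

type markerReader interface {
    GetVtxoChainByMarkers(ctx context.Context, markerIDs []string) ([]Vtxo, error)
}

type vtxoReader interface {
    GetVtxos(ctx context.Context, outpoints []Outpoint) ([]Vtxo, error)
}

// prefetchByMarkers bulk-loads every VTXO covered by the chain's markers into a
// map keyed by outpoint, so the BFS loop can read from memory instead of the DB.
func prefetchByMarkers(ctx context.Context, repo markerReader, markerIDs []string) (map[string]Vtxo, error) {
    vtxos, err := repo.GetVtxoChainByMarkers(ctx, markerIDs) // one bulk query (or one per marker batch)
    if err != nil {
        return nil, err
    }
    cache := make(map[string]Vtxo, len(vtxos))
    for _, v := range vtxos {
        cache[v.Outpoint.String()] = v
    }
    return cache, nil
}

// getVtxoFromCacheOrDB serves from the prefetched cache and only falls back to
// the repository for outpoints the markers did not cover.
func getVtxoFromCacheOrDB(ctx context.Context, repo vtxoReader, cache map[string]Vtxo, op Outpoint) (Vtxo, error) {
    if v, ok := cache[op.String()]; ok {
        return v, nil
    }
    vtxos, err := repo.GetVtxos(ctx, []Outpoint{op})
    if err != nil {
        return Vtxo{}, err
    }
    if len(vtxos) == 0 {
        return Vtxo{}, fmt.Errorf("vtxo %s not found", op)
    }
    cache[op.String()] = vtxos[0] // hydrate the cache for later hops
    return vtxos[0], nil
}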

  ---
  Sweep Operations — Massive Savings

  Before (master): Sweeping required 1 UPDATE vtxo SET swept=true per VTXO.

  After (bob/dag-1): Sweeping inserts 1 row into swept_marker — all VTXOs sharing that marker are swept instantly.
  ┌──────────────────────────────────────┬────────────────────┬────────────────────────────────┬────────────────┐
  │              Operation               │       Before       │             After              │  Improvement   │
  ├──────────────────────────────────────┼────────────────────┼────────────────────────────────┼────────────────┤
  │ Sweep 100 VTXOs (1 marker)           │ 100 UPDATEs        │ 1 INSERT                       │ 100x           │
  ├──────────────────────────────────────┼────────────────────┼────────────────────────────────┼────────────────┤
  │ Sweep 1,000 VTXOs (10 markers)       │ 1,000 UPDATEs      │ 10 INSERTs                     │ 100x           │
  ├──────────────────────────────────────┼────────────────────┼────────────────────────────────┼────────────────┤
  │ Sweep 10,000 VTXOs (100 markers)     │ 10,000 UPDATEs     │ 100 INSERTs                    │ 100x           │
  ├──────────────────────────────────────┼────────────────────┼────────────────────────────────┼────────────────┤
  │ Sweep full tree via BulkSweepMarkers │ N/A (must iterate) │ 1 recursive CTE + batch insert │ New capability │
  └──────────────────────────────────────┴────────────────────┴────────────────────────────────┴────────────────┘
  The improvement ratio is consistently ~MarkerInterval (100x) for write operations.
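
  The write-side idea reduces to collecting the distinct marker IDs behind the expired VTXOs and inserting one swept_marker row per marker. A sketch under those assumptions (interface and function names are illustrative, not the actual sweeper code):

package sweeper

import (
    "context"
    "time"
)

// sweptMarkerStore abstracts the swept_marker table: one insert marks every
// VTXO covered by markerID as swept.
type sweptMarkerStore interface {
    SweepMarker(ctx context.Context, markerID string, sweptAt int64) error
}

// bulkSweep performs one INSERT per distinct marker instead of one UPDATE per
// VTXO, which is where the ~MarkerInterval (100x) write reduction comes from.
func bulkSweep(ctx context.Context, store sweptMarkerStore, markerIDsPerVtxo [][]string) (int, error) {
    seen := make(map[string]struct{})
    sweptAt := time.Now().Unix()
    inserts := 0
    for _, ids := range markerIDsPerVtxo {
        for _, id := range ids {
            if _, ok := seen[id]; ok {
                continue // this marker already covers the sweep
            }
            seen[id] = struct{}{}
            if err := store.SweepMarker(ctx, id, sweptAt); err != nil {
                return inserts, err
            }
            inserts++
        }
    }
    return inserts, nil
}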

  ---
  ListVtxos / GetVtxos — No Query Count Change, Correctness Improvement

  The query count is unchanged (1 query before, 1 query now). What changed:

  - Before: swept was a static boolean column — fast to read but could go stale
  - After: swept is dynamically computed via an EXISTS subquery against the indexed swept_marker table — always correct, marginally more expensive per row (sketched below)
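
  Conceptually the swept flag is now derived rather than stored. The SQL backends do this with the EXISTS subquery; an equivalent application-side check, roughly what the Badger backend does, would look like this (a sketch, not the actual implementation):

package domain

// sweptMarkerSet is the set of marker IDs present in swept_marker.
type sweptMarkerSet map[string]struct{}

// isSwept derives a VTXO's swept status from its covering markers: a VTXO
// counts as swept as soon as any of its markers has been swept.
func isSwept(markerIDs []string, swept sweptMarkerSet) bool {
    for _, id := range markerIDs {
        if _, ok := swept[id]; ok {
            return true
        }
    }
    return false
}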

  ---
  Summary Table
  ┌─────────────────────────────┬─────────────────────────────────────┬──────────────────────────┐
  │          Category           │           Before → After            │          Factor          │
  ├─────────────────────────────┼─────────────────────────────────────┼──────────────────────────┤
  │ VTXO chain lookups          │ N queries → 1 bulk query (Postgres) │ ~100x fewer              │
  ├─────────────────────────────┼─────────────────────────────────────┼──────────────────────────┤
  │ Total GetVtxoChain DB calls │ ~2N → ~N + N/100                    │ ~50% reduction           │
  ├─────────────────────────────┼─────────────────────────────────────┼──────────────────────────┤
  │ Sweep writes                │ N UPDATEs → N/100 INSERTs           │ ~100x fewer              │
  ├─────────────────────────────┼─────────────────────────────────────┼──────────────────────────┤
  │ ListVtxos query count       │ 1 → 1                               │ No change                │
  ├─────────────────────────────┼─────────────────────────────────────┼──────────────────────────┤
  │ Swept status accuracy       │ Static (can be stale)               │ Dynamic (always correct) │
  └─────────────────────────────┴─────────────────────────────────────┴──────────────────────────┘
  The bottleneck remaining in GetVtxoChain is the per-hop GetOffchainTx call, which still runs O(N) times. If that were also bulk-fetched or cached, total round-trips would drop to ~O(N/100), yielding
  closer to a 100x overall reduction.


Yes, this PR has a lot of lines of code: 62% is in test files, 0.8% comes from the api-spec folder, and the remaining 37.2% is "actual" code changes (4,254 LoC).


coderabbitai bot commented Feb 10, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


Walkthrough

Adds VTXO depth and marker tracking across the stack: domain types and interfaces, DB schema/migrations and repo implementations (Postgres/SQLite/Badger), service logic to compute/create markers, marker-aware sweeping and indexer/cache optimizations, and API/protobuf/schema extensions for Depth.

Changes

  • API Spec & Protobuf (api-spec/openapi/swagger/ark/v1/*.openapi.json, api-spec/protobuf/ark/v1/*.proto)
    Added depth (uint32) to Vtxo/IndexerVtxo schemas and protobuf messages.
  • Domain Types & Interfaces (internal/core/domain/marker.go, internal/core/domain/vtxo.go, internal/core/domain/marker_repo.go, internal/core/domain/vtxo_repo.go)
    Introduced Marker/SweptMarker, MarkerInterval; added Depth and MarkerIDs to Vtxo; expanded/renamed MarkerRepository methods (bulk ops, traversal, UpdateVtxoMarkers); removed VtxoRepository.SweepVtxos.
  • Application: Indexer & Cache (internal/core/application/indexer.go, internal/core/application/indexer_test.go)
    Added prefetchVtxosByMarkers and getVtxosFromCacheOrDB for marker-chain traversal and cache-first VTXO retrieval; indexer uses cache optimization; tests added.
  • Application: Service & Utils (internal/core/application/service.go, internal/core/application/utils.go, internal/core/application/service_test.go, internal/core/application/utils_test.go)
    Compute new Vtxo.Depth = max(spent.Depth)+1, derive/create markers at boundaries, initialize Depth/MarkerIDs for new VTXOs, and assign marker IDs for dust/root VTXOs; tests for depth/marker logic added.
  • Application: Sweeper (internal/core/application/sweeper.go, internal/core/application/sweeper_test.go)
    Replaced per-VTXO sweep with marker-based bulk sweep (BulkSweepMarkers): aggregate marker IDs from VTXOs and call bulk sweep; extensive sweeper tests added.
  • Repo Manager & Service wiring (internal/core/ports/repo_manager.go, internal/infrastructure/db/service.go)
    Added RepoManager.Markers(); wired MarkerRepository into DB service lifecycle; create marker stores and integrate root-marker creation and marker-based sweeping.
  • Postgres DB layer (internal/infrastructure/db/postgres/*.go, internal/infrastructure/db/postgres/sqlc/*, internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.*)
    Added marker and swept_marker tables, depth and markers columns on vtxo, indexes and views; extended sqlc query surface (UpsertMarker, SelectMarkers*, GetDescendantMarkerIds, UpsertVtxo includes depth/markers, UpdateVtxoMarkers, SelectVtxosByDepthRange/ByMarker/ByArkTxid); Postgres marker repo implemented.
  • SQLite DB layer (internal/infrastructure/db/sqlite/*.go, internal/infrastructure/db/sqlite/sqlc/*, internal/infrastructure/db/sqlite/migration/20260210000000_add_depth_and_markers.*)
    Parallel SQLite marker repo and sqlc/query changes: depth/markers columns, marker/swept_marker tables, recursive descendant query, UpsertVtxo/UpdateVtxoMarkers, repo implementation and migrations.
  • Badger DB layer (internal/infrastructure/db/badger/marker_repo.go, internal/infrastructure/db/badger/vtxo_repo.go)
    New Badger-backed marker repository and updates to Vtxo repository type/receivers; store accessors added; marker repo supports marker and swept operations.
  • GRPC / Interface layer (internal/interface/grpc/handlers/parser.go, internal/interface/grpc/handlers/indexer.go, internal/interface/grpc/handlers/parser_test.go)
    Populate Depth when converting domain.Vtxo → proto Vtxo/IndexerVtxo; tests updated to assert Depth propagation.
  • SQLC Models & Queries (generated artifacts) (internal/infrastructure/db/*/sqlc/queries/models.go, .../query.sql.go)
    Added Marker and SweptMarker types; many generated structs and queries extended to include Depth and Markers; UpsertVtxo and related params updated; new marker query methods added.
  • Tests & E2E notes (internal/infrastructure/db/service_test.go, internal/test/e2e/*, many *_test.go files across packages)
    Extensive unit tests for marker/depth behavior added across service, indexer, sweeper, utils, parser; e2e test TODOs added pending SDK proto exposure for IndexerVtxo.Depth.
  • Minor formatting/refactor (cmd/arkd/commands.go, internal/core/application/fraud.go, internal/infrastructure/tx-builder/...)
    Small formatting or non-functional refactors; round_repo swept derivation adjusted to integer handling.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant Service
    participant MarkerRepo
    participant VtxoRepo
    participant Database

    Client->>Service: ProcessOffchainTx(spentVtxos, outputs)
    Service->>Service: maxDepth = max(spentVtxos.Depth) + 1
    Service->>Service: atBoundary = IsAtMarkerBoundary(maxDepth)
    alt atBoundary
        Service->>MarkerRepo: Add/Upsert marker(depth=maxDepth, parents)
        MarkerRepo->>Database: INSERT/UPDATE marker
        Service->>Service: newVtxos.MarkerIDs = [newMarkerID]
    else notBoundary
        Service->>Service: newVtxos.MarkerIDs = parentMarkerIDs
    end
    Service->>VtxoRepo: Upsert newVtxos (include Depth, MarkerIDs)
    VtxoRepo->>Database: INSERT/UPDATE vtxo rows (depth, markers)
    Client->>Service: GetVtxoChain(startOutpoint)
    Service->>MarkerRepo: prefetchVtxosByMarkers(startOutpoint)
    MarkerRepo->>Database: Traverse markers / GetVtxoChainByMarkers
    MarkerRepo-->>Service: vtxoCache (outpoint -> Vtxo)
    Service->>VtxoRepo: getVtxosFromCacheOrDB(outpoints, cache)
    VtxoRepo->>Database: Fetch missing outpoints only
    VtxoRepo-->>Service: Vtxos
    Service-->>Client: Return full chain

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related issues

  • Scale the DAG #833: Implements the marker/depth design (Vtxo.Depth, MarkerIDs, marker tables/repos, marker-aware queries, bulk marker sweeping) matching the issue objectives.

Possibly related PRs

Suggested reviewers

  • altafan
  • sekulicd
  • Kukks
🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage: ⚠️ Warning. Docstring coverage is 34.94%, which is below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (3 passed)

  • Merge Conflict Detection: ✅ Passed. No merge conflicts detected when merging into master.
  • Description Check: ✅ Passed. Check skipped: CodeRabbit's high-level summary is enabled.
  • Title Check: ✅ Passed. The title 'Scale the DAG' is concise and directly reflects the main change: adding a depth-based marker system to scale VTXO traversal in the DAG structure.


@bitcoin-coder-bob
Collaborator Author

@coderabbitai review


coderabbitai bot commented Feb 10, 2026

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 4

🤖 Fix all issues with AI agents
In `@internal/core/application/indexer.go`:
- Around line 408-413: The loop that walks parent markers (using
marker.ParentMarkerIDs and i.repoManager.Markers().GetMarker) can infinite-loop
on cyclic parent chains; add a visited set (map[string]struct{}) keyed by marker
ID and check it before appending IDs or calling GetMarker to break cycles, and
also deduplicate ParentMarkerIDs when appending to markerIDs so you don't re-add
the same ID; update the loop to mark the current marker ID as visited, skip any
parent IDs already visited, and stop traversal if the next parent is seen.

In `@internal/infrastructure/db/badger/marker_repo.go`:
- Around line 501-514: GetVtxoChainByMarkers currently does a full table scan
via r.vtxoStore.Find(&dtos, &badgerhold.Query{}) and filters in-memory; change
it to query by marker IDs to avoid loading all vtxos: iterate markerIDs (or
batch them) and call r.vtxoStore.Find with badgerhold.Where("MarkerID").Eq(id)
for each id (or badgerhold.Where("MarkerID").In(batch) if supported), collect
matched vtxoDTOs, convert dto.Vtxo into the vtxos slice and return; ensure you
still respect markerIDSet and handle errors per-query.

In
`@internal/infrastructure/db/postgres/migration/20260210100000_add_vtxo_depth.up.sql`:
- Around line 8-13: The Postgres view vtxo_vw currently returns NULL for the
commitments column when no rows exist because it uses
string_agg(vc.commitment_txid, ','); change the SELECT to wrap string_agg with
COALESCE (e.g., COALESCE(string_agg(...), '')) so commitments always yields an
empty string like the SQLite view; update the SELECT that references vtxo,
vtxo_commitment_txid and the commitments alias to use
COALESCE(string_agg(vc.commitment_txid, ','), '').

In `@internal/infrastructure/db/service.go`:
- Around line 752-821: sweepVtxosWithMarkers currently marks a marker swept
before guaranteeing the marker's VTXOs were successfully removed, risking
inconsistent state; change the ordering so you attempt to sweep the marker's
VTXOs first (use markerStore.SweepVtxosByMarker and fall back to
vtxoStore.SweepVtxos for markerVtxos[markerID]) and only if that sweep returns
success call markerStore.SweepMarker(markerID, sweptAt); on any sweep error keep
the marker unmarked, log the failure, and accumulate the fallback count as now
done — update the loop in sweepVtxosWithMarkers to perform
SweepVtxosByMarker/SweepVtxos before calling SweepMarker and adjust error
handling accordingly.
🧹 Nitpick comments (12)
internal/test/e2e/utils_test.go (1)

742-744: Acknowledge the TODO placeholder.

The TODO is clear about the dependency on the SDK proto package exposing Depth. Consider tracking this with a GitHub issue so it doesn't get lost.

Would you like me to open a GitHub issue to track re-enabling setupRawIndexerClient and getVtxoDepthByOutpoint once the SDK proto exposes Depth?

internal/core/domain/marker_test.go (2)

9-35: Good coverage of boundary cases.

The table-driven test covers a solid range including edges (0, 99, 100, 101). Consider using t.Run with a subtest name for each case to get more granular test output on failure.

♻️ Optional: use subtests for better diagnostics
 	for _, tt := range tests {
+		t.Run(fmt.Sprintf("depth_%d", tt.depth), func(t *testing.T) {
 		result := IsAtMarkerBoundary(tt.depth)
 		require.Equal(t, tt.expected, result,
 			"IsAtMarkerBoundary(%d) should be %v", tt.depth, tt.expected)
+		})
 	}

(You'd need to add "fmt" to imports.)


42-55: These tests only verify struct literal construction, not behavior.

TestMarkerStruct and TestSweptMarkerStruct test that Go struct fields hold the values you assign — they don't test any domain logic. They're fine as documentation of the data model but provide no regression protection. Consider adding tests for actual marker operations (creation, parent resolution, etc.) as the marker logic matures.

internal/infrastructure/db/postgres/migration/20260211020000_add_markers.up.sql (1)

2-6: Consider adding NOT NULL DEFAULT '[]'::jsonb to parent_markers.

The column currently allows NULL, which means application code must handle both NULL and empty array. Using a NOT NULL default simplifies queries and Go code that deserializes this field.

♻️ Suggested change
 CREATE TABLE IF NOT EXISTS marker (
     id TEXT PRIMARY KEY,
     depth INTEGER NOT NULL,
-    parent_markers JSONB  -- JSON array of parent marker IDs
+    parent_markers JSONB NOT NULL DEFAULT '[]'::jsonb  -- JSON array of parent marker IDs
 );
internal/core/domain/marker_repo.go (2)

5-44: Large interface — consider whether it could be split, and watch for unbounded queries.

The interface has 16 methods mixing marker lifecycle, sweep operations, and VTXO queries. This is functional but may violate the Interface Segregation Principle as it grows.

More concretely, methods like GetVtxosByMarker, GetVtxosByDepthRange, and GetVtxoChainByMarkers (lines 30, 37, 41) return unbounded []Vtxo slices. If marker/depth ranges can span many VTXOs, callers may hit memory pressure. Consider whether pagination or a limit parameter is warranted for these, especially GetVtxosByDepthRange which could span a very wide range.
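
If pagination turns out to be warranted, a cursor-style variant could look roughly like this (illustration only, not the current interface; Vtxo stands in for the existing domain type):

package domain

import "context"

// Vtxo stands in for the existing domain.Vtxo type.
type Vtxo struct{}

// PagedVtxoQueries is an illustration only, not the current MarkerRepository.
// A limit plus cursor keeps result sets bounded for wide depth ranges.
type PagedVtxoQueries interface {
    // GetVtxosByDepthRangePage returns at most limit VTXOs with
    // minDepth <= depth <= maxDepth, starting after cursor ("" for the first
    // page), along with the cursor to pass for the next page ("" when done).
    GetVtxosByDepthRangePage(
        ctx context.Context, minDepth, maxDepth uint32, limit int, cursor string,
    ) (vtxos []Vtxo, nextCursor string, err error)
}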


6-7: Clarify upsert semantics in the doc comment.

The comment says "creates or updates a marker," but the method signature uses error as the only signal. It may be useful to document whether an update replaces ParentMarkerIDs entirely or merges, and whether updating a marker that has already been swept is allowed.

internal/infrastructure/db/sqlite/migration/20260211000000_add_markers.up.sql (1)

2-6: Consider adding an index on parent_markers for BFS descendant lookups.

The marker table stores parent_markers as a JSON text column. The Badger implementation does BFS by querying markers whose ParentMarkerIDs contains a given ID. If a similar query pattern is used in SQLite (e.g., using json_each to find children), performance could degrade without an index strategy. This is fine for now if the query load is low, but worth keeping in mind.

internal/infrastructure/db/badger/marker_repo.go (2)

42-106: Constructor uses interface{} variadic config — consider a typed options struct.

The NewMarkerRepository(config ...interface{}) pattern with positional interface{} arguments is fragile and hard to use correctly. While this matches the existing codebase pattern (e.g., NewVtxoRepository), a typed config struct would be safer. This is fine for now if consistency with the existing pattern is preferred.
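
For illustration, a typed options struct might look like this (all names are made up, not the existing API):

package badgerdb

// markerRepoOptions replaces the positional interface{} arguments with named,
// typed fields (all names here are illustrative).
type markerRepoOptions struct {
    baseDir  string                           // on-disk location; ignored when inMemory is true
    inMemory bool                             // use an in-memory badger store
    logger   func(format string, args ...any) // optional warning logger; nil disables logging
}

type markerRepository struct {
    opts markerRepoOptions
}

func newMarkerRepositoryWithOptions(opts markerRepoOptions) *markerRepository {
    // A struct makes zero values explicit and prevents callers from passing
    // arguments in the wrong order, unlike a positional interface{} slice.
    return &markerRepository{opts: opts}
}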


116-136: Retry loop doesn't respect context cancellation.

The retry loops (here and in similar patterns at lines 243, 390, 435) sleep unconditionally without checking ctx.Done(). If the context is cancelled, the function will still retry up to maxRetries times with 100ms sleeps. This is a minor concern given the small retry count.
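
A context-aware variant of the retry helper could look like this (a sketch only; the real retry constants and error handling live in the Badger repositories):

package badgerdb

import (
    "context"
    "time"
)

// retryWithContext retries op up to maxRetries times with a fixed delay, but
// aborts as soon as ctx is cancelled instead of sleeping through the backoff.
func retryWithContext(ctx context.Context, maxRetries int, delay time.Duration, op func() error) error {
    var err error
    for attempt := 0; attempt < maxRetries; attempt++ {
        if err = op(); err == nil {
            return nil
        }
        select {
        case <-ctx.Done():
            return ctx.Err()
        case <-time.After(delay):
            // try again
        }
    }
    return err
}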

internal/infrastructure/db/sqlite/marker_repo.go (1)

141-166: Make descendant sweeping atomic to avoid partial state.

If an insert fails mid-loop, some markers are swept and others aren’t. Wrapping the inserts in a single transaction avoids partial sweeps and reduces round-trips.

♻️ Suggested transaction wrapper
 func (m *markerRepository) SweepMarkerWithDescendants(
 	ctx context.Context,
 	markerID string,
 	sweptAt int64,
 ) (int64, error) {
 	// Get all descendant marker IDs (including the root marker) that are not already swept
 	descendantIDs, err := m.querier.GetDescendantMarkerIds(ctx, markerID)
 	if err != nil {
 		return 0, fmt.Errorf("failed to get descendant markers: %w", err)
 	}
 
-	// Insert each descendant into swept_marker
-	var count int64
-	for _, id := range descendantIDs {
-		err := m.querier.InsertSweptMarker(ctx, queries.InsertSweptMarkerParams{
-			MarkerID: id,
-			SweptAt:  sweptAt,
-		})
-		if err != nil {
-			return count, fmt.Errorf("failed to sweep marker %s: %w", id, err)
-		}
-		count++
-	}
-
-	return count, nil
+	tx, err := m.db.BeginTx(ctx, nil)
+	if err != nil {
+		return 0, err
+	}
+	q := queries.New(tx)
+	defer tx.Rollback()
+
+	var count int64
+	for _, id := range descendantIDs {
+		if err := q.InsertSweptMarker(ctx, queries.InsertSweptMarkerParams{
+			MarkerID: id,
+			SweptAt:  sweptAt,
+		}); err != nil {
+			return count, fmt.Errorf("failed to sweep marker %s: %w", id, err)
+		}
+		count++
+	}
+	if err := tx.Commit(); err != nil {
+		return 0, err
+	}
+	return count, nil
 }
internal/infrastructure/db/postgres/marker_repo.go (1)

144-169: Make descendant sweeping atomic to avoid partial state.

Same concern as the sqlite implementation—if the loop fails mid-way, markers can be partially swept.

♻️ Suggested transaction wrapper
 func (m *markerRepository) SweepMarkerWithDescendants(
 	ctx context.Context,
 	markerID string,
 	sweptAt int64,
 ) (int64, error) {
 	// Get all descendant marker IDs (including the root marker) that are not already swept
 	descendantIDs, err := m.querier.GetDescendantMarkerIds(ctx, markerID)
 	if err != nil {
 		return 0, fmt.Errorf("failed to get descendant markers: %w", err)
 	}
 
-	// Insert each descendant into swept_marker
-	var count int64
-	for _, id := range descendantIDs {
-		err := m.querier.InsertSweptMarker(ctx, queries.InsertSweptMarkerParams{
-			MarkerID: id,
-			SweptAt:  sweptAt,
-		})
-		if err != nil {
-			return count, fmt.Errorf("failed to sweep marker %s: %w", id, err)
-		}
-		count++
-	}
-
-	return count, nil
+	tx, err := m.db.BeginTx(ctx, nil)
+	if err != nil {
+		return 0, err
+	}
+	q := queries.New(tx)
+	defer tx.Rollback()
+
+	var count int64
+	for _, id := range descendantIDs {
+		if err := q.InsertSweptMarker(ctx, queries.InsertSweptMarkerParams{
+			MarkerID: id,
+			SweptAt:  sweptAt,
+		}); err != nil {
+			return count, fmt.Errorf("failed to sweep marker %s: %w", id, err)
+		}
+		count++
+	}
+	if err := tx.Commit(); err != nil {
+		return 0, err
+	}
+	return count, nil
 }
internal/infrastructure/db/sqlite/sqlc/query.sql (1)

467-480: Replace LIKE-based JSON matching with json_each() for robustness.

The recursive CTE uses m.parent_markers LIKE '%"' || dm.id || '"%' to check if a marker ID exists in a JSON array. While this works in practice, it's fragile: marker IDs containing SQL LIKE wildcards (%, _) would cause incorrect matches since the code doesn't escape them. The Postgres version correctly uses @> jsonb_build_array(dm.id) (line 473); SQLite should use the equivalent json_each() for consistency and correctness:

Suggested replacement
-    SELECT m.id FROM marker m
-    INNER JOIN descendant_markers dm ON (
-        m.parent_markers LIKE '%"' || dm.id || '"%'
-    )
+    SELECT m.id FROM marker m
+    INNER JOIN descendant_markers dm ON EXISTS (
+        SELECT 1 FROM json_each(m.parent_markers) je WHERE je.value = dm.id
+    )

Comment on lines +8 to +13
CREATE VIEW vtxo_vw AS
SELECT v.*, string_agg(vc.commitment_txid, ',') AS commitments
FROM vtxo v
LEFT JOIN vtxo_commitment_txid vc
ON v.txid = vc.vtxo_txid AND v.vout = vc.vtxo_vout
GROUP BY v.txid, v.vout;

⚠️ Potential issue | 🟡 Minor

Inconsistent NULL handling between Postgres and SQLite views.

The SQLite vtxo_vw uses COALESCE(group_concat(...), '') (returns empty string when no commitments), but this Postgres view uses bare string_agg(...) (returns NULL when no commitments). This can cause behavioral differences across backends if downstream code doesn't uniformly handle both NULL and empty string.

Consider wrapping with COALESCE for consistency:

Suggested fix
 CREATE VIEW vtxo_vw AS
-SELECT v.*, string_agg(vc.commitment_txid, ',') AS commitments
+SELECT v.*, COALESCE(string_agg(vc.commitment_txid, ','), '') AS commitments
 FROM vtxo v
 LEFT JOIN vtxo_commitment_txid vc
 ON v.txid = vc.vtxo_txid AND v.vout = vc.vtxo_vout
 GROUP BY v.txid, v.vout;

Comment on lines 752 to 821
// sweepVtxosWithMarkers performs marker-based sweeping for VTXOs.
// It groups VTXOs by their marker, sweeps each marker, then bulk-updates all VTXOs.
// Returns the total count of VTXOs swept.
func (s *service) sweepVtxosWithMarkers(
    ctx context.Context,
    vtxoOutpoints []domain.Outpoint,
) int64 {
    if len(vtxoOutpoints) == 0 {
        return 0
    }

    // Get VTXOs to find their markers
    vtxos, err := s.vtxoStore.GetVtxos(ctx, vtxoOutpoints)
    if err != nil {
        log.WithError(err).Warn("failed to get vtxos for marker-based sweep")
        // Fall back to individual sweep
        count, _ := s.vtxoStore.SweepVtxos(ctx, vtxoOutpoints)
        return int64(count)
    }

    // Group VTXOs by marker ID
    markerVtxos := make(map[string][]domain.Outpoint)
    noMarkerVtxos := make([]domain.Outpoint, 0)

    for _, vtxo := range vtxos {
        if vtxo.MarkerID != "" {
            markerVtxos[vtxo.MarkerID] = append(markerVtxos[vtxo.MarkerID], vtxo.Outpoint)
        } else {
            noMarkerVtxos = append(noMarkerVtxos, vtxo.Outpoint)
        }
    }

    var totalSwept int64
    sweptAt := time.Now().Unix()

    // Sweep each marker
    for markerID := range markerVtxos {
        // Mark the marker as swept
        if err := s.markerStore.SweepMarker(ctx, markerID, sweptAt); err != nil {
            log.WithError(err).Warnf("failed to sweep marker %s", markerID)
            // Fall back to individual sweep for this marker's VTXOs
            count, _ := s.vtxoStore.SweepVtxos(ctx, markerVtxos[markerID])
            totalSwept += int64(count)
            continue
        }

        // Bulk sweep all VTXOs with this marker
        count, err := s.markerStore.SweepVtxosByMarker(ctx, markerID)
        if err != nil {
            log.WithError(err).Warnf("failed to bulk sweep vtxos for marker %s", markerID)
            // Fall back to individual sweep
            count, _ := s.vtxoStore.SweepVtxos(ctx, markerVtxos[markerID])
            totalSwept += int64(count)
            continue
        }
        totalSwept += count
        log.Debugf("swept marker %s with %d vtxos", markerID, count)
    }

    // Sweep VTXOs without markers individually
    if len(noMarkerVtxos) > 0 {
        count, err := s.vtxoStore.SweepVtxos(ctx, noMarkerVtxos)
        if err != nil {
            log.WithError(err).Warn("failed to sweep vtxos without markers")
        }
        totalSwept += int64(count)
    }

    return totalSwept
}

⚠️ Potential issue | 🟠 Major

Avoid marking markers swept before VTXO sweep succeeds.

If SweepVtxosByMarker (and the fallback) fails, the marker can end up marked swept while VTXOs remain unswept. That can leave an inconsistent state and block later sweep logic. Consider sweeping VTXOs first, then recording the marker sweep only on success.

🐛 Suggested ordering change
 	for markerID := range markerVtxos {
-		// Mark the marker as swept
-		if err := s.markerStore.SweepMarker(ctx, markerID, sweptAt); err != nil {
-			log.WithError(err).Warnf("failed to sweep marker %s", markerID)
-			// Fall back to individual sweep for this marker's VTXOs
-			count, _ := s.vtxoStore.SweepVtxos(ctx, markerVtxos[markerID])
-			totalSwept += int64(count)
-			continue
-		}
-
-		// Bulk sweep all VTXOs with this marker
-		count, err := s.markerStore.SweepVtxosByMarker(ctx, markerID)
-		if err != nil {
-			log.WithError(err).Warnf("failed to bulk sweep vtxos for marker %s", markerID)
-			// Fall back to individual sweep
-			count, _ := s.vtxoStore.SweepVtxos(ctx, markerVtxos[markerID])
-			totalSwept += int64(count)
-			continue
-		}
-		totalSwept += count
-		log.Debugf("swept marker %s with %d vtxos", markerID, count)
+		// Bulk sweep all VTXOs with this marker
+		count, err := s.markerStore.SweepVtxosByMarker(ctx, markerID)
+		if err != nil {
+			log.WithError(err).Warnf("failed to bulk sweep vtxos for marker %s", markerID)
+			// Fall back to individual sweep
+			fallbackCount, ferr := s.vtxoStore.SweepVtxos(ctx, markerVtxos[markerID])
+			if ferr != nil {
+				log.WithError(ferr).Warnf("failed to sweep vtxos for marker %s", markerID)
+				continue
+			}
+			count = int64(fallbackCount)
+		}
+		totalSwept += count
+		if err := s.markerStore.SweepMarker(ctx, markerID, sweptAt); err != nil {
+			log.WithError(err).Warnf("failed to sweep marker %s", markerID)
+		}
+		log.Debugf("swept marker %s with %d vtxos", markerID, count)
 	}

@bitcoin-coder-bob
Collaborator Author

@coderabbitai review


coderabbitai bot commented Feb 12, 2026

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 13

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
internal/infrastructure/db/badger/vtxo_repo.go (2)

101-119: ⚠️ Potential issue | 🟠 Major

GetVtxos returns nil, nil when any single outpoint is missing, discarding all previously collected vtxos.

getVtxo returns (nil, nil) for ErrNotFound (Line 476-477), so the strings.Contains(err.Error(), "not found") check on Line 108 is dead code for that path. Instead, execution falls through to Line 113 where vtxo == nil triggers return nil, nil, silently dropping all vtxos already appended. This should continue instead, matching the likely intent of skipping missing outpoints.

Proposed fix
 	for _, outpoint := range outpoints {
 		vtxo, err := r.getVtxo(ctx, outpoint)
 		if err != nil {
-			if strings.Contains(err.Error(), "not found") {
-				continue
-			}
 			return nil, err
 		}
 		if vtxo == nil {
-			return nil, nil
+			continue
 		}
 		vtxos = append(vtxos, *vtxo)
 	}

277-322: ⚠️ Potential issue | 🟡 Minor

Inconsistent filter: query uses Ge (>=) but post-filter uses > (strictly greater).

Lines 286-288 and 295-297 fetch vtxos with Amount >= amountFilter, but Lines 306 and 311 then exclude vtxos where Amount == amountFilter by checking vtxo.Amount > amountFilter. This means vtxos with amount exactly equal to the filter are fetched from the DB but silently dropped. Either the query should use Gt or the post-filter should use >=.

🤖 Fix all issues with AI agents
In `@internal/core/application/service_test.go`:
- Around line 149-154: The test case "no spent vtxos" expects depth 1 but the
service's logic leaves newDepth at 0 when spentOutpoints is empty; update the
test in service_test.go for the case named "no spent vtxos (empty)" to set
expectedDepth to 0 so it matches the actual behavior of the service (referencing
newDepth, maxDepth and spentOutpoints in the service implementation).

In `@internal/infrastructure/db/badger/marker_repo.go`:
- Around line 258-276: SweepMarker currently does a full table scan by calling
r.vtxoStore.Find(&allDtos, &badgerhold.Query{}) for every marker (and
BulkSweepMarkers calls SweepMarker in a loop), causing N full scans; change
SweepMarker to query only VTXOs that contain the marker by using
r.vtxoStore.Find(&filteredDtos,
badgerhold.Where("MarkerIDs").Contains(markerID)) (same pattern as
getDescendantMarkerIds), iterate filteredDtos (type vtxoDTO) and call
r.vtxoStore.Update(outpoint.String(), dto) to set Swept=true and UpdatedAt; this
ensures each marker triggers a targeted query instead of scanning all VTXOs and
avoids the N×full-scan behavior in BulkSweepMarkers.
- Around line 438-462: GetVtxosByMarker currently loads all VTXOs then filters
in memory; change the find to use an indexed query so Badger filters by
MarkerIDs: replace the badgerhold.Query{} call in
markerRepository.GetVtxosByMarker with
badgerhold.Where("MarkerIDs").Contains(markerID) (keeping the same
r.vtxoStore.Find(&dtos, query) pattern), then retain the existing loop to
compute vtxo.Swept via r.isAnyMarkerSwept(dto.MarkerIDs) and append matching
DTOs to the result slice.

In `@internal/infrastructure/db/postgres/marker_repo.go`:
- Around line 159-184: SweepMarkerWithDescendants does inserts in a loop without
a transaction, causing partial commits on failure; wrap the entire operation in
a DB transaction so either all descendant InsertSweptMarker calls succeed or
none do. Start a transaction (e.g., via m.db.BeginTx or your repo's transaction
helper), run GetDescendantMarkerIds and then perform each
queries.InsertSweptMarker using the transactional querier/context (or passing tx
into the querier methods), rollback on any error and commit at the end, and
return the count only after a successful commit; reference functions:
SweepMarkerWithDescendants, GetDescendantMarkerIds, and InsertSweptMarker.

In
`@internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.down.sql`:
- Around line 1-14: Move the view drops to before any column/table drops: drop
views intent_with_inputs_vw and vtxo_vw first, then drop index idx_vtxo_markers,
drop columns markers and depth from table vtxo, and finally drop marker and
swept_marker tables; update the script so vtxo_vw and intent_with_inputs_vw are
removed prior to altering vtxo to avoid PostgreSQL dependency errors.

In
`@internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.up.sql`:
- Around line 1-5: Change the new column definition for markers on table vtxo to
be non-nullable with a default empty JSON array by altering the ADD COLUMN
statement for markers to "ADD COLUMN IF NOT EXISTS markers JSONB NOT NULL
DEFAULT '[]'::jsonb" (keep the existing GIN index creation), and ensure any
separate backfill step that populates markers for existing rows is consistent
with this default (remove/adjust redundant backfill or ensure it uses
'[]'::jsonb for rows without markers).

In `@internal/infrastructure/db/postgres/sqlc/queries/query.sql.go`:
- Around line 1770-1776: The SQL in the constant selectVtxosByArkTxid used by
the method SelectVtxosByArkTxid filters on the wrong column (txid); update the
query string to use WHERE ark_txid = $1 (or WHERE ark_txid = `@ark_txid` in the
.sql source) so the function returns VTXOs created by the given ark transaction;
update the selectVtxosByArkTxid SQL in both the Postgres and SQLite query.sql
sources so the generated query and the Queries.SelectVtxosByArkTxid
implementation both filter on ark_txid instead of txid.
- Around line 118-131: The recursive CTE used by GetDescendantMarkerIds scans
marker.parent_markers with the jsonb containment operator (@>), causing repeated
sequential scans; add a migration that creates a GIN index on the parent_markers
column (marker.parent_markers) so the recurrence m.parent_markers @>
jsonb_build_array(dm.id) can use the index; implement the migration file that
executes CREATE INDEX IF NOT EXISTS idx_marker_parent_markers ON marker USING
GIN (parent_markers) and ensure it is applied in your migrations pipeline.

In `@internal/infrastructure/db/service.go`:
- Around line 210-219: The code appends badgerVtxoRepo.GetStore() onto
config.DataStoreConfig which can mutate the original slice's backing array;
instead create a new slice copy of config.DataStoreConfig before appending to
avoid side effects. Locate the block that builds markerConfig (using
config.DataStoreConfig, badgerVtxoRepo.GetStore() and markerStoreFactory) and
replace the direct append with creating a new slice sized to hold the elements,
copying config.DataStoreConfig into it, then append badgerVtxoRepo.GetStore() to
that new slice and pass the new slice to markerStoreFactory.
- Around line 492-496: CreateRootMarkersForVtxos failures are currently only
warned and can leave persisted VTXOs referencing missing markers; update the
block where s.markerStore.CreateRootMarkersForVtxos(ctx, newVtxos) is called to
either (a) retry the CreateRootMarkersForVtxos call with the same retry/backoff
strategy used for persisting VTXOs (mirror the loop around the VTXO
persistence), or (b) if retrying fails, return the error to fail-fast so the
caller can roll back or handle incomplete state; locate the call to
s.markerStore.CreateRootMarkersForVtxos and implement a retry loop (or propagate
the error) and ensure logs include context about the affected VTXO set when
giving up.
- Around line 538-577: The GetVtxos DB failure path leaves newDepth at 0 and
parentMarkerIDs nil which makes IsAtMarkerBoundary(0) treat chained VTXOs as
root markers; update the error path in the block that calls s.vtxoStore.GetVtxos
(inside the loop over offchainTx.CheckpointTxs and subsequent processing) to
avoid creating misleading root markers by either returning the error upward
(propagate the GetVtxos error) or setting newDepth to a sentinel (e.g., a
special unknown value) and ensuring downstream logic
(IsAtMarkerBoundary/newDepth handling) treats that sentinel as “unknown” (no
root marker creation) instead of depth 0, and document the chosen approach in
the same function where newDepth and parentMarkerIDs are computed.

In
`@internal/infrastructure/db/sqlite/migration/20260210000000_add_depth_and_markers.down.sql`:
- Around line 28-33: The down migration tries to copy the removed swept column
from vtxo into vtxo_temp causing "no such column: swept"; fix by reconstructing
swept or restoring the column before the INSERT: either add a temporary swept
column to vtxo (or vtxo_temp) prior to the INSERT (so INSERT INTO vtxo_temp
SELECT ... swept ... FROM vtxo succeeds), or change the INSERT SELECT to compute
swept from swept_marker (join vtxo with swept_marker and derive swept) so the
SELECT no longer references the missing swept column; look for symbols
vtxo_temp, vtxo, swept, swept_marker and vtxo_new when applying the change.

In `@internal/infrastructure/db/sqlite/sqlc/query.sql`:
- Around line 471-484: GetDescendantMarkerIds currently matches parent_markers
via m.parent_markers LIKE '%"' || dm.id || '"%' which is brittle (false
positives for '%'/'_' and overlapping prefixes) and forces full scans; replace
the LIKE with a JSON-aware check using SQLite's json_each (e.g., JOIN/EXISTS
over json_each(m.parent_markers) j WHERE j.value = dm.id) or, better, migrate
parent_markers to a normalized join table (parent_marker mapping) and update
descendant_markers to join that table; also add an integration test for
GetDescendantMarkerIds using marker IDs containing characters like '%'/'_' and
overlapping prefixes to ensure correctness, and document the current limitation
of the LIKE approach in the schema/query comments.
🧹 Nitpick comments (20)
internal/core/application/sweeper_test.go (2)

734-735: Non-obvious Txid values for i >= 26.

string(rune('A'+i)) for i in 0..49 produces ASCII letters A–Z for i < 26, but non-letter characters ([, \, ], …) for i >= 26. This doesn't break the test (uniqueness is preserved), but fmt.Sprintf("child-%d", i) would be clearer and consistent with TestCreateCheckpointSweepTask_LargeMarkerSet (line 1189).

Suggested fix
-		childOutpoints[i] = domain.Outpoint{Txid: "child" + string(rune('A'+i)), VOut: 0}
+		childOutpoints[i] = domain.Outpoint{Txid: fmt.Sprintf("child-%d", i), VOut: 0}

22-158: Consider generating mocks to reduce boilerplate.

~400 lines of hand-rolled mocks for WalletService, VtxoRepository, MarkerRepository, and TxBuilder. Most methods are stubs returning zero values. Using a tool like mockery or counterfeiter would auto-generate these, reduce maintenance burden as interfaces evolve, and keep the test file focused on test logic.

internal/core/domain/marker_repo.go (1)

41-47: VTXO retrieval methods on MarkerRepository blur the boundary with VtxoRepository.

GetVtxosByDepthRange, GetVtxosByArkTxid, and GetVtxoChainByMarkers return []Vtxo and are essentially VTXO queries. Placing them here is understandable since they're marker/depth-optimized, but it means callers now need to know which repository to ask for VTXOs depending on the query pattern. If the interface continues to grow, consider whether a dedicated chain-traversal service or moving these to VtxoRepository with marker-aware implementations would keep the boundaries cleaner.

internal/infrastructure/db/sqlite/sqlc/query.sql (1)

261-270: Liquidity queries now scan every vtxo row with a correlated LIKE subquery.

SelectExpiringLiquidityAmount and SelectRecoverableLiquidityAmount both use EXISTS (SELECT 1 FROM swept_marker sm WHERE v.markers LIKE '%"' || sm.marker_id || '"%'). This is essentially a cross join between vtxo and swept_marker with a LIKE predicate on every pair — O(vtxos × swept_markers) string scans per query. As the number of swept markers grows, these queries will degrade.

Consider caching swept status on the vtxo row itself (a denormalized swept flag updated during BulkSweepMarkers), or evaluating sweep status in the application layer where the marker set is already available.

Also applies to: 273-279

internal/infrastructure/db/postgres/sqlc/query.sql (1)

500-514: Inconsistent projection: SELECT * vs sqlc.embed(vtxo_vw) across vtxo queries.

SelectVtxosByDepthRange, SelectVtxosByArkTxid, and SelectVtxoChainByMarker use SELECT * FROM vtxo_vw, while all other vtxo queries (e.g., SelectAllVtxos, SelectVtxo, SelectSweepableUnrolledVtxos) use SELECT sqlc.embed(vtxo_vw) FROM vtxo_vw. This generates different Go return types — flat structs vs. nested struct { VtxoVw VtxoVw } — requiring different mapping code in the repository layer.

Consider using sqlc.embed(vtxo_vw) consistently so the generated Go types are uniform.

Suggested fix
 -- name: SelectVtxosByDepthRange :many
 -- Get all VTXOs within a depth range, useful for filling gaps between markers
-SELECT * FROM vtxo_vw
+SELECT sqlc.embed(vtxo_vw) FROM vtxo_vw
 WHERE depth >= `@min_depth` AND depth <= `@max_depth`
 ORDER BY depth DESC;
 
 -- name: SelectVtxosByArkTxid :many
 -- Get all VTXOs created by a specific ark tx (offchain tx)
-SELECT * FROM vtxo_vw WHERE txid = `@ark_txid`;
+SELECT sqlc.embed(vtxo_vw) FROM vtxo_vw WHERE txid = `@ark_txid`;
 
 -- name: SelectVtxoChainByMarker :many
 -- Get VTXOs whose markers JSONB array contains any of the given marker IDs
-SELECT * FROM vtxo_vw
+SELECT sqlc.embed(vtxo_vw) FROM vtxo_vw
 WHERE markers ?| `@marker_ids`::TEXT[]
 ORDER BY depth DESC;
internal/infrastructure/db/sqlite/vtxo_repo.go (1)

538-548: Silent error swallowing in parseMarkersJSONFromVtxo could mask data corruption.

If the JSON in the markers column is malformed, this function silently returns nil without any logging. While defensive, this could make it hard to diagnose data integrity issues.

Consider adding a log warning on parse failure, consistent with how other parse errors are handled elsewhere in the codebase.

Optional: add warning log on parse failure
 func parseMarkersJSONFromVtxo(markersJSON string) []string {
 	if markersJSON == "" {
 		return nil
 	}
 	var markerIDs []string
 	if err := json.Unmarshal([]byte(markersJSON), &markerIDs); err != nil {
+		// Log warning to help diagnose data corruption
+		log.WithError(err).Warn("failed to parse markers JSON from vtxo")
 		return nil
 	}
 	return markerIDs
 }
internal/core/application/indexer_test.go (1)

15-299: Consider using a mock generation tool to reduce boilerplate.

The manual mock implementations (~280 lines of stubs) are correct but add significant maintenance burden. Tools like mockery or moq could auto-generate these from the interfaces and keep them in sync as the repository interfaces evolve.

That said, the explicit nil-interface handling in Markers() (lines 288–294) is a valuable pattern worth keeping regardless.

internal/infrastructure/db/badger/marker_repo.go (1)

524-528: Dead error handling — assign-then-discard pattern is misleading.

Line 525 calls r.SweepMarker(...) and assigns to err, then line 528 discards it with _ = err. This is confusing — use _ = directly on the call.

Simplify the error discard
-	if err := r.SweepMarker(ctx, markerID, time.Now().Unix()); err != nil {
-		// Non-fatal - the vtxos are already marked as swept
-		_ = err
-	}
+	// Non-fatal - the vtxos are already marked as swept
+	_ = r.SweepMarker(ctx, markerID, time.Now().Unix())
internal/infrastructure/db/sqlite/marker_repo.go (2)

398-508: Four nearly identical rowToVtxoFrom* functions — consider a shared mapper.

rowToVtxoFromMarkerQuery, rowToVtxoFromDepthRangeQuery, rowToVtxoFromArkTxidQuery, and rowToVtxoFromChainQuery all perform the same mapping from VtxoVw embedded in different sqlc row types. Since the inner row.VtxoVw is the same type (queries.VtxoVw), you could extract a shared vtxoVwToDomain(vw queries.VtxoVw) domain.Vtxo and call it from each wrapper, reducing ~100 lines of duplication.

Note that vtxo_repo.go already has rowToVtxo(row queries.VtxoVw) which does essentially the same mapping — you could reuse that directly.

Consolidate using the existing rowToVtxo from vtxo_repo.go
 func rowToVtxoFromMarkerQuery(row queries.SelectVtxosByMarkerIdRow) domain.Vtxo {
-	var commitmentTxids []string
-	if commitments, ok := row.VtxoVw.Commitments.(string); ok && commitments != "" {
-		commitmentTxids = strings.Split(commitments, ",")
-	}
-	return domain.Vtxo{
-		Outpoint: domain.Outpoint{
-			Txid: row.VtxoVw.Txid,
-			VOut: uint32(row.VtxoVw.Vout),
-		},
-		// ... all fields ...
-	}
+	return rowToVtxo(row.VtxoVw)
 }

Apply the same pattern to all four functions.


510-519: Duplicate parseMarkersJSON — already exists as parseMarkersJSONFromVtxo in vtxo_repo.go.

Both functions in this package have identical logic. Consolidate into a single shared function.

internal/core/application/indexer.go (2)

416-431: Consider batching GetMarker calls to reduce DB round-trips during BFS.

Each iteration of the BFS loop issues an individual GetMarker DB call (line 420). For deep marker chains (e.g., depth 20000 with markers every 100 levels = ~200 markers), this results in ~200 sequential queries. A batch approach using GetMarkersByIds on the current queue batch would be significantly faster.

♻️ Sketch of batched BFS
 	for len(queue) > 0 {
-		currentID := queue[0]
-		queue = queue[1:]
-
-		marker, err := i.repoManager.Markers().GetMarker(ctx, currentID)
-		if err != nil || marker == nil {
+		// Fetch all markers in current queue batch at once
+		batch := queue
+		queue = nil
+		markers, err := i.repoManager.Markers().GetMarkersByIds(ctx, batch)
+		if err != nil {
 			continue
 		}
-
-		for _, parentID := range marker.ParentMarkerIDs {
-			if !visited[parentID] {
-				visited[parentID] = true
-				markerIDs = append(markerIDs, parentID)
-				queue = append(queue, parentID)
+		for _, marker := range markers {
+			for _, parentID := range marker.ParentMarkerIDs {
+				if !visited[parentID] {
+					visited[parentID] = true
+					markerIDs = append(markerIDs, parentID)
+					queue = append(queue, parentID)
+				}
 			}
 		}
 	}

465-474: Cache is mutated via the cache map parameter — document this side effect.

getVtxosFromCacheOrDB updates the caller's map in-place (line 473). This is correct for the current usage pattern, but the mutation is non-obvious. A brief doc note on the side effect would improve maintainability.
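For example, a short comment like the following would surface the behavior to future readers (wording is only a suggestion; the function body stays unchanged):

// getVtxosFromCacheOrDB returns the requested vtxos, serving hits from the
// supplied cache and falling back to the repository for misses.
// NOTE: the cache map is mutated in place; vtxos fetched from the DB are
// added to it so later lookups in the same traversal avoid another query.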

internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.up.sql (2)

73-83: Correlated EXISTS subquery in the view may degrade as swept_marker grows.

The vtxo_vw view computes swept via EXISTS (SELECT 1 FROM swept_marker sm WHERE v.markers @> jsonb_build_array(sm.marker_id)). This scans swept_marker for each VTXO row. While the GIN index on markers helps each containment check, the net effect is a semi-join in which swept_marker is iterated once per vtxo. As the number of swept markers grows, this may become expensive for queries that touch many VTXOs.

Consider whether a reverse lookup (joining vtxo markers against swept_marker PK) or a materialized approach would scale better for your expected data volumes.


25-30: Remove intermediate view creation — they are dropped and recreated without ever being used.

The views created at lines 25-30 and 32-40 are dropped at lines 65-66 before being recreated at lines 73-93. The backfill queries (lines 44-62) query vtxo directly, so these intermediate views are never referenced and can be removed to simplify the migration.

♻️ Simplified migration flow
-- Drop views before dropping the swept column (views depend on it via v.*)
-DROP VIEW IF EXISTS intent_with_inputs_vw;
-DROP VIEW IF EXISTS vtxo_vw;
-
-CREATE VIEW vtxo_vw AS
-SELECT v.*, string_agg(vc.commitment_txid, ',') AS commitments
-FROM vtxo v
-LEFT JOIN vtxo_commitment_txid vc
-ON v.txid = vc.vtxo_txid AND v.vout = vc.vtxo_vout
-GROUP BY v.txid, v.vout;
-
-CREATE VIEW intent_with_inputs_vw AS
-SELECT vtxo_vw.*,
-       intent.id,
-       intent.round_id,
-       intent.proof,
-       intent.message
-FROM intent
-LEFT OUTER JOIN vtxo_vw
-ON intent.id = vtxo_vw.intent_id;
-
 -- Backfill: Create a marker for every existing VTXO using its outpoint as marker ID
internal/infrastructure/db/sqlite/migration/20260210000000_add_depth_and_markers.up.sql (1)

22-41: Intermediate view recreation appears unused — backfill queries reference vtxo directly.

Same as the Postgres migration: the views created at lines 26-41 are dropped again at lines 96-97 without being referenced by the backfill statements (lines 45-62). They add migration complexity without benefit.

internal/infrastructure/db/postgres/marker_repo.go (2)

249-274: TOCTOU between count query and sweep insert in SweepVtxosByMarker.

CountUnsweptVtxosByMarkerId (line 260) and InsertSweptMarker (line 266) are not atomic. The returned count may not reflect the actual number of VTXOs affected by the sweep. Since the count is only used for logging/metrics, this isn't a correctness issue, but worth noting.
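If the count ever needs to be exact, both statements can share one transaction via the sqlc-generated WithTx helper. A rough sketch, assuming the repository keeps a *sql.DB next to its sqlc Queries value and that the generated signatures roughly match the guesses below (field, parameter, and struct names are assumptions):

tx, err := r.db.BeginTx(ctx, nil)
if err != nil {
	return 0, err
}
defer tx.Rollback()

q := r.querier.WithTx(tx) // sqlc-generated; runs both statements on the same tx
count, err := q.CountUnsweptVtxosByMarkerId(ctx, markerID)
if err != nil {
	return 0, err
}
sweptAt := time.Now().Unix() // or accept it as a parameter
// assumed params struct; match it to the actual generated InsertSweptMarkerParams
if err := q.InsertSweptMarker(ctx, queries.InsertSweptMarkerParams{
	MarkerID: markerID,
	SweptAt:  sweptAt,
}); err != nil {
	return 0, err
}
if err := tx.Commit(); err != nil {
	return 0, err
}
return count, nil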


426-436: Silent error swallowing in parseMarkersJSONB — consider logging.

Unmarshal errors at line 432 are silently swallowed. If corrupted marker JSON ends up in the database, this would silently produce nil marker IDs, making affected VTXOs invisible to marker-based queries. A debug-level log would aid troubleshooting without adding noise.
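For instance (sketch only; it assumes a logrus-style logger is already imported in this package, which may not be the case):

var markerIDs []string
if err := json.Unmarshal(markers.RawMessage, &markerIDs); err != nil {
	// assumption: "log" is github.com/sirupsen/logrus or the project's equivalent
	log.WithError(err).Debug("failed to parse markers JSONB, treating row as having no markers")
	return nil
}
return markerIDs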

internal/core/application/service_test.go (1)

562-567: outputs[0].MarkerIDs is re-sorted on every loop iteration.

The sort at line 563 mutates outputs[0].MarkerIDs in-place on each iteration. Move it before the loop.

♻️ Minor optimization
 			// All outputs must have the same marker IDs
+			sort.Strings(outputs[0].MarkerIDs)
 			for i := 1; i < len(outputs); i++ {
-				sort.Strings(outputs[0].MarkerIDs)
 				sort.Strings(outputs[i].MarkerIDs)
 				require.Equal(t, outputs[0].MarkerIDs, outputs[i].MarkerIDs,
 					"output %d has different markers than output 0", i)
 			}
internal/infrastructure/db/badger/vtxo_repo.go (2)

23-26: Duplicate accessors: GetStore() and Store() return the same value.

Both methods on Lines 23-26 and Lines 421-424 return r.store with identical signatures. Pick one and remove the other to avoid confusion about which to call.

Also applies to: 421-424


625-637: Redundant visited check in GetSweepableVtxosByCommitmentTxid.

Line 627 checks !visited[outpointKey], and Line 628 checks !seen on the same key. Since entries are only ever set to true, !visited[key] is true exactly when the key is absent, so the inner !seen check is always true when it is reached. Lines 633-635 (enqueue ArkTxid) are therefore unreachable for already-visited outpoints, which is the correct behavior, but the double-check obscures it. Compare with the cleaner pattern in GetAllChildrenVtxos (Lines 668-676).

Simplify to match GetAllChildrenVtxos pattern
 		for _, vtxo := range vtxos {
 			outpointKey := vtxo.Outpoint.String()
-			if !visited[outpointKey] {
-				if _, seen := visited[outpointKey]; !seen {
-					visited[outpointKey] = true
-					outpoints = append(outpoints, vtxo.Outpoint)
-				}
-
-				if vtxo.ArkTxid != "" {
-					queue = append(queue, vtxo.ArkTxid)
-				}
+			if !visited[outpointKey] {
+				visited[outpointKey] = true
+				outpoints = append(outpoints, vtxo.Outpoint)
+				if vtxo.ArkTxid != "" {
+					queue = append(queue, vtxo.ArkTxid)
+				}
 			}
 		}

@bitcoin-coder-bob
Collaborator Author

@coderabbitai review

@coderabbitai
Contributor

coderabbitai bot commented Feb 13, 2026

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 5

🤖 Fix all issues with AI agents
In `@internal/core/application/service_test.go`:
- Around line 655-666: The test name and comment are misleading because it
claims to exercise uint32 overflow near math.MaxUint32 but only uses depths
around 20000; either rename the subtest (e.g., "depth beyond 20k target remains
valid") and update the comment to match the 20k boundary checks, or add a true
overflow-edge case: create a parent Vtxo with Depth: math.MaxUint32-1, call
calculateMaxDepth([]domain.Vtxo{parent})+1 and assert the resulting uint32
behavior (wraps to 0) and verify domain.IsAtMarkerBoundary for relevant
wrap-around values; update assertions accordingly and keep references to
calculateMaxDepth, domain.Vtxo, and domain.IsAtMarkerBoundary so reviewers can
find the changes.

In `@internal/infrastructure/db/postgres/marker_repo.go`:
- Around line 39-53: The AddMarker function currently json.Marshal's
marker.ParentMarkerIDs which yields JSON "null" for a nil slice; to match
vtxo_repo.go behavior and store an empty array instead, ensure
marker.ParentMarkerIDs is converted to an empty slice when nil before marshaling
(e.g., if marker.ParentMarkerIDs == nil then use []string{} or the appropriate
typed empty slice), then marshal and pass the resulting bytes into the existing
pqtype.NullRawMessage for ParentMarkers in UpsertMarker; keep the rest of
AddMarker and the pqtype.NullRawMessage Valid handling unchanged.
- Around line 229-243: In markerRepository.UpdateVtxoMarkers the code passes
markersJSON ([]byte) to UpdateVtxoMarkersParams.Markers which is a
json.RawMessage named type; perform an explicit conversion by setting Markers:
json.RawMessage(markersJSON) so the types match and the query compiles at build
time.

In
`@internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.up.sql`:
- Around line 74-84: The view vtxo_vw uses string_agg(vc.commitment_txid, ',')
which returns NULL when no commitments; wrap that call in
COALESCE(string_agg(...), '') to return an empty string instead and update the
SELECT in vtxo_vw accordingly, and make the same COALESCE change to the
earlier/intermediate view that also uses string_agg so both views behave
consistently across backends; ensure the column alias remains commitments and
that GROUP BY and joins (vtxo, vtxo_commitment_txid) are unchanged.

In `@internal/infrastructure/db/service.go`:
- Around line 792-806: sweepVtxosWithMarkers currently sets totalSwept =
int64(len(vtxos)) unconditionally and discards partial progress when
markerStore.BulkSweepMarkers fails; update sweepVtxosWithMarkers to use an
actual swept count returned from markerStore.BulkSweepMarkers (change
BulkSweepMarkers signature to return (int64, error) if needed) and
accumulate/return the number of newly swept VTXOs instead of len(vtxos); handle
partial failures by returning the partial count and logging a warning that
includes the requested markerIDs, the number of markers successfully swept and
the error, and ensure symbols referenced are sweepVtxosWithMarkers,
markerStore.BulkSweepMarkers, uniqueMarkers/markerIDs, vtxos and sweptAt so
reviewers can locate the changes.
🧹 Nitpick comments (9)
internal/core/application/service_test.go (3)

210-211: Empty subtest names make failures hard to identify.

Each t.Run uses "". When a subtest fails, the output won't distinguish which case broke. Consider adding a name field to the test struct (as done in the other table-driven tests) and passing it to t.Run.

Proposed fix
 	testCases := []struct {
+		name               string
 		parentDepth        uint32
 		newDepth           uint32
 		shouldCreateMarker bool
 	}{
-		{99, 100, true},   // crossing into boundary
-		{100, 101, false}, // leaving boundary
-		{199, 200, true},  // crossing into next boundary
-		{0, 1, false},     // moving away from initial boundary
-		{98, 99, false},   // approaching but not at boundary
+		{"crossing into boundary", 99, 100, true},
+		{"leaving boundary", 100, 101, false},
+		{"crossing into next boundary", 199, 200, true},
+		{"moving away from initial boundary", 0, 1, false},
+		{"approaching but not at boundary", 98, 99, false},
 	}

 	for _, tc := range testCases {
-		t.Run("", func(t *testing.T) {
+		t.Run(tc.name, func(t *testing.T) {

224-264: Test helpers duplicate production logic — divergence risk.

calculateMaxDepth, collectParentMarkers, and deriveMarkerIDs re-implement service internals. If the service logic changes, these helpers won't break, silently making the tests stale. Consider either exporting thin wrappers from the service (or an internal test-support package) so the tests exercise the real code, or adding a comment/issue to keep them in sync.
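One low-friction variant of the first option is an export_test.go shim, so the tests call the real implementations without widening the public API. This sketch assumes the production code keeps these as unexported package-level functions with exactly these names; if the logic is currently inline in a method, extracting it is the prerequisite step:

// internal/core/application/export_test.go
package application

// Compiled only for tests: re-export the real helpers so *_test.go files
// exercise the production logic instead of a parallel copy.
var (
	CalculateMaxDepth    = calculateMaxDepth
	CollectParentMarkers = collectParentMarkers
	DeriveMarkerIDs      = deriveMarkerIDs
)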


548-572: Marker equality check is trivially true — all outputs share the same slice.

Line 554 assigns the same markerIDs slice to every output. The loop at lines 565-569 then compares outputs[0].MarkerIDs with outputs[i].MarkerIDs, which are the same underlying array — this will always pass regardless of content.

If the intent is to verify that independent copies are equal, assign a copy per output. If the intent is simply to document that the service applies one shared reference, the current code is fine but the loop adds no value over a single length check.

Also, sort.Strings(outputs[0].MarkerIDs) on line 566 is re-executed every iteration; move it before the loop.

Proposed fix — copy slice per output and sort once
 			outputs := make([]domain.Vtxo, tc.numOutputVtxos)
 			for i := 0; i < tc.numOutputVtxos; i++ {
+				ids := make([]string, len(markerIDs))
+				copy(ids, markerIDs)
 				outputs[i] = domain.Vtxo{
 					Outpoint:  domain.Outpoint{Txid: "tx-with-multiple-outputs", VOut: uint32(i)},
 					Depth:     newDepth,
-					MarkerIDs: markerIDs,
+					MarkerIDs: ids,
 				}
 			}

 			// All outputs must have the same depth
 			for i, v := range outputs {
 				require.Equal(t, tc.expectedDepth, v.Depth,
 					"output %d has wrong depth", i)
 			}

 			// All outputs must have the same marker IDs
+			sort.Strings(outputs[0].MarkerIDs)
 			for i := 1; i < len(outputs); i++ {
-				sort.Strings(outputs[0].MarkerIDs)
 				sort.Strings(outputs[i].MarkerIDs)
 				require.Equal(t, outputs[0].MarkerIDs, outputs[i].MarkerIDs,
 					"output %d has different markers than output 0", i)
 			}
internal/infrastructure/db/sqlite/sqlc/query.sql (1)

261-270: Liquidity queries use correlated NOT EXISTS with LIKE — potential performance concern at scale.

SelectExpiringLiquidityAmount and SelectRecoverableLiquidityAmount (Lines 273-279) both correlate every vtxo row against swept_marker using LIKE '%"' || sm.marker_id || '"%'. For large vtxo and swept_marker tables, this is effectively an O(vtxos × swept_markers) scan per query. This is acceptable for now given SQLite limitations, but worth keeping in mind as data grows.
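If this ever shows up in profiles, one option that stays within SQLite is to replace the string match with a set-based lookup through json_each. Sketch only, assuming the markers column is a JSON array of marker IDs and the JSON1 functions are available in the build:

-- replaces the LIKE correlation; v is the vtxo row being tested
AND NOT EXISTS (
    SELECT 1
    FROM json_each(v.markers) AS je
    JOIN swept_marker sm ON sm.marker_id = je.value
)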

internal/infrastructure/db/badger/marker_repo.go (2)

470-525: SweepVtxosByMarker filters by Swept=false then manually checks marker — use compound query instead.

Lines 476-494 load all unswept VTXOs then iterate to check if each has the target marker. This is more efficient than a full scan but still unnecessarily broad. Use badgerhold.Where("MarkerIDs").Contains(markerID).And("Swept").Eq(false) to push both filters to the store.

Also, the error swallowing at line 521 (_ = err) after the if err := is a no-op and reads confusingly. Simply drop the _ = err line.

♻️ Proposed fix
-	var allDtos []vtxoDTO
-	err := r.vtxoStore.Find(&allDtos, badgerhold.Where("Swept").Eq(false))
+	var allDtos []vtxoDTO
+	err := r.vtxoStore.Find(&allDtos,
+		badgerhold.Where("MarkerIDs").Contains(markerID).And("Swept").Eq(false))
 	if err != nil {
 		return 0, err
 	}
 
 	var count int64
 	for _, dto := range allDtos {
-		// Check if this VTXO has the markerID
-		hasMarker := false
-		for _, id := range dto.MarkerIDs {
-			if id == markerID {
-				hasMarker = true
-				break
-			}
-		}
-		if !hasMarker {
-			continue
-		}
-
 		// Update the vtxo's Swept field
 	if err := r.SweepMarker(ctx, markerID, time.Now().Unix()); err != nil {
-		// Non-fatal - the vtxos are already marked as swept
-		_ = err
+		// Non-fatal - the vtxos are already marked as swept
 	}

38-106: Constructor uses positional interface{} args — fragile API.

The variadic interface{} constructor with positional indexing (config[0], config[1], config[2]) is brittle and hard to extend. This pattern is used by other repos in the codebase, so it's consistent, but consider documenting the expected signatures more prominently or migrating to a typed config struct in a future refactor.
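If a refactor does happen, a typed options struct makes the contract self-documenting. Purely illustrative sketch; the field names and types below are guesses, not the actual contents of config[0..2]:

// hypothetical replacement for the positional ...interface{} constructor args
type markerStoreConfig struct {
	baseDir string            // badger data directory (assumed)
	logger  badger.Logger     // optional logger (assumed)
	store   *badgerhold.Store // reuse an already-open store (assumed)
}

Callers then construct the repository from named fields, and validation can report exactly which option is missing instead of failing on a positional type assertion.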

internal/infrastructure/db/postgres/migration/20260210100000_add_depth_and_markers.up.sql (2)

22-41: Intermediate view creation before backfill appears unnecessary.

Lines 22-41 create vtxo_vw and intent_with_inputs_vw, but the backfill at lines 43-63 queries vtxo v directly, not through the views. These views are then dropped again at lines 66-67. If no concurrent queries depend on the views during migration, this round-trip can be removed.

However, if views are expected to exist for other sessions during migration, keeping them is a safe choice.


74-84: vtxo_vw's swept computation with EXISTS ... @> jsonb_build_array(sm.marker_id) performs a scan of swept_marker per VTXO row.

The EXISTS (SELECT 1 FROM swept_marker sm WHERE v.markers @> jsonb_build_array(sm.marker_id)) checks if any marker in the VTXO's markers array has been swept. This is evaluated for every row when the view is queried. With the GIN index on markers, the @> containment check is efficient per-pair, but the directionality is inverted: the query iterates swept_marker and checks containment in v.markers. For a large swept_marker table, this could be slow.

An alternative that flips the direction, unnesting the VTXO's markers and probing the swept_marker primary key, would be:

EXISTS (
    SELECT 1 FROM jsonb_array_elements_text(v.markers) AS m(id)
    WHERE m.id IN (SELECT sm.marker_id FROM swept_marker sm)
)

This is not blocking but worth monitoring query plans as the swept_marker table grows.

internal/infrastructure/db/postgres/marker_repo.go (1)

374-448: rowToVtxoFromVtxoVw and parseMarkersJSONB duplicate vtxo_repo.go's rowToVtxo and parseMarkersJSONBFromVtxo.

These functions are near-identical to their counterparts in vtxo_repo.go (lines 492-526). Consider extracting shared mapping helpers to a common file (e.g., internal/infrastructure/db/postgres/helpers.go) to reduce duplication.

@bitcoin-coder-bob
Collaborator Author

@coderabbitai review

@coderabbitai
Contributor

coderabbitai bot commented Feb 13, 2026

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🤖 Fix all issues with AI agents
In `@internal/infrastructure/db/postgres/marker_repo.go`:
- Around line 265-290: Change SweepVtxosByMarker to accept a sweptAt int64
(matching SweepMarker, BulkSweepMarkers, SweepMarkerWithDescendants) and update
the domain.MarkerRepository interface plus SQLite and Badger implementations
accordingly; inside markerRepository.SweepVtxosByMarker use a single DB
transaction to perform CountUnsweptVtxosByMarkerId and InsertSweptMarker so the
count and insert are atomic (no race window), and pass the provided sweptAt into
InsertSweptMarker instead of calling time.Now().Unix(); ensure the tx is
committed/rolled back correctly and return the counted value from the same
transactional context.
🧹 Nitpick comments (5)
internal/core/application/service_test.go (3)

73-83: Test helpers duplicate production logic — tests won't catch service regressions.

calculateMaxDepth, collectParentMarkers, and deriveMarkerIDs are hand-rolled copies of the service's logic. If the service implementation drifts (e.g., a bug is introduced or the marker ID format changes), these tests will keep passing against their own copy. Consider either:

  1. Exporting the real helpers from the service (or extracting them into a shared internal package) and calling them directly in tests, or
  2. Writing integration-style tests that exercise the actual service method (e.g., updateProjectionsAfterOffchainTxEvents) with mocked dependencies.

This way the tests validate actual behavior rather than a parallel re-implementation.

Also applies to: 224-264


565-570: outputs[0].MarkerIDs is re-sorted on every loop iteration.

sort.Strings(outputs[0].MarkerIDs) mutates in-place and is called inside the loop for each i. Move it before the loop.

Proposed fix
+		sort.Strings(outputs[0].MarkerIDs)
 		for i := 1; i < len(outputs); i++ {
-			sort.Strings(outputs[0].MarkerIDs)
 			sort.Strings(outputs[i].MarkerIDs)

260-264: The nil return path (no parent markers, not at boundary) has no test coverage.

When parentMarkerIDs is empty and the depth is not at a boundary, deriveMarkerIDs returns (nil, nil). None of the current test cases exercise this branch. Consider adding a case in TestMarkerInheritanceAtNonBoundary with empty parent markers to cover it.

Example test case to add
{
    name:             "no parent markers at non-boundary depth",
    parentDepths:     []uint32{0},
    parentMarkerSets: [][]string{{}},
    expectedDepth:    1,
    expectedMarkers:  nil, // or []string(nil)
    description:      "child with no parent markers inherits nil",
},

Note: the assertion would need adjustment, since the current test sorts and compares with require.Equal, which distinguishes nil from an empty slice.

internal/infrastructure/db/postgres/marker_repo.go (2)

442-452: parseMarkersJSONB silently swallows unmarshal errors.

If the JSON in the markers column is corrupt, this returns nil with no indication of a problem. This could mask data-integrity issues. Consider returning an error or at minimum logging, similar to how rowToMarker (line 406) propagates unmarshal failures.

Option: propagate the error
-func parseMarkersJSONB(markers pqtype.NullRawMessage) []string {
+func parseMarkersJSONB(markers pqtype.NullRawMessage) ([]string, error) {
 	if !markers.Valid || len(markers.RawMessage) == 0 {
-		return nil
+		return nil, nil
 	}
 	var markerIDs []string
 	if err := json.Unmarshal(markers.RawMessage, &markerIDs); err != nil {
-		return nil
+		return nil, fmt.Errorf("failed to unmarshal markers JSONB: %w", err)
 	}
-	return markerIDs
+	return markerIDs, nil
 }

This would require updating rowToVtxoFromVtxoVw and rowToVtxoFromMarkerQuery to handle the error, but it prevents silent data corruption.


378-440: rowToVtxoFromMarkerQuery duplicates rowToVtxoFromVtxoVw field-by-field.

The only difference is that one accesses row.Field and the other row.VtxoVw.Field. Since SelectVtxosByMarkerIdRow embeds VtxoVw, you could delegate:

func rowToVtxoFromMarkerQuery(row queries.SelectVtxosByMarkerIdRow) domain.Vtxo {
	return rowToVtxoFromVtxoVw(row.VtxoVw)
}

This eliminates ~20 lines of duplicated mapping and ensures future field additions are handled in one place.

# Conflicts:
#	api-spec/protobuf/ark/v1/indexer.proto
#	api-spec/protobuf/ark/v1/types.proto
#	api-spec/protobuf/gen/ark/v1/indexer.pb.go
#	api-spec/protobuf/gen/ark/v1/indexer.pb.rgw.go
#	api-spec/protobuf/gen/ark/v1/types.pb.go
#	internal/core/application/utils.go
#	internal/core/domain/vtxo.go
#	internal/infrastructure/db/postgres/sqlc/queries/models.go
#	internal/infrastructure/db/postgres/sqlc/queries/query.sql.go
#	internal/infrastructure/db/postgres/sqlc/query.sql
#	internal/infrastructure/db/postgres/vtxo_repo.go
#	internal/infrastructure/db/service.go
#	internal/infrastructure/db/service_test.go
#	internal/infrastructure/db/sqlite/sqlc/queries/models.go
#	internal/infrastructure/db/sqlite/sqlc/queries/query.sql.go
#	internal/infrastructure/db/sqlite/sqlc/query.sql
#	internal/infrastructure/db/sqlite/vtxo_repo.go
#	internal/interface/grpc/handlers/indexer.go
#	internal/interface/grpc/handlers/parser.go
#	internal/test/e2e/e2e_test.go
#	internal/test/e2e/utils_test.go
@bitcoin-coder-bob changed the title from Bob/dag 1 to Scale the DAG on Feb 17, 2026
bitcoin-coder-bob and others added 2 commits February 18, 2026 16:20
Resolve merge conflicts: formatting differences in function arguments
and combine branch's Depth/Markers fields with master's changes in
vtxo_repo.go.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>