
feat: Horizontally scalable SSE Events#303

Open
86667 wants to merge 3 commits into main from horiz_scalable_sse

Conversation

Contributor

@86667 86667 commented Feb 6, 2026

Use Postgres's NOTIFY/LISTEN to propagate events across Homeserver instances.

Changes

  • Add notify_event() to send events via pg_notify
  • Add PgEventListener background service that subscribes to Postgres notifications and forwards events to the local broadcast channel
  • Add serde support for EventEntity and related types

How it works

  1. File write commits to Postgres
  2. notify_event() sends pg_notify('events')
  3. All instances' PgEventListener receive the notification
  4. Each instance broadcasts to its local SSE subscribers
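The four steps above can be sketched as a dependency-free simulation. All names here are hypothetical stand-ins: the real step 2→3 hop is Postgres NOTIFY delivered to each instance's `PgEventListener`, and the per-instance channel is a tokio broadcast channel rather than `mpsc`:

```rust
use std::sync::mpsc;

// Hypothetical stand-in for the serialized EventEntity payload.
#[derive(Clone, Debug, PartialEq)]
pub struct Event {
    pub id: i64,
    pub path: String,
}

/// Simulates the NOTIFY fan-out: every instance's listener receives a copy
/// of the event and forwards it to that instance's local SSE subscribers.
/// Returns how many instances were notified.
pub fn fan_out(event: &Event, instances: &[mpsc::Sender<Event>]) -> usize {
    let mut delivered = 0;
    for listener in instances {
        // In the real system this hop is pg_notify('events', payload).
        if listener.send(event.clone()).is_ok() {
            delivered += 1;
        }
    }
    delivered
}
```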

@86667 86667 force-pushed the horiz_scalable_sse branch from 5e2807b to 252121b Compare February 19, 2026 09:49
@86667 86667 force-pushed the horiz_scalable_sse branch from 252121b to 8d20216 Compare February 23, 2026 10:39
@86667 86667 force-pushed the horiz_scalable_sse branch from 8d20216 to 7ad8df7 Compare February 23, 2026 10:43
@86667 86667 requested review from SHAcollision and dzdidi February 23, 2026 15:38
@86667 86667 linked an issue Feb 24, 2026 that may be closed by this pull request
Contributor

@SHAcollision SHAcollision left a comment


Very nice work. I left some comments; most are just nits, others might be more relevant, but I may also have misunderstood something. Good job!!

})?;
self.events_service.broadcast_event(event);
self.events_service
    .notify_event(&event, self.db.pool())


It seems we switched from broadcast_event to notify_event only, but notify_event is best-effort and currently just logs on failure. If the call fails (DB hiccup, payload issue), local SSE clients on this instance will also miss the event permanently. Can we either (a) keep a local broadcast fast-path plus cross-instance dedupe, or (b) add an explicit catch-up mechanism to guarantee no gaps for connected subscribers?
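One way to realize option (a) is a small dedupe guard, sketched below with hypothetical names: broadcast locally right away as a fast path, and have the Postgres listener skip event ids this instance has already delivered:

```rust
use std::collections::HashSet;

/// Hypothetical dedupe guard keyed by event id. A real implementation would
/// prune old ids (e.g. a bounded ring of recent ids) to cap memory use.
#[derive(Default)]
pub struct DedupeGuard {
    seen: HashSet<i64>,
}

impl DedupeGuard {
    /// Returns true the first time an event id is observed; the Postgres
    /// listener would only forward events for which this returns true,
    /// since the local fast path already delivered the rest.
    pub fn first_delivery(&mut self, event_id: i64) -> bool {
        self.seen.insert(event_id)
    }
}
```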

Comment on lines +117 to +127
if let Err(e) = sqlx::query(PG_NOTIFY_QUERY)
    .bind(&payload)
    .execute(pool)
    .await
{
    tracing::error!(
        event_id = event.id,
        path = %event.path,
        "Failed to send NOTIFY: {}", e
    );
}


Error handling here logs and returns `()`, so failed NOTIFYs are silently dropped from the live stream path. Not sure what's best; maybe we could return a `Result` to callers and define a recovery policy (retry/backoff, fallback local broadcast, or durable catch-up)? This could introduce a reliability regression under transient failures.
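A minimal sketch of such a recovery policy, with hypothetical names and blocking std sleeps for brevity (a real implementation would use `tokio::time::sleep` and surface the final `Err` so the caller can fall back to a local broadcast or durable catch-up):

```rust
use std::time::Duration;

/// Retries a fallible operation up to `max_attempts` times with exponential
/// backoff, returning the last error if every attempt fails.
pub fn retry_with_backoff<E>(
    max_attempts: u32,
    base_delay: Duration,
    mut op: impl FnMut() -> Result<(), E>,
) -> Result<(), E> {
    let mut attempt = 0;
    loop {
        match op() {
            Ok(()) => return Ok(()),
            Err(e) => {
                attempt += 1;
                if attempt >= max_attempts {
                    // Caller decides the fallback (local broadcast, catch-up).
                    return Err(e);
                }
                // Exponential backoff: base, 2*base, 4*base, ...
                std::thread::sleep(base_delay * 2u32.pow(attempt - 1));
            }
        }
    }
}
```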

Comment on lines +40 to +51
/// Main loop that handles reconnection on errors.
async fn listen_loop(pool: PgPool, events_service: EventsService) {
    loop {
        match Self::run_listener(&pool, &events_service).await {
            Ok(()) => break, // Clean shutdown (should not happen in normal operation)
            Err(e) => {
                tracing::error!("PgListener error: {}. Reconnecting in 1s...", e);
                tokio::time::sleep(Duration::from_secs(1)).await;
            }
        }
    }
}


Reconnect loop is good, but LISTEN/NOTIFY is not durable: events emitted while disconnected are lost. Can we add gap recovery on reconnect (e.g., resume from last delivered event cursor/ID via DB query) so subscribers don’t permanently miss events during reconnect windows?
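A sketch of the cursor idea, with an in-memory `Vec` standing in for the events table (hypothetical names; the real catch-up would be a SQL query ordered by id, run before re-entering the LISTEN loop):

```rust
/// Hypothetical stand-in for a row in the events table.
#[derive(Clone, Debug, PartialEq)]
pub struct Event {
    pub id: i64,
    pub path: String,
}

/// Stand-in for `SELECT ... FROM events WHERE id > $1 ORDER BY id`:
/// replays everything newer than the last delivered event, so a listener
/// that reconnects can close the gap before resuming live notifications.
pub fn events_after(store: &[Event], last_seen_id: i64) -> Vec<Event> {
    store.iter().filter(|e| e.id > last_seen_id).cloned().collect()
}
```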

/// creates separate ephemeral databases per homeserver. Instead, we instantiate
/// only the EventsService + PgEventListener components sharing a single db pool.
#[tokio::test]
#[pubky_test_utils::test]


Could we add a test that intentionally drops/restarts one listener while events are being produced, then verifies the restarted instance catches up without gaps? The current tests do not seem to cover reconnect-window loss behavior.

Comment on lines +16 to +19
pub(crate) const PG_NOTIFY_CHANNEL: &str = "events";

/// SQL query to send a NOTIFY event.
const PG_NOTIFY_QUERY: &str = "SELECT pg_notify('events', $1)";


Minor maintainability nit: channel name is duplicated (PG_NOTIFY_CHANNEL + hardcoded 'events' in SQL). Could we build the SQL from the constant (or centralize both in one place) to avoid config drift?
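One possible shape for that fix, derived from the snippet above (building the query at startup is a suggestion, not the PR's implementation; `pg_notify` takes the channel as an ordinary text argument, so interpolating the constant here is safe as long as the constant is a trusted identifier):

```rust
const PG_NOTIFY_CHANNEL: &str = "events";

/// Builds the NOTIFY query from the single channel constant, so the channel
/// name used by LISTEN and the one embedded in the SQL cannot drift apart.
/// `$1` remains the bound payload parameter.
fn pg_notify_query(channel: &str) -> String {
    format!("SELECT pg_notify('{}', $1)", channel)
}
```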

Comment on lines +70 to +74
tracing::error!(
"Failed to deserialize event notification: {}. Payload: {}",
e,
notification.payload()
);


Super nit: logging the full raw payload on deserialize failure may leak user path/public key data into logs. We could truncate the payload excerpt and log structured metadata instead.
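A small helper along those lines (hypothetical name): cap the logged excerpt at a byte budget without splitting a multi-byte UTF-8 character, so the log still carries enough to debug without dumping the whole payload:

```rust
/// Returns at most `max_bytes` of `payload` for logging, backing up to the
/// nearest char boundary so the slice is always valid UTF-8.
pub fn truncate_for_log(payload: &str, max_bytes: usize) -> &str {
    if payload.len() <= max_bytes {
        return payload;
    }
    let mut end = max_bytes;
    while !payload.is_char_boundary(end) {
        end -= 1;
    }
    &payload[..end]
}
```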

Comment on lines +109 to +115
if payload.len() > PG_NOTIFY_WARN_THRESHOLD {
    tracing::warn!(
        event_id = event.id,
        payload_size = payload.len(),
        "Event payload size exceeds warning threshold. pg_notify has 8KB limit."
    );
}


NIT: nice warning threshold. Could we also log the hard-limit context in structured fields (payload_size, payload_limit=8192) and maybe emit a distinct event name for easier alerting/filtering?
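One way to do that is to separate the decision from the logging call, so the warn site can emit distinct structured fields and a filterable event name. The 8000-byte cap is Postgres's default NOTIFY payload limit; the threshold constant here is an assumed value, not the PR's:

```rust
/// Postgres rejects NOTIFY payloads of 8000 bytes or more in a default build.
const PG_NOTIFY_PAYLOAD_LIMIT: usize = 8000;
/// Hypothetical warning threshold, below the hard limit.
const PG_NOTIFY_WARN_THRESHOLD: usize = 6000;

/// Returns `Some((payload_size, payload_limit))` when the payload warrants a
/// warning, ready to be attached as structured fields to a named log event.
fn oversized_payload_fields(payload_len: usize) -> Option<(usize, usize)> {
    (payload_len > PG_NOTIFY_WARN_THRESHOLD)
        .then(|| (payload_len, PG_NOTIFY_PAYLOAD_LIMIT))
}
```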



Development

Successfully merging this pull request may close these issues.

SSE broadcast channel is not horizontally scalable
