[WIP] Crash resilience ProfOfConcept implementation on MS SQL #4

MaxShoshin · 2020-06-09T10:34:08Z

No description provided.

Allow to run integration tests locally

MaxShoshin · 2020-06-09T11:12:35Z

The basic idea: after events are successfully published, write a log row recording the range of published events (min version, max version, aggregateId).
It also requires a background process (in the application; not implemented here) that periodically calls IPublishVerificator to verify that events committed to IEventPersistence were successfully published. Log rows are removed after successful verification. To avoid comparing against the whole event store, this process stores the last verified GlobalPosition.

This implementation is agnostic to the IEventPersistence in use.
Currently it uses MS SQL for log storage, but it can easily be extended to any SQL-like storage. I'm not yet sure about the EntityFramework and Mongo storages...

This implementation is fully separate from the existing code, so there are no breaking changes.
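
A minimal in-memory sketch of the verification idea described above, not the PR's actual code. `CommittedEvent`, `PublishLogRow` and `VerificationSketch` are hypothetical names, and the global position is modelled as a `long` purely for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for an event committed to the event store.
public sealed record CommittedEvent(long GlobalPosition, string AggregateId, int Version);

// One publish-log row: the range of versions published for a single aggregate.
public sealed record PublishLogRow(string AggregateId, int MinVersion, int MaxVersion)
{
    public bool Covers(CommittedEvent e) =>
        e.AggregateId == AggregateId && e.Version >= MinVersion && e.Version <= MaxVersion;
}

public static class VerificationSketch
{
    // Returns committed events after lastVerifiedPosition that no log row covers,
    // i.e. events that were stored but apparently never published.
    public static IReadOnlyList<CommittedEvent> FindUnpublished(
        IEnumerable<CommittedEvent> committed,
        IReadOnlyCollection<PublishLogRow> log,
        long lastVerifiedPosition)
    {
        return committed
            .Where(e => e.GlobalPosition > lastVerifiedPosition)
            .Where(e => !log.Any(row => row.Covers(e)))
            .ToList();
    }
}
```

In this sketch the background process would call `FindUnpublished` periodically, hand the result to a recovery step, and then advance the stored position and drop the verified log rows.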

MaxShoshin · 2020-06-09T14:02:37Z

@leotsarev, could you review and discuss?

Source/EventFlow.MsSql/ReliablePublish/MsSqlPublishVerificator.cs

Source/EventFlow/ReadStores/ReadModelEventHelper.cs

Source/EventFlow/Subscribers/DomainEventPublisher.cs

Source/EventFlow/Subscribers/DispatchToEventSubscribers.cs

leotsarev · 2020-06-17T14:09:58Z

Source/EventFlow/PublishRecovery/IReadModelRecoveryHandler.cs

+using EventFlow.Aggregates;
+using EventFlow.ReadStores;
+
+namespace EventFlow.PublishRecovery


I think it would be WAY easier if we could just resolve IRecoveryHandler<TReadModel> from the resolver.

leotsarev · 2020-06-17T16:29:24Z

Source/EventFlow.MsSql/ReliablePublish/MsSqlReliablePublishPersistence.cs

+
+            await _msSqlConnection.ExecuteAsync(
+                    Label.Named("publishlog-commit"),
+                    CancellationToken.None, // Unable to Cancel


Why can't we cancel here? Callers that don't have a cancellation token can pass None.

Why would you want to cancel writing the info about successful publishing to side effects?

The same question applies to MarkVerifiedAsync (and basically every update), doesn't it? Let's be consistent.
I think "should I be allowed to cancel" is not for persistence to decide...
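
The suggestion above can be sketched as follows, assuming a simplified persistence interface. `IPublishLogWriter` and `InMemoryPublishLogWriter` are hypothetical names and do not reproduce the interface in the PR; the point is only that the token is a parameter and the caller decides what to pass.

```csharp
using System.Threading;
using System.Threading.Tasks;

// Hypothetical, simplified persistence interface that accepts a token.
public interface IPublishLogWriter
{
    Task MarkPublishedAsync(
        string aggregateId, int minVersion, int maxVersion,
        CancellationToken cancellationToken);
}

public sealed class InMemoryPublishLogWriter : IPublishLogWriter
{
    public int Rows { get; private set; }

    public Task MarkPublishedAsync(
        string aggregateId, int minVersion, int maxVersion,
        CancellationToken cancellationToken)
    {
        // Persistence honours whatever token it is given.
        if (cancellationToken.IsCancellationRequested)
        {
            return Task.FromCanceled(cancellationToken);
        }

        Rows++;
        return Task.CompletedTask;
    }
}
```

A caller that must not be interrupted passes `CancellationToken.None`, for example `await writer.MarkPublishedAsync("order-1", 1, 3, CancellationToken.None);`.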

leotsarev · 2020-06-18T18:18:56Z

Source/EventFlow/PublishRecovery/IReadModelRecoveryHandler.cs

+{
+    public interface IReadModelRecoveryHandler
+    {
+        Task RecoverFromShutdownAsync(


Why are the signatures inconsistent? I think it's fine to just have "if you throw, you failed to recover". I don't like an API that tells us "we could be fallible", because we want users to aim for an infallible API ;-)

leotsarev · 2020-06-18T18:21:54Z

Source/EventFlow/PublishRecovery/IReadModelRecoveryHandler.cs

+            IReadOnlyCollection<IDomainEvent> eventsForRecovery,
+            CancellationToken cancellationToken);
+
+        Task<bool> RecoverFromErrorAsync(


Also, I highly doubt that we really need two distinct methods. Maybe a parameter like "RecoveryReason" would fit better, because people will usually want to implement both methods the same way.
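
One possible shape for the two suggestions on this interface (a single method with a reason parameter, and failure signalled by throwing), written as a standalone sketch. `RecoveryReason`, `IRecoveryHandlerSketch<TEvent>` and `ReplayingHandler` are illustrative names, not types from the PR, and events are plain strings here.

```csharp
using System.Collections.Generic;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical enum replacing the two separate methods.
public enum RecoveryReason
{
    Shutdown,
    Error,
}

public interface IRecoveryHandlerSketch<TEvent>
{
    // No bool result: if this throws, recovery failed.
    Task RecoverAsync(
        RecoveryReason reason,
        IReadOnlyCollection<TEvent> eventsForRecovery,
        CancellationToken cancellationToken);
}

public sealed class ReplayingHandler : IRecoveryHandlerSketch<string>
{
    public List<string> Replayed { get; } = new List<string>();

    public Task RecoverAsync(
        RecoveryReason reason,
        IReadOnlyCollection<string> eventsForRecovery,
        CancellationToken cancellationToken)
    {
        // Both reasons are handled the same way in this sketch.
        Replayed.AddRange(eventsForRecovery);
        return Task.CompletedTask;
    }
}
```

An implementation that does need to distinguish the two cases can still branch on `reason`.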

leotsarev · 2020-06-18T18:23:10Z

Source/EventFlow/PublishRecovery/PublishVerificator.cs

+
+namespace EventFlow.PublishRecovery
+{
+    public sealed class PublishVerificator : IPublishVerificator


I haven't reviewed this file because I hope to do it later in a separate review.

leotsarev · 2020-06-18T18:27:55Z

Source/EventFlow/PublishRecovery/IReliableMarkProcessor.cs

+{
+    public interface IReliableMarkProcessor
+    {
+        Task MarkEventsPublishedAsync(IReadOnlyCollection<IDomainEvent> domainEvents);


I don't think we actually need this abstraction now.
Only two reasons for it remain:

To inject a NOP implementation (easily done by injecting a NOP persistence instead)

To group domain events by aggregate (see my persistence comment)
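
The first point, a NOP persistence, could look roughly like this. The interface is a simplified, hypothetical stand-in (events are identified by plain strings) and is not the PR's IReliablePublishPersistence.

```csharp
using System.Collections.Generic;
using System.Threading.Tasks;

// Hypothetical, simplified persistence contract.
public interface IPublishMarkPersistenceSketch
{
    Task MarkPublishedAsync(IReadOnlyCollection<string> eventIds);
}

// Null-object implementation: accepts every call and stores nothing,
// so callers need no extra abstraction to switch the feature off.
public sealed class NoPublishMarkPersistence : IPublishMarkPersistenceSketch
{
    public Task MarkPublishedAsync(IReadOnlyCollection<string> eventIds) => Task.CompletedTask;
}
```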

leotsarev · 2020-06-18T18:30:08Z

Source/EventFlow.MsSql/ReliablePublish/MsSqlReliablePublishPersistence.cs

+            _msSqlConnection = msSqlConnection;
+        }
+
+        public async Task MarkPublishedAsync(IIdentity aggregateIdentity, IReadOnlyCollection<IDomainEvent> domainEvents)


I think that by taking the aggregateIdentity parameter you are leaking your implementation into the interface.
It can be determined from domainEvents, and some other implementations could be fine with marking events from different aggregates in one round trip.

However, I don't think it will be easy to remove this from the interface completely.

It's a kind of 'guarantee' that all domain events relate to one aggregate...

Why do we need this kind of guarantee?

leotsarev · 2020-06-18T18:30:58Z

Source/EventFlow.MsSql/ReliablePublish/MsSqlReliablePublishPersistence.cs

+            var item = new PublishLogItem
+            {
+                AggregateId = aggregateIdentity.Value,
+                MinAggregateSequenceNumber = domainEvents.Min(x => x.AggregateSequenceNumber),


We have an assumption here that the events are sequential. I think that's fine, but we should probably throw if the assumption fails.
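
A sketch of such a check, under the assumption that "sequential" means contiguous sequence numbers within a single aggregate. `EventRef` and `PublishRangeSketch` are hypothetical names used only for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for the parts of a domain event the log needs.
public sealed record EventRef(string AggregateId, int AggregateSequenceNumber);

public static class PublishRangeSketch
{
    // Builds the (aggregate, min, max) range for a log row and throws
    // if the events do not form one contiguous run for one aggregate.
    public static (string AggregateId, int Min, int Max) ToRange(
        IReadOnlyCollection<EventRef> events)
    {
        if (events.Count == 0)
        {
            throw new ArgumentException("No events to mark.", nameof(events));
        }

        var aggregateId = events.First().AggregateId;
        if (events.Any(e => e.AggregateId != aggregateId))
        {
            throw new ArgumentException("Events belong to more than one aggregate.", nameof(events));
        }

        var ordered = events.Select(e => e.AggregateSequenceNumber).OrderBy(n => n).ToList();
        for (var i = 1; i < ordered.Count; i++)
        {
            if (ordered[i] != ordered[i - 1] + 1)
            {
                throw new ArgumentException("Sequence numbers are not contiguous.", nameof(events));
            }
        }

        return (aggregateId, ordered[0], ordered[ordered.Count - 1]);
    }
}
```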

leotsarev · 2020-06-18T18:38:51Z

Source/EventFlow.MsSql/ReliablePublish/MsSqlReliablePublishPersistence.cs

+                logItems);
+        }
+
+        public async Task MarkVerifiedAsync(


Same here: we are leaking the implementation detail of having EventFlowPublishVerifyState into the interface.
Maybe we could just call MarkPublishedAsync from the verificator and have some kind of Compact method that is an implementation detail and could be MS SQL specific.

leotsarev · 2020-06-18T18:40:18Z

Source/EventFlow.MsSql/ReliablePublish/MsSqlReliablePublishPersistence.cs

+            GlobalPosition newVerifiedPosition,
+            CancellationToken cancellationToken)
+        {
+            await _msSqlConnection.ExecuteAsync(


I'm starting to lose track of why having no transaction here is okay. Maybe we need to add
WHERE LastVerifiedPosition < @LastVerifiedPosition to be sure.
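
The effect of that guard, sketched in memory rather than in SQL: the stored position only ever moves forward, so a late or duplicate update cannot move it back. `VerifyStateSketch` is a hypothetical class, and the position is assumed to be a comparable number here, which may not match how GlobalPosition is actually stored.

```csharp
using System.Threading;

public sealed class VerifyStateSketch
{
    private long _lastVerifiedPosition;

    public long LastVerifiedPosition => Interlocked.Read(ref _lastVerifiedPosition);

    // In-memory analogue of an UPDATE guarded by
    // "WHERE LastVerifiedPosition < @NewPosition":
    // returns false and changes nothing when the update would not advance.
    public bool TryAdvance(long newPosition)
    {
        while (true)
        {
            var current = Interlocked.Read(ref _lastVerifiedPosition);
            if (newPosition <= current)
            {
                return false;
            }

            // Retry if another thread changed the value in between.
            if (Interlocked.CompareExchange(ref _lastVerifiedPosition, newPosition, current) == current)
            {
                return true;
            }
        }
    }
}
```

With the SQL guard, the equivalent signal would be the affected-row count of the update: zero rows means another verifier got there first.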

leotsarev · 2020-06-18T18:43:01Z

Source/EventFlow/PublishRecovery/IReliablePublishPersistence.cs

+
+namespace EventFlow.PublishRecovery
+{
+    public interface IReliablePublishPersistence


See comments in implementation about this interface.

leotsarev · 2020-06-18T18:44:50Z

Source/EventFlow/PublishRecovery/IReliablePublishPersistence.cs

+{
+    public interface IReliablePublishPersistence
+    {
+        Task MarkPublishedAsync(IIdentity aggregateIdentity, IReadOnlyCollection<IDomainEvent> domainEvents);


Also, we probably need a method to determine the verification watermark for a given aggregate, to prevent publishing events out of order for an aggregate. And yes, this is where a separate persistence for the publish mark adds +1 read per command.

I don't think we have to implement out-of-order prevention right away (I'd rather not), but if we are going to publish separate PRs, we need to include this in the interface from the first shot. The thing with interfaces is that you don't get a second chance to add another method.
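
A sketch of what such a watermark method might look like. The method name `GetPublishedWatermarkAsync`, the interface and the in-memory store are all assumptions for illustration, and the check assumes sequence numbers start at 1.

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical interface that includes the watermark read from the start.
public interface IPublishWatermarkStore
{
    Task<int> GetPublishedWatermarkAsync(string aggregateId, CancellationToken cancellationToken);

    Task MarkPublishedAsync(
        string aggregateId, int minVersion, int maxVersion,
        CancellationToken cancellationToken);
}

public sealed class InMemoryPublishWatermarkStore : IPublishWatermarkStore
{
    private readonly ConcurrentDictionary<string, int> _watermarks =
        new ConcurrentDictionary<string, int>();

    public Task<int> GetPublishedWatermarkAsync(string aggregateId, CancellationToken cancellationToken)
    {
        // 0 means nothing has been published for this aggregate yet.
        return Task.FromResult(_watermarks.TryGetValue(aggregateId, out var version) ? version : 0);
    }

    public Task MarkPublishedAsync(
        string aggregateId, int minVersion, int maxVersion,
        CancellationToken cancellationToken)
    {
        // Keep the highest published version per aggregate.
        _watermarks.AddOrUpdate(aggregateId, maxVersion, (_, current) => Math.Max(current, maxVersion));
        return Task.CompletedTask;
    }
}

public static class OrderingCheck
{
    // The extra read per command: publish only if the batch
    // starts right after the aggregate's current watermark.
    public static async Task<bool> CanPublishAsync(
        IPublishWatermarkStore store, string aggregateId, int minVersion,
        CancellationToken cancellationToken)
    {
        var watermark = await store.GetPublishedWatermarkAsync(aggregateId, cancellationToken);
        return minVersion == watermark + 1;
    }
}
```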

MaxShoshin and others added 2 commits May 29, 2020 14:50

Merge pull request #1 from FortisOnline/local-integration-tests

7ea54c0

Allow to run integration tests locally

[WIP] Crash resilience ProfOfConcept implementation on MS SQL

b7b7a80

MaxShoshin marked this pull request as ready for review June 9, 2020 10:34

MaxShoshin requested a review from leotsarev June 9, 2020 14:02

MaxShoshin commented Jun 9, 2020

Source/EventFlow.MsSql/ReliablePublish/MsSqlPublishVerificator.cs

Maxim Shoshin added 3 commits June 10, 2020 12:51

Select part of publish log outside of verification transaction

1c0c9ef

Extract ReliablePublishPersistence

e9fc2ed

Introducing IRecoveryHandlers

f0a36a2

leotsarev reviewed Jun 10, 2020

Source/EventFlow/ReadStores/ReadModelEventHelper.cs

Create explicit method ReadModelEventHelper.CheckReadModel

5b58b52

MaxShoshin force-pushed the crash-resilience-poc branch from 6b201d5 to 5b58b52 Compare June 11, 2020 07:40

Introduce RecovertyHandlers

98c882b

leotsarev reviewed Jun 17, 2020

MaxShoshin force-pushed the develop branch from 5305406 to d6a0d1f Compare June 18, 2020 09:07

Maxim Shoshin added 2 commits June 18, 2020 17:09

Redesign to use IReadModelRecoveryHandler<TReadModel>

24e4e33

Revert unnecessary changes

8bb53e2

leotsarev approved these changes Jun 18, 2020

leotsarev mentioned this pull request Jun 24, 2020

Crash resilience eventflow/EventFlow#439

Closed

Small refactoring

db6129b
