Avoid endless restart loop

Right now, if the Limbo process crashes, ECS is configured to restart it. If the crash is deterministic (for example, bad credentials or bad code was pushed), ECS will keep restarting it indefinitely, which creates a large number of log streams in CloudWatch Logs (among other problems). I've poked around the ECS documentation and don't see any configuration setting that tells it to stop restarting after too many tries. We could probably write a Lambda function that reads the event stream coming off of ECS and looks for this, although it would need to keep state across events to do so.
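As a rough illustration of the Lambda idea, here is a minimal sketch of the detection logic. It assumes the function is subscribed to "ECS Task State Change" events and that each event carries `lastStatus`, `group`, and `taskArn` in its `detail`. The class name, the thresholds, and the in-memory state are hypothetical; a real Lambda would most likely need an external store, since state held in memory only survives while the execution environment stays warm. It also counts every stopped task, so normal deployments would need to be excluded somehow.

```python
import time
from collections import defaultdict

# Hypothetical thresholds; tune for the service.
MAX_STOPS = 5
WINDOW_SECONDS = 600


class RestartLoopDetector:
    """Counts distinct stopped tasks per ECS service group in a sliding window."""

    def __init__(self, max_stops=MAX_STOPS, window_seconds=WINDOW_SECONDS):
        self.max_stops = max_stops
        self.window_seconds = window_seconds
        # group -> {taskArn: time first seen STOPPED}. In-memory for
        # illustration only; not durable across Lambda execution environments.
        self._stops = defaultdict(dict)

    def observe(self, event, now=None):
        """Return the group name if it looks stuck in a restart loop, else None."""
        if event.get("detail-type") != "ECS Task State Change":
            return None
        detail = event.get("detail", {})
        group = detail.get("group", "")
        # Only tasks launched by a service are of interest here.
        if detail.get("lastStatus") != "STOPPED" or not group.startswith("service:"):
            return None
        now = time.time() if now is None else now
        stops = self._stops[group]
        # A task can report STOPPED more than once; count it a single time.
        stops.setdefault(detail.get("taskArn"), now)
        # Forget stops that have fallen out of the window.
        for arn in [a for a, t in stops.items() if now - t > self.window_seconds]:
            del stops[arn]
        return group if len(stops) >= self.max_stops else None


_detector = RestartLoopDetector()


def lambda_handler(event, context):
    """Entry point sketch: reports a suspected loop; the reaction is left out."""
    looping = _detector.observe(event)
    if looping:
        print(f"Possible restart loop in {looping}")
    return {"looping_group": looping}
```

What to do once a loop is detected (for example, scaling the service down or raising an alert) is deliberately omitted here, since that depends on how Limbo is deployed.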

Avoid endless restart loop #49

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Avoid endless restart loop #49

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions