After the issue with false-positive alarms, Vladi found , that in our log entries it's hard to find any entries, why MP raised an alert.
See example:
Here only 1 log record only about component.
What we should know from the logs:
- What metric was failed?
- What other parameters for the failed component.
- All possible calculation for it.