Description
Hi @nook24
This morning I ran into an issue while trying to put some servers into maintenance. The job was submitted, but the maintenance never started. On further investigation, I found that the statusengine service had reported the following error a couple of hours earlier:
Dec 20 03:28:36 broker1 StatusengineWorker.php[5006]: statusengine-worker[26979]: [2006] MySQL server has gone away
Dec 20 03:28:36 broker1 statusengine-worker[26979]: [2006] MySQL server has gone away
The connection was only restored after I restarted the statusengine service. Below are the logs from the restart (just FYI):
Dec 20 05:14:11 broker1 systemd[1]: statusengine.service: State 'stop-sigterm' timed out. Killing.
Dec 20 05:14:11 broker1 systemd[1]: statusengine.service: Killing process 5006 (StatusengineWor) with signal SIGKILL.
Dec 20 05:14:11 broker1 systemd[1]: statusengine.service: Main process exited, code=killed, status=9/KILL
Dec 20 05:14:11 broker1 systemd[1]: statusengine.service: Failed with result 'timeout'.
I also see a similar entry from yesterday, which seems to have resolved on its own. Logs are below:
Dec 19 13:04:08 broker1 StatusengineWorker.php[5006]: statusengine-worker[5045]: [2006] MySQL server has gone away
Dec 19 13:04:08 broker1 statusengine-worker[5045]: [2006] MySQL server has gone away
Dec 19 13:20:37 broker1 StatusengineWorker.php[5006]: statusengine-worker[5006]: SQLSTATE[HY000] [2002] Connection timed out
Dec 19 13:20:37 broker1 StatusengineWorker.php[5006]: statusengine-worker[5006]: SQLSTATE[HY000] [2002] Connection timed out
Dec 19 13:20:37 broker1 statusengine-worker[5006]: SQLSTATE[HY000] [2002] Connection timed out
Dec 19 13:20:37 broker1 statusengine-worker[5006]: SQLSTATE[HY000] [2002] Connection timed out
Dec 19 14:20:44 broker1 statusengine-worker[26984]: [1213] Deadlock found when trying to get lock; try restarting transaction
Dec 19 14:20:44 broker1 StatusengineWorker.php[5006]: statusengine-worker[26984]: [1213] Deadlock found when trying to get lock; try restarting transaction
Dec 19 15:39:33 broker1 StatusengineWorker.php[5006]: statusengine-worker[5006]: Execute external command (via Query Handler): [1545233967] SCHEDULE_HOST_DOWNTIME;HOST1;1545233966;1702913966;1;0;157680000;user1;“NOT RELEASED TO OPERATIONS”
My questions:
- Is there a recovery mechanism for such a scenario? If yes, could you shed some light on it?
- I can set up some kind of external monitoring that checks the service state and restarts the service if a MySQL error is seen, but is this something that you see as a potential issue?
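To make the external-monitoring idea concrete, here is a rough Python sketch of what I have in mind. It only looks for the `[2006] MySQL server has gone away` text from the logs above and restarts the `statusengine.service` unit seen in the systemd output. The function names and the `threshold` parameter are my own placeholders, not anything provided by Statusengine, and a real check would probably need a smarter rule, since yesterday's occurrence recovered without a restart.

```python
import re
import subprocess

# Error text copied from the journal lines quoted in this issue.
GONE_AWAY = re.compile(r"\[2006\] MySQL server has gone away")


def count_gone_away(lines):
    """Count log lines that contain the MySQL 2006 error."""
    return sum(1 for line in lines if GONE_AWAY.search(line))


def should_restart(lines, threshold=1):
    """Hypothetical policy: restart once the error shows up `threshold` times."""
    return count_gone_away(lines) >= threshold


def restart_service(unit="statusengine.service"):
    """Restart the unit via systemctl (needs sufficient privileges)."""
    subprocess.run(["systemctl", "restart", unit], check=True)
```

The idea would be to feed it recent journal output for the unit (for example, from a cron job) and call `restart_service()` only when `should_restart(...)` returns true.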
My MySQL is actually an Azure DB within the same resource group and network. I'll try to find out why these connections are failing (feel free to share any ideas, though this is of course out of your scope to fix). Reference article: https://blogs.msdn.microsoft.com/azuresqldbsupport/2018/11/20/azure-database-for-mysql-server-has-gone-away/