[00:39:12] !log [void@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php rebuildtextindex --wiki=pocketdragonswiki (START) [00:39:16] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [00:39:21] !log [void@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php rebuildtextindex --wiki=pocketdragonswiki (END - exit=0) [00:39:25] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [00:42:25] [Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [00:45:35] !log [void@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php cleanupTitles --wiki=pocketdragonswiki (END - exit=0) [00:45:40] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [00:57:16] !log [void@mwtask181] applied DataDump/sql/patches/patch-dumps_status.sql to lgballtwiki [00:57:20] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [01:03:22] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.08, 21.75, 17.45 [01:07:15] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.15, 22.71, 18.74 [01:09:12] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 14.20, 19.94, 18.22 [01:14:29] RECOVERY - wiki.mahdiruiz.line.pm - reverse DNS on sslhost is OK: SSL OK - wiki.mahdiruiz.line.pm reverse DNS resolves to cp37.wikitide.net - CNAME OK [01:47:25] [Grafana] !tech RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [02:49:52] PROBLEM - wiki.mahdiruiz.line.pm - reverse DNS on sslhost is WARNING: NoNameservers: All nameservers failed to answer the query line.pm. IN NS: Server 2606:4700:4700::1111 UDP port 53 answered The DNS operation timed out.; Server 2606:4700:4700::1111 UDP port 53 answered SERVFAIL [03:00:14] RECOVERY - db151 Backups SQL on db151 is OK: FILE_AGE OK: /var/log/sql-backup.log is 12 seconds old and 0 bytes [03:19:53] RECOVERY - wiki.mahdiruiz.line.pm - reverse DNS on sslhost is OK: SSL OK - wiki.mahdiruiz.line.pm reverse DNS resolves to cp36.wikitide.net - CNAME OK [03:25:19] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.78, 20.10, 15.01 [03:29:12] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.31, 22.74, 17.28 [03:33:05] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.67, 23.93, 18.85 [03:36:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 29.15, 22.45, 15.52 [03:38:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 23.42, 23.49, 16.80 [03:41:39] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.75, 22.80, 17.81 [03:42:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 13.46, 18.00, 16.08 [03:45:28] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 19.07, 22.87, 19.09 [03:47:22] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.58, 24.76, 20.22 [03:50:40] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.23, 20.35, 16.54 [03:52:03] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.63, 22.56, 18.59 [03:55:08] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.76, 23.46, 18.97 [03:57:24] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 29.12, 23.39, 19.26 [03:57:55] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.15, 22.60, 19.66 [03:58:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:59:04] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 18.86, 22.24, 19.64 [03:59:58] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 29.39, 23.57, 18.00 [04:00:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.71, 23.64, 19.37 [04:01:02] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.29, 24.27, 20.70 [04:01:49] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 23.05, 23.75, 20.91 [04:03:00] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.13, 23.35, 20.85 [04:03:46] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.99, 25.42, 21.87 [04:06:55] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 26.64, 24.20, 21.62 [04:08:53] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.08, 22.81, 21.43 [04:08:56] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 15.88, 21.94, 21.55 [04:10:51] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.87, 24.86, 22.34 [04:10:51] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.12, 23.27, 22.04 [04:11:37] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.59, 23.28, 21.51 [04:13:34] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 26.46, 24.00, 21.94 [04:24:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.95, 22.81, 23.63 [04:24:36] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 16.86, 21.64, 22.93 [04:24:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 13.20, 20.03, 22.73 [04:26:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 28.99, 25.19, 24.41 [04:27:10] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.44, 23.93, 23.32 [04:29:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 11.72, 20.15, 23.18 [04:30:30] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 8.17, 15.61, 20.10 [04:30:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 7.90, 19.05, 22.42 [04:30:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 7.32, 15.17, 20.17 [04:32:23] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 5.75, 15.13, 21.88 [04:33:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 5.33, 12.93, 18.88 [04:33:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:33:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 5.65, 12.19, 19.20 [04:34:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 6.84, 13.98, 21.91 [04:34:22] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 7.12, 12.41, 20.06 [04:34:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 6.60, 14.43, 22.77 [04:34:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 5.71, 11.89, 18.71 [04:36:12] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 6.08, 11.21, 19.92 [04:38:27] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 8.47, 10.82, 19.37 [06:22:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.55, 18.56, 12.89 [06:24:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.69, 20.28, 14.27 [06:28:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.23, 21.40, 16.02 [06:30:27] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.81, 20.24, 16.24 [06:34:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 29.59, 24.82, 18.96 [06:38:54] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.17, 19.25, 14.77 [06:40:48] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 17.78, 19.65, 15.50 [06:46:31] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.62, 21.89, 17.49 [06:48:25] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 15.88, 19.22, 17.01 [06:53:22] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.17, 20.97, 16.99 [06:55:21] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.83, 20.78, 17.44 [06:56:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 29.72, 25.61, 20.72 [06:57:21] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.74, 21.35, 17.98 [06:58:31] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 29.06, 23.40, 17.76 [06:59:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.36, 22.67, 17.70 [07:00:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:01:11] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.67, 22.35, 16.64 [07:01:19] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.80, 23.88, 19.88 [07:02:53] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.45, 21.85, 18.36 [07:03:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.01, 21.54, 17.07 [07:03:18] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.24, 24.56, 20.58 [07:05:23] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.95, 21.51, 17.76 [07:06:43] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 22.95, 23.01, 19.65 [07:07:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.26, 23.57, 18.77 [07:07:20] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 16.65, 19.32, 17.38 [07:07:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.40, 23.66, 20.44 [07:07:59] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 15.83, 22.59, 20.74 [07:08:39] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 28.25, 24.96, 20.75 [07:11:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.84, 22.30, 19.49 [07:11:11] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 29.68, 24.92, 20.11 [07:11:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 25.99, 24.43, 21.39 [07:13:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 24.89, 23.89, 20.44 [07:15:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.29, 22.75, 20.46 [07:15:34] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.87, 23.82, 21.76 [07:19:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.31, 24.23, 21.42 [07:33:39] PROBLEM - phorge171 issue-tracker.miraheze.org HTTPS on phorge171 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 328 bytes in 0.013 second response time [07:34:15] PROBLEM - phorge171 phorge-static.wikitide.net HTTPS on phorge171 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/1.1 502 Bad Gateway [07:35:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.36, 23.68, 23.82 [07:35:07] PROBLEM - phorge171 php-fpm on phorge171 is CRITICAL: PROCS CRITICAL: 0 processes with command name 'php-fpm8.2' [07:38:15] RECOVERY - phorge171 phorge-static.wikitide.net HTTPS on phorge171 is OK: HTTP OK: Status line output matched "HTTP/1.1 200" - 17717 bytes in 0.044 second response time [07:39:07] RECOVERY - phorge171 php-fpm on phorge171 is OK: PROCS OK: 9 processes with command name 'php-fpm8.2' [07:39:39] RECOVERY - phorge171 issue-tracker.miraheze.org HTTPS on phorge171 is OK: HTTP OK: HTTP/1.1 200 OK - 19090 bytes in 0.061 second response time [07:39:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 18.68, 21.07, 23.58 [07:40:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 17.16, 21.24, 23.69 [07:44:24] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 13.57, 18.33, 23.02 [07:45:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 10.85, 14.39, 19.24 [07:46:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 11.41, 17.37, 22.71 [07:46:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 12.10, 14.90, 20.01 [07:47:00] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 12.93, 18.44, 23.15 [07:47:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 8.56, 13.70, 19.16 [07:48:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 9.71, 15.98, 22.52 [07:48:18] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 9.51, 14.17, 20.34 [07:50:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [07:50:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 9.62, 14.26, 20.30 [07:50:59] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 10.85, 14.22, 20.36 [07:52:12] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 9.20, 12.47, 19.62 [07:56:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 16.19, 18.82, 23.99 [08:10:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 23.54, 19.59, 18.02 [08:14:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 19.20, 18.97, 18.07 [08:14:49] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.66, 20.14, 19.05 [08:16:42] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 19.17, 19.67, 19.01 [08:18:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.56, 21.88, 21.67 [08:20:31] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.88, 22.50, 20.31 [08:22:25] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.28, 23.68, 21.03 [08:22:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.53, 23.55, 22.54 [08:25:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 21.12, 19.63, 15.96 [08:25:14] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 19.17, 21.59, 19.84 [08:25:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [08:27:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 13.83, 17.87, 15.79 [08:27:10] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.93, 23.37, 20.65 [08:28:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.42, 24.37, 23.08 [08:32:40] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 26.63, 21.69, 18.60 [08:33:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.12, 20.96, 17.44 [08:35:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.87, 21.40, 18.03 [08:37:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 19.51, 20.33, 18.03 [08:41:40] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.60, 23.01, 19.77 [08:41:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.07, 22.83, 19.73 [08:43:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.06, 20.38, 17.90 [08:43:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.20, 22.62, 20.04 [08:45:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.67, 24.71, 21.10 [08:46:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.92, 21.26, 18.09 [08:47:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 15.57, 19.98, 18.43 [08:47:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 24.01, 21.35, 19.86 [08:47:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.50, 23.13, 20.93 [08:48:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.33, 20.86, 18.33 [08:49:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 15.35, 19.94, 20.01 [08:50:32] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.84, 22.78, 22.59 [08:50:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 17.29, 19.51, 18.14 [08:51:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 22.78, 22.17, 19.68 [08:52:31] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.14, 24.62, 23.29 [08:53:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 15.55, 22.42, 21.29 [08:53:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 29.25, 24.84, 22.00 [08:55:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 25.95, 23.25, 20.63 [08:55:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.01, 22.97, 21.57 [08:55:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.38, 23.32, 21.82 [08:57:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 17.97, 21.60, 20.38 [08:57:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 21.64, 22.07, 21.41 [08:58:29] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.96, 23.11, 23.28 [08:59:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 13.05, 18.55, 19.41 [08:59:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 26.26, 24.04, 22.23 [09:01:28] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.09, 21.74, 19.24 [09:01:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 12.92, 20.13, 21.04 [09:03:21] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.16, 21.55, 19.50 [09:03:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 13.68, 18.15, 20.21 [09:03:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 13.21, 18.10, 20.00 [09:05:15] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 16.16, 19.73, 19.09 [09:08:25] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 14.33, 17.08, 20.15 [09:09:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.51, 19.74, 19.93 [09:11:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 17.96, 19.29, 19.76 [09:12:22] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.44, 19.72, 20.61 [09:14:21] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 13.43, 17.51, 19.70 [09:14:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 17.67, 22.29, 23.91 [09:14:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.58, 20.92, 19.40 [09:16:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 11.63, 17.34, 18.27 [09:18:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 11.52, 19.38, 23.95 [09:20:05] PROBLEM - test151 conntrack_table_size on test151 is CRITICAL: connect to address 10.0.15.118 port 5666: No route to hostconnect to host 10.0.15.118 port 5666: No route to host [09:20:06] PROBLEM - mem151 Puppet on mem151 is CRITICAL: connect to address 10.0.15.113 port 5666: No route to hostconnect to host 10.0.15.113 port 5666: No route to host [09:20:06] PROBLEM - mem151 NTP time on mem151 is CRITICAL: connect to address 10.0.15.113 port 5666: No route to hostconnect to host 10.0.15.113 port 5666: No route to host [09:20:07] PROBLEM - graphite151 Puppet on graphite151 is CRITICAL: connect to address 10.0.15.145 port 5666: No route to hostconnect to host 10.0.15.145 port 5666: No route to host [09:20:08] PROBLEM - mw182 MediaWiki Rendering on mw182 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 5627 bytes in 0.013 second response time [09:20:09] PROBLEM - mw181 HTTPS on mw181 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 502 [09:20:10] PROBLEM - cp41 HTTPS on cp41 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 503 [09:20:12] PROBLEM - mw161 MediaWiki Rendering on mw161 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 5627 bytes in 0.012 second response time [09:20:13] PROBLEM - changeprop151 SSH on changeprop151 is CRITICAL: connect to address 10.0.15.148 and port 22: No route to host [09:20:16] PROBLEM - ping on changeprop151 is CRITICAL: CRITICAL - Host Unreachable (10.0.15.148) [09:20:19] PROBLEM - cp36 HTTP 4xx/5xx ERROR Rate on cp36 is CRITICAL: CRITICAL - NGINX Error Rate is 77% [09:20:19] PROBLEM - swiftobject151 SSH on swiftobject151 is CRITICAL: connect to address 10.0.15.117 and port 22: No route to host [09:20:20] PROBLEM - ping on mw152 is CRITICAL: CRITICAL - Host Unreachable (10.0.15.115) [09:20:22] PROBLEM - ping on matomo151 is CRITICAL: CRITICAL - Host Unreachable (10.0.15.112) [09:20:23] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is WARNING: WARNING - NGINX Error Rate is 59% [09:20:27] PROBLEM - test151 SSH on test151 is CRITICAL: connect to address 10.0.15.118 and port 22: No route to host [09:20:28] PROBLEM - ping on swiftobject151 is CRITICAL: CRITICAL - Host Unreachable (10.0.15.117) [09:20:29] PROBLEM - Host mw152 is DOWN: CRITICAL - Host Unreachable (10.0.15.115) [09:20:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [09:20:30] PROBLEM - Host changeprop151 is DOWN: CRITICAL - Host Unreachable (10.0.15.148) [09:20:38] PROBLEM - matomo151 HTTPS on matomo151 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: cURL returned 7 - Failed to connect to matomo151.wikitide.net port 443 after 3078 ms: Couldn't connect to server [09:20:42] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 3.60, 15.90, 23.07 [09:20:43] PROBLEM - os151 NTP time on os151 is CRITICAL: connect to address 10.0.15.111 port 5666: No route to hostconnect to host 10.0.15.111 port 5666: No route to host [09:20:46] PROBLEM - os151 PowerDNS Recursor on os151 is CRITICAL: connect to address 10.0.15.111 port 5666: No route to hostconnect to host 10.0.15.111 port 5666: No route to host [09:20:46] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 1.84, 11.18, 18.80 [09:20:47] PROBLEM - Host mw151 is DOWN: CRITICAL - Host Unreachable (10.0.15.114) [09:20:49] PROBLEM - db151 Backups SQL on db151 is CRITICAL: connect to address 10.0.15.110 port 5666: No route to hostconnect to host 10.0.15.110 port 5666: No route to host [09:20:49] PROBLEM - db151 PowerDNS Recursor on db151 is CRITICAL: connect to address 10.0.15.110 port 5666: No route to hostconnect to host 10.0.15.110 port 5666: No route to host [09:20:49] PROBLEM - os151 Current Load on os151 is CRITICAL: connect to address 10.0.15.111 port 5666: No route to hostconnect to host 10.0.15.111 port 5666: No route to host [09:20:49] PROBLEM - os151 Puppet on os151 is CRITICAL: connect to address 10.0.15.111 port 5666: No route to hostconnect to host 10.0.15.111 port 5666: No route to host [09:20:50] PROBLEM - Host graphite151 is DOWN: CRITICAL - Host Unreachable (10.0.15.145) [09:20:51] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: connect to address 10.0.15.116 port 5666: No route to hostconnect to host 10.0.15.116 port 5666: No route to host [09:20:51] PROBLEM - rdb151 NTP time on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:20:51] PROBLEM - rdb151 Redis Process on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:20:52] PROBLEM - db151 Backups SQL mhglobal on db151 is CRITICAL: connect to address 10.0.15.110 port 5666: No route to hostconnect to host 10.0.15.110 port 5666: No route to host [09:20:52] PROBLEM - mem151 ferm_active on mem151 is CRITICAL: connect to address 10.0.15.113 port 5666: No route to hostconnect to host 10.0.15.113 port 5666: No route to host [09:20:52] PROBLEM - mem151 Disk Space on mem151 is CRITICAL: connect to address 10.0.15.113 port 5666: No route to hostconnect to host 10.0.15.113 port 5666: No route to host [09:20:54] PROBLEM - cp37 Varnish Backends on cp37 is CRITICAL: 9 backends are down. mw151 mw152 mw161 mw162 mw171 mw172 mw181 mw182 mediawiki [09:20:54] PROBLEM - rdb151 Disk Space on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:20:54] PROBLEM - rdb151 conntrack_table_size on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:20:54] PROBLEM - rdb151 poolcounter process on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:21:06] PROBLEM - prometheus151 ferm_active on prometheus151 is CRITICAL: connect to address 10.0.15.116 port 5666: No route to hostconnect to host 10.0.15.116 port 5666: No route to host [09:21:06] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: connect to address 10.0.15.116 port 5666: No route to hostconnect to host 10.0.15.116 port 5666: No route to host [09:21:07] PROBLEM - prometheus151 Puppet on prometheus151 is CRITICAL: connect to address 10.0.15.116 port 5666: No route to hostconnect to host 10.0.15.116 port 5666: No route to host [09:21:07] PROBLEM - test151 Disk Space on test151 is CRITICAL: connect to address 10.0.15.118 port 5666: No route to hostconnect to host 10.0.15.118 port 5666: No route to host [09:21:08] PROBLEM - swiftobject151 Puppet on swiftobject151 is CRITICAL: connect to address 10.0.15.117 port 5666: No route to hostconnect to host 10.0.15.117 port 5666: No route to host [09:21:09] PROBLEM - cp26 HTTPS on cp26 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 503 [09:21:09] PROBLEM - mem151 SSH on mem151 is CRITICAL: connect to address 10.0.15.113 and port 22: No route to host [09:21:09] PROBLEM - ping on mem151 is CRITICAL: CRITICAL - Host Unreachable (10.0.15.113) [09:21:10] PROBLEM - mw182 HTTPS on mw182 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 502 [09:21:10] PROBLEM - cp26 Varnish Backends on cp26 is CRITICAL: 9 backends are down. mw151 mw152 mw161 mw162 mw171 mw172 mw181 mw182 mediawiki [09:21:11] PROBLEM - swiftobject151 conntrack_table_size on swiftobject151 is CRITICAL: connect to address 10.0.15.117 port 5666: No route to hostconnect to host 10.0.15.117 port 5666: No route to host [09:21:11] PROBLEM - cp37 HTTP 4xx/5xx ERROR Rate on cp37 is CRITICAL: CRITICAL - NGINX Error Rate is 78% [09:21:12] PROBLEM - test151 PowerDNS Recursor on test151 is CRITICAL: connect to address 10.0.15.118 port 5666: No route to hostconnect to host 10.0.15.118 port 5666: No route to host [09:21:13] PROBLEM - mw161 HTTPS on mw161 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 502 [09:21:15] PROBLEM - db151 Puppet on db151 is CRITICAL: connect to address 10.0.15.110 port 5666: No route to hostconnect to host 10.0.15.110 port 5666: No route to host [09:21:15] PROBLEM - db151 SSH on db151 is CRITICAL: connect to address 10.0.15.110 and port 22: No route to host [09:21:15] PROBLEM - mw171 MediaWiki Rendering on mw171 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 5627 bytes in 0.013 second response time [09:21:16] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: connect to address 10.0.15.116 and port 22: No route to host [09:21:17] PROBLEM - rdb151 Current Load on rdb151 is CRITICAL: connect to address 10.0.15.142 port 5666: No route to hostconnect to host 10.0.15.142 port 5666: No route to host [09:21:17] PROBLEM - matomo151 PowerDNS Recursor on matomo151 is CRITICAL: connect to address 10.0.15.112 port 5666: No route to hostconnect to host 10.0.15.112 port 5666: No route to host [09:21:17] PROBLEM - matomo151 Redis Process on matomo151 is CRITICAL: connect to address 10.0.15.112 port 5666: No route to hostconnect to host 10.0.15.112 port 5666: No route to host [09:21:29] PROBLEM - prometheus151 conntrack_table_size on prometheus151 is CRITICAL: connect to address 10.0.15.116 port 5666: No route to hostconnect to host 10.0.15.116 port 5666: No route to host [09:21:30] PROBLEM - cloud15 SSH on cloud15 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [09:21:37] PROBLEM - cp36 HTTPS on cp36 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 503 [09:21:39] PROBLEM - matomo151 SSH on matomo151 is CRITICAL: connect to address 10.0.15.112 and port 22: No route to host [09:21:40] PROBLEM - matomo151 php-fpm on matomo151 is CRITICAL: connect to address 10.0.15.112 port 5666: No route to hostconnect to host 10.0.15.112 port 5666: No route to host [09:21:40] PROBLEM - Host matomo151 is DOWN: CRITICAL - Host Unreachable (10.0.15.112) [09:21:41] PROBLEM - db151 NTP time on db151 is CRITICAL: connect to address 10.0.15.110 port 5666: No route to hostconnect to host 10.0.15.110 port 5666: No route to host [09:21:41] PROBLEM - db151 MariaDB Connections on db151 is UNKNOWN: PHP Fatal error: Uncaught mysqli_sql_exception: Connection timed out in /usr/lib/nagios/plugins/check_mysql_connections.php:66Stack trace:#0 /usr/lib/nagios/plugins/check_mysql_connections.php(66): mysqli_real_connect(Object(mysqli), 'db151.wikitide....', 'icinga', Object(SensitiveParameterValue), NULL, NULL, NULL, true)#1 {main} thrown in /usr/lib/nagios/plugins/check_mysql_conne [09:21:41] on line 66Fatal error: Uncaught mysqli_sql_exception: Connection timed out in /usr/lib/nagios/plugins/check_mysql_connections.php:66Stack trace:#0 /usr/lib/nagios/plugins/check_mysql_connections.php(66): mysqli_real_connect(Object(mysqli), 'db151.wikitide....', 'icinga', Object(SensitiveParameterValue), NULL, NULL, NULL, true)#1 {main} thrown in /usr/lib/nagios/plugins/check_mysql_connections.php on line 66 [09:21:42] PROBLEM - Host prometheus151 is DOWN: CRITICAL - Host Unreachable (10.0.15.116) [09:21:43] PROBLEM - cp41 Varnish Backends on cp41 is CRITICAL: 9 backends are down. mw151 mw152 mw161 mw162 mw171 mw172 mw181 mw182 mediawiki [09:21:46] PROBLEM - cp36 Varnish Backends on cp36 is CRITICAL: 9 backends are down. mw151 mw152 mw161 mw162 mw171 mw172 mw181 mw182 mediawiki [09:21:47] PROBLEM - mw162 MediaWiki Rendering on mw162 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [09:21:49] PROBLEM - Host db151 is DOWN: CRITICAL - Host Unreachable (10.0.15.110) [09:21:50] PROBLEM - cp37 HTTPS on cp37 is CRITICAL: HTTP CRITICAL - Invalid HTTP response received from host on port 443: HTTP/2 503 [09:21:50] PROBLEM - mw181 MediaWiki Rendering on mw181 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 5627 bytes in 0.017 second response time [09:21:50] PROBLEM - ping6 on cloud15 is CRITICAL: PING CRITICAL - Packet loss = 100% [09:21:56] PROBLEM - Host cloud15 is DOWN: PING CRITICAL - Packet loss = 100% [09:21:57] PROBLEM - cp27 Varnish Backends on cp27 is CRITICAL: 9 backends are down. mw151 mw152 mw161 mw162 mw171 mw172 mw181 mw182 mediawiki [09:22:12] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 1.44, 9.43, 18.81 [09:24:15] Is it just me, or is Miraheze 503/502-ing? meta.miraheze.org and rainverse.miraheze.org gives me 502, rainverse.wiki gives me 503 (Backend fetch failed via cp51.wikitide.net at Sun, 26 May 2024 09:22:31 GMT; Varnish XID 6265578, serving 2403:4800:3202:de23:f64a:875a:5b18:d5de, 127.0.0.1 (your IP!)) [09:24:35] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 1.48, 8.05, 18.24 [09:29:06] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 43% [09:30:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is CRITICAL: CRITICAL - NGINX Error Rate is 60% [09:30:42] PROBLEM - os161 Puppet on os161 is CRITICAL: CRITICAL: Puppet has 1 failures. Last run 2 minutes ago with 1 failures. Failed resources (up to 3 shown): Opensearch_template[graylog-internal] [09:32:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is WARNING: WARNING - NGINX Error Rate is 58% [09:34:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is CRITICAL: CRITICAL - NGINX Error Rate is 64% [09:35:06] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 38% [09:35:57] RECOVERY - Host cloud15 is UP: PING OK - Packet loss = 0%, RTA = 0.39 ms [09:36:05] RECOVERY - cloud15 SSH on cloud15 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [09:36:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is WARNING: WARNING - NGINX Error Rate is 49% [09:36:41] RECOVERY - ping6 on cloud15 is OK: PING OK - Packet loss = 0%, RTA = 0.19 ms [09:42:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is CRITICAL: CRITICAL - NGINX Error Rate is 72% [09:55:09] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 54% [09:55:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [09:59:09] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 39% [09:59:59] !log depool mw151/152 in cf from * [10:03:13] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 43% [10:05:55] [02mw-config] 07redbluegreenhat opened pull request 03#5574: mark c1 as down - 13https://github.com/miraheze/mw-config/pull/5574 [10:06:56] [02mw-config] 07redbluegreenhat closed pull request 03#5574: mark c1 as down - 13https://github.com/miraheze/mw-config/pull/5574 [10:06:57] !log [@mwtask171] starting deploy of {'config': True} to all [10:06:58] [02mw-config] 07redbluegreenhat pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mw-config/compare/37beaa78521e...eb3356bd3b19 [10:07:00] [02mw-config] 07redbluegreenhat 03eb3356b - mark c1 as down (#5574) [10:07:05] miraheze/mw-config - redbluegreenhat the build passed. [10:07:36] !log [alex@mwtask181] starting deploy of {'pull': 'config', 'config': True} to all [10:07:57] miraheze/mw-config - redbluegreenhat the build passed. [10:07:57] !log [alex@mwtask181] DEPLOY ABORTED: Canary check failed for publictestwiki.com@mw151.wikitide.net [10:08:42] !log [alex@mwtask181] starting deploy of {'pull': 'config', 'config': True, 'force': True} to all [10:09:19] !log [@mwtask171] DEPLOY ABORTED: Canary check failed for publictestwiki.com@mw151.wikitide.net [10:11:30] !log [alex@mwtask181] finished deploy of {'pull': 'config', 'config': True, 'force': True} to all - SUCCESS in 168s [10:21:15] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 39% [10:25:16] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 44% [10:29:17] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 35% [10:35:18] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 46% [10:36:33] !log restarted nginx on mw161 [10:37:02] oh, right, logging doesn't work [10:37:59] !log restarted php8.2-fpm on mw161 [10:39:18] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 37% [10:45:19] PROBLEM - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is WARNING: WARNING - NGINX Error Rate is 49% [10:53:28] RECOVERY - Host test151 is UP: PING OK - Packet loss = 0%, RTA = 0.23 ms [10:53:33] RECOVERY - test151 Redis Process on test151 is OK: PROCS OK: 1 process with args 'redis-server' [10:53:34] RECOVERY - test151 PowerDNS Recursor on test151 is OK: DNS OK: 0.038 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:53:55] RECOVERY - test151 conntrack_table_size on test151 is OK: OK: nf_conntrack is 0 % full [10:53:56] PROBLEM - test151 NTP time on test151 is CRITICAL: NTP CRITICAL: Offset 0.9506341815 secs [10:54:01] RECOVERY - test151 poolcounter process on test151 is OK: PROCS OK: 1 process with UID = 999 (poolcounter), command name 'poolcounterd' [10:54:02] RECOVERY - test151 php-fpm on test151 is OK: PROCS OK: 13 processes with command name 'php-fpm8.2' [10:54:04] RECOVERY - test151 APT on test151 is OK: APT OK: 27 packages available for upgrade (0 critical updates). [10:54:14] RECOVERY - mw181 HTTPS on mw181 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 6.882 second response time [10:54:19] RECOVERY - Host changeprop151 is UP: PING OK - Packet loss = 0%, RTA = 0.31 ms [10:54:21] PROBLEM - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is WARNING: WARNING - NGINX Error Rate is 58% [10:54:22] RECOVERY - mw161 MediaWiki Rendering on mw161 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 3.617 second response time [10:54:25] RECOVERY - mw172 HTTPS on mw172 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.588 second response time [10:54:28] RECOVERY - Host mw151 is UP: PING OK - Packet loss = 0%, RTA = 0.28 ms [10:54:28] RECOVERY - mw182 HTTPS on mw182 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.072 second response time [10:54:29] RECOVERY - Host mw152 is UP: PING OK - Packet loss = 0%, RTA = 0.28 ms [10:54:30] !log [@test151] starting deploy of {'config': True} to test151 [10:54:30] RECOVERY - mw161 HTTPS on mw161 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.061 second response time [10:54:31] RECOVERY - ping on mw152 is OK: PING OK - Packet loss = 0%, RTA = 0.25 ms [10:54:31] !log [@test151] finished deploy of {'config': True} to test151 - SUCCESS in 0s [10:54:32] RECOVERY - mw162 HTTPS on mw162 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.062 second response time [10:54:32] RECOVERY - test151 HTTPS on test151 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.083 second response time [10:54:33] RECOVERY - test151 Current Load on test151 is OK: LOAD OK - total load average: 0.51, 0.16, 0.06 [10:54:35] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [10:54:40] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [10:54:45] RECOVERY - Host graphite151 is UP: PING OK - Packet loss = 0%, RTA = 0.25 ms [10:54:50] RECOVERY - mw172 MediaWiki Rendering on mw172 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 0.703 second response time [10:54:51] PROBLEM - mw152 NTP time on mw152 is WARNING: NTP WARNING: Offset 0.4848535359 secs [10:54:54] RECOVERY - mw171 HTTPS on mw171 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 286 bytes in 0.059 second response time [10:54:55] RECOVERY - cp51 HTTPS on cp51 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3782 bytes in 1.041 second response time [10:55:04] RECOVERY - cp26 Varnish Backends on cp26 is OK: All 19 backends are healthy [10:55:06] PROBLEM - changeprop151 Puppet on changeprop151 is WARNING: WARNING: Puppet last ran 2 hours ago [10:55:11] RECOVERY - cp37 HTTP 4xx/5xx ERROR Rate on cp37 is OK: OK - NGINX Error Rate is 17% [10:55:11] PROBLEM - mw152 Puppet on mw152 is WARNING: WARNING: Puppet last ran 2 hours ago [10:55:15] RECOVERY - Host swiftobject151 is UP: PING OK - Packet loss = 0%, RTA = 0.34 ms [10:55:16] PROBLEM - mw151 NTP time on mw151 is CRITICAL: NTP CRITICAL: Offset 0.656027317 secs [10:55:17] RECOVERY - cp27 HTTPS on cp27 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3783 bytes in 0.894 second response time [10:55:17] RECOVERY - mw171 MediaWiki Rendering on mw171 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 0.324 second response time [10:55:21] RECOVERY - Host os151 is UP: PING OK - Packet loss = 0%, RTA = 3.08 ms [10:55:21] RECOVERY - cp51 HTTP 4xx/5xx ERROR Rate on cp51 is OK: OK - NGINX Error Rate is 36% [10:55:26] RECOVERY - test151 SSH on test151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:55:26] RECOVERY - test151 Disk Space on test151 is OK: DISK OK - free space: / 63761MiB (72% inode=84%); [10:55:27] RECOVERY - cp51 Varnish Backends on cp51 is OK: All 19 backends are healthy [10:55:28] RECOVERY - Host rdb151 is UP: PING OK - Packet loss = 0%, RTA = 0.36 ms [10:55:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [10:55:31] RECOVERY - Host mem151 is UP: PING OK - Packet loss = 0%, RTA = 0.20 ms [10:55:37] RECOVERY - cp36 HTTPS on cp36 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3784 bytes in 0.078 second response time [10:55:37] RECOVERY - mw162 MediaWiki Rendering on mw162 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 0.535 second response time [10:55:41] RECOVERY - Host matomo151 is UP: PING OK - Packet loss = 0%, RTA = 0.83 ms [10:55:41] RECOVERY - Host prometheus151 is UP: PING OK - Packet loss = 0%, RTA = 0.41 ms [10:55:43] RECOVERY - cp41 Varnish Backends on cp41 is OK: All 19 backends are healthy [10:55:43] RECOVERY - matomo151 NTP time on matomo151 is OK: NTP OK: Offset -0.001078605652 secs [10:55:43] RECOVERY - Host db151 is UP: PING OK - Packet loss = 0%, RTA = 0.65 ms [10:55:46] RECOVERY - db151 NTP time on db151 is OK: NTP OK: Offset 0.04454487562 secs [10:55:46] RECOVERY - cp36 Varnish Backends on cp36 is OK: All 19 backends are healthy [10:55:49] RECOVERY - mw182 MediaWiki Rendering on mw182 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 0.351 second response time [10:55:50] RECOVERY - cp37 HTTPS on cp37 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3762 bytes in 0.072 second response time [10:55:52] RECOVERY - matomo151 Disk Space on matomo151 is OK: DISK OK - free space: / 9017MiB (50% inode=90%); [10:55:57] RECOVERY - test151 NTP time on test151 is OK: NTP OK: Offset 0.002850949764 secs [10:55:57] RECOVERY - cp27 Varnish Backends on cp27 is OK: All 19 backends are healthy [10:55:57] RECOVERY - cp41 HTTPS on cp41 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3784 bytes in 0.798 second response time [10:56:00] RECOVERY - mw181 MediaWiki Rendering on mw181 is OK: HTTP OK: HTTP/1.1 200 OK - 8191 bytes in 0.384 second response time [10:56:01] RECOVERY - changeprop151 SSH on changeprop151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:56:06] RECOVERY - prometheus151 Puppet on prometheus151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:56:11] RECOVERY - graphite151 Puppet on graphite151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:56:16] RECOVERY - swiftobject151 SSH on swiftobject151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) RECOVERY - mem151 PowerDNS Recursor on mem151 is OK: DNS OK: 0.045 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:56:16] RECOVERY - db151 Backups SQL on db151 is OK: FILE_AGE OK: /var/log/sql-backup.log is 5885 seconds old and 106503 bytes [10:56:19] RECOVERY - cp36 HTTP 4xx/5xx ERROR Rate on cp36 is OK: OK - NGINX Error Rate is 12% [10:56:21] RECOVERY - cp41 HTTP 4xx/5xx ERROR Rate on cp41 is OK: OK - NGINX Error Rate is 7% [10:56:21] [02mw-config] 07redbluegreenhat created branch 03revert-5574-c1-down - 13https://github.com/miraheze/mw-config [10:56:21] RECOVERY - mem151 Puppet on mem151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:56:21] RECOVERY - matomo151 Current Load on matomo151 is OK: LOAD OK - total load average: 0.59, 0.24, 0.09 [10:56:21] RECOVERY - ping on changeprop151 is OK: PING OK - Packet loss = 0%, RTA = 1.01 ms [10:56:24] RECOVERY - cp26 HTTPS on cp26 is OK: HTTP OK: HTTP/2 404 - Status line output matched "HTTP/2 404" - 3760 bytes in 1.264 second response time [10:56:24] [02mw-config] 07redbluegreenhat pushed 031 commit to 03revert-5574-c1-down [+0/-0/±1] 13https://github.com/miraheze/mw-config/commit/6bc8345e53a0 [10:56:26] [02mw-config] 07redbluegreenhat 036bc8345 - Revert "mark c1 as down (#5574)" [10:56:26] RECOVERY - matomo151 HTTPS on matomo151 is OK: HTTP OK: HTTP/2 200 - 552 bytes in 0.217 second response time [10:56:26] RECOVERY - mem151 NTP time on mem151 is OK: NTP OK: Offset 0.002567827702 secs [10:56:26] RECOVERY - ping on swiftobject151 is OK: PING OK - Packet loss = 0%, RTA = 0.26 ms [10:56:27] [02mw-config] 07redbluegreenhat opened pull request 03#5575: Revert "mark c1 as down" - 13https://github.com/miraheze/mw-config/pull/5575 [10:56:31] [02mw-config] 07redbluegreenhat closed pull request 03#5575: Revert "mark c1 as down" - 13https://github.com/miraheze/mw-config/pull/5575 [10:56:31] RECOVERY - rdb151 Redis Process on rdb151 is OK: PROCS OK: 1 process with args 'redis-server' [10:56:31] RECOVERY - mem151 Disk Space on mem151 is OK: DISK OK - free space: / 5810MiB (66% inode=86%); [10:56:31] RECOVERY - rdb151 NTP time on rdb151 is OK: NTP OK: Offset 0.006193190813 secs [10:56:31] RECOVERY - rdb151 conntrack_table_size on rdb151 is OK: OK: nf_conntrack is 0 % full [10:56:34] [02mw-config] 07redbluegreenhat deleted branch 03revert-5574-c1-down [10:56:36] RECOVERY - db151 conntrack_table_size on db151 is OK: OK: nf_conntrack is 0 % full [10:56:36] RECOVERY - rdb151 poolcounter process on rdb151 is OK: PROCS OK: 1 process with UID = 999 (poolcounter), command name 'poolcounterd' [10:56:36] RECOVERY - rdb151 Disk Space on rdb151 is OK: DISK OK - free space: / 5817MiB (66% inode=86%); [10:56:37] [02mw-config] 07redbluegreenhat pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mw-config/compare/eb3356bd3b19...be5ca9fb2ccf [10:56:39] [02mw-config] 07redbluegreenhat 03be5ca9f - Revert "mark c1 as down" (#5575) [10:56:41] RECOVERY - os151 Puppet on os151 is OK: OK: Puppet is currently enabled, last run 20 seconds ago with 0 failures [10:56:41] RECOVERY - os151 NTP time on os151 is OK: NTP OK: Offset 0.002594023943 secs [10:56:42] [02mw-config] 07redbluegreenhat deleted branch 03revert-5574-c1-down - 13https://github.com/miraheze/mw-config [10:56:43] RECOVERY - cp37 Varnish Backends on cp37 is OK: All 19 backends are healthy [10:56:46] RECOVERY - os151 PowerDNS Recursor on os151 is OK: DNS OK: 0.109 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:56:46] RECOVERY - rdb151 ferm_active on rdb151 is OK: OK ferm input default policy is set [10:56:47] RECOVERY - os151 conntrack_table_size on os151 is OK: OK: nf_conntrack is 2 % full [10:56:47] RECOVERY - db151 Disk Space on db151 is OK: DISK OK - free space: / 512767MiB (57% inode=98%); [10:56:47] RECOVERY - prometheus151 ferm_active on prometheus151 is OK: OK ferm input default policy is set [10:56:47] RECOVERY - rdb151 SSH on rdb151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:56:47] RECOVERY - ping on db151 is OK: PING OK - Packet loss = 0%, RTA = 0.96 ms [10:56:47] RECOVERY - matomo151 ferm_active on matomo151 is OK: OK ferm input default policy is set [10:56:51] RECOVERY - matomo151 Puppet on matomo151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:56:51] RECOVERY - db151 Backups SQL mhglobal on db151 is OK: FILE_AGE OK: /var/log/sql-mhglobal-backup-weekly.log is 21400 seconds old and 208 bytes [10:56:51] RECOVERY - os151 Disk Space on os151 is OK: DISK OK - free space: / 120381MiB (54% inode=99%); [10:56:51] RECOVERY - db151 PowerDNS Recursor on db151 is OK: DNS OK: 0.034 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:56:53] RECOVERY - mw152 NTP time on mw152 is OK: NTP OK: Offset 0.0001481175423 secs [10:56:56] RECOVERY - rdb151 PowerDNS Recursor on rdb151 is OK: DNS OK: 0.033 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:56:56] RECOVERY - db151 ferm_active on db151 is OK: OK ferm input default policy is set [10:57:00] RECOVERY - changeprop151 Puppet on changeprop151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:57:01] RECOVERY - mem151 ferm_active on mem151 is OK: OK ferm input default policy is set [10:57:01] RECOVERY - mem151 SSH on mem151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:57:09] RECOVERY - mw152 Puppet on mw152 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:57:10] !log [alex@mwtask181] starting deploy of {'pull': 'config', 'config': True} to all [10:57:11] RECOVERY - matomo151 conntrack_table_size on matomo151 is OK: OK: nf_conntrack is 0 % full [10:57:11] RECOVERY - matomo151 Redis Process on matomo151 is OK: PROCS OK: 1 process with args 'redis-server' [10:57:11] RECOVERY - swiftobject151 conntrack_table_size on swiftobject151 is OK: OK: nf_conntrack is 0 % full [10:57:11] RECOVERY - matomo151 PowerDNS Recursor on matomo151 is OK: DNS OK: 0.029 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:57:11] RECOVERY - rdb151 Current Load on rdb151 is OK: LOAD OK - total load average: 0.25, 0.18, 0.08 [10:57:15] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [10:57:16] RECOVERY - db151 SSH on db151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:57:16] RECOVERY - swiftobject151 Puppet on swiftobject151 is OK: OK: Puppet is currently enabled, last run 2 minutes ago with 0 failures [10:57:16] RECOVERY - os151 SSH on os151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:57:16] RECOVERY - mem151 conntrack_table_size on mem151 is OK: OK: nf_conntrack is 0 % full [10:57:16] RECOVERY - mem151 Current Load on mem151 is OK: LOAD OK - total load average: 0.19, 0.19, 0.09 [10:57:16] RECOVERY - db151 Puppet on db151 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [10:57:16] RECOVERY - ping on matomo151 is OK: PING OK - Packet loss = 0%, RTA = 1.00 ms [10:57:17] RECOVERY - mw151 NTP time on mw151 is OK: NTP OK: Offset 0.003615319729 secs [10:57:19] miraheze/mw-config - redbluegreenhat the build passed. [10:57:21] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.038 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [10:57:21] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:57:22] !log [alex@mwtask181] finished deploy of {'pull': 'config', 'config': True} to all - SUCCESS in 11s [10:57:26] RECOVERY - prometheus151 conntrack_table_size on prometheus151 is OK: OK: nf_conntrack is 0 % full [10:57:26] RECOVERY - ping on mem151 is OK: PING OK - Packet loss = 0%, RTA = 0.24 ms [10:57:28] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [10:57:31] miraheze/mw-config - redbluegreenhat the build passed. [10:57:41] RECOVERY - matomo151 SSH on matomo151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u2 (protocol 2.0) [10:57:41] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 0.06, 0.13, 0.07 [10:57:41] RECOVERY - db151 MariaDB Connections on db151 is OK: OK connection usage: 4.6%Current connections: 23 [10:57:41] RECOVERY - matomo151 php-fpm on matomo151 is OK: PROCS OK: 37 processes with command name 'php-fpm8.2' [10:58:37] RECOVERY - os161 Puppet on os161 is OK: OK: Puppet is currently enabled, last run 24 seconds ago with 0 failures [10:59:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.49, 17.14, 8.15 [11:00:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.27, 17.52, 8.51 [11:00:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] !tech RESOLVED: High Job Queue Backlog https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [11:02:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.85, 19.01, 10.14 [11:02:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 29.73, 21.16, 11.03 [11:04:31] PROBLEM - os151 Current Load on os151 is WARNING: LOAD WARNING - total load average: 3.07, 3.73, 2.18 [11:05:40] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.69, 22.63, 13.84 [11:05:40] PROBLEM - graylog161 Current Load on graylog161 is WARNING: LOAD WARNING - total load average: 7.20, 6.21, 3.95 [11:07:04] !log [@mwtask171] starting deploy of {'config': True} to all [11:07:15] !log [@mwtask171] DEPLOY ABORTED: Canary check failed for publictestwiki.com@mw182.wikitide.net [11:07:18] PROBLEM - os161 Current Load on os161 is WARNING: LOAD WARNING - total load average: 3.92, 3.07, 1.64 [11:07:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 18.22, 19.99, 13.89 [11:07:43] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [11:08:04] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [11:08:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.51, 20.83, 13.72 [11:08:28] RECOVERY - os151 Current Load on os151 is OK: LOAD OK - total load average: 2.52, 3.17, 2.31 [11:09:18] RECOVERY - os161 Current Load on os161 is OK: LOAD OK - total load average: 1.18, 2.44, 1.59 [11:09:40] RECOVERY - graylog161 Current Load on graylog161 is OK: LOAD OK - total load average: 1.84, 4.64, 3.91 [11:13:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 19.77, 21.19, 16.36 [11:14:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 24.16, 22.21, 15.94 [11:15:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.33, 23.32, 17.71 [11:18:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.46, 20.67, 15.85 [11:20:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.16, 20.43, 16.37 [11:21:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.36, 23.52, 19.89 [11:22:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 16.23, 18.84, 16.28 [11:23:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 24.02, 24.46, 20.71 [11:23:58] !log [@test151] starting deploy of {'config': True} to test151 [11:23:59] !log [@test151] finished deploy of {'config': True} to test151 - SUCCESS in 0s [11:24:12] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [11:24:53] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [11:27:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.70, 20.37, 16.21 [11:29:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.04, 22.94, 17.63 [11:35:22] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.11, 23.81, 19.78 [11:37:15] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 18.06, 21.58, 19.45 [11:41:02] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.16, 23.79, 20.66 [11:46:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.20, 23.44, 21.81 [11:48:49] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.63, 15.73, 8.26 [11:48:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 26.20, 24.17, 22.24 [11:50:48] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.01, 18.60, 10.21 [11:55:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 29.26, 22.91, 13.56 [11:56:47] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.56, 21.20, 14.26 [11:58:47] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 19.81, 20.30, 14.75 [11:59:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.94, 22.49, 15.71 [12:01:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 27.67, 24.67, 17.34 [12:01:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 17.31, 21.98, 23.92 [12:03:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.78, 23.93, 18.01 [12:05:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 13.68, 20.36, 17.44 [12:06:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 14.69, 20.49, 23.69 [12:08:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 18.44, 22.30, 23.52 [12:09:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 11.33, 14.69, 19.55 [12:09:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 14.83, 20.61, 23.29 [12:14:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 12.54, 15.22, 19.89 [12:16:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 14.56, 16.60, 20.31 [12:17:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 15.50, 15.42, 19.54 [12:20:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [12:25:36] PROBLEM - wiki.mahdiruiz.line.pm - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.mahdiruiz.line.pm All nameservers failed to answer the query. [12:26:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 17.55, 20.40, 23.52 [12:26:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.05, 20.33, 23.72 [12:36:12] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 11.19, 15.11, 19.65 [12:40:27] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 12.05, 16.23, 20.10 [12:50:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 24.51, 18.59, 16.45 [12:52:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.18, 22.53, 19.75 [12:52:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.28, 23.56, 21.14 [12:52:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 22.06, 19.95, 17.22 [12:54:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 26.42, 21.63, 18.12 [12:55:40] RECOVERY - wiki.mahdiruiz.line.pm - reverse DNS on sslhost is OK: SSL OK - wiki.mahdiruiz.line.pm reverse DNS resolves to cp36.wikitide.net - CNAME OK [12:58:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.48, 22.14, 19.24 [13:00:25] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.28, 21.86, 18.13 [13:00:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.78, 24.37, 20.41 [13:02:08] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.62, 22.26, 17.72 [13:02:23] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 23.91, 22.04, 18.62 [13:03:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 24.96, 20.46, 15.64 [13:04:04] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.33, 21.52, 17.98 [13:04:20] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.72, 22.83, 17.82 [13:05:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 17.91, 18.89, 15.63 [13:06:19] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 25.25, 23.32, 19.88 [13:07:54] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 19.98, 20.04, 18.17 [13:08:18] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.56, 23.70, 19.29 [13:09:23] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.23, 22.69, 18.45 [13:11:46] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 17.47, 21.08, 19.22 [13:12:17] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 16.51, 19.46, 18.52 [13:13:17] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.02, 23.52, 19.87 [13:13:42] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 12.90, 18.32, 18.44 [13:16:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [13:17:11] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.81, 22.74, 20.15 [13:19:08] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 17.95, 20.38, 19.57 [13:22:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 18.95, 22.87, 23.63 [13:26:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.21, 25.27, 24.38 [13:26:54] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 21.84, 21.33, 19.83 [13:27:56] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.99, 23.74, 23.90 [13:28:08] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 26.03, 22.62, 20.43 [13:28:52] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 14.00, 18.59, 19.01 [13:29:53] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.31, 25.46, 24.51 [13:30:07] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 15.54, 20.43, 19.94 [13:32:06] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 10.79, 17.18, 18.82 [13:32:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 14.10, 20.44, 22.83 [13:35:47] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 16.26, 21.88, 23.54 [13:38:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 12.32, 16.51, 20.38 [13:43:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 12.61, 15.83, 19.87 [13:44:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 14.97, 18.05, 22.68 [13:50:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 16.25, 19.82, 23.95 [13:51:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [13:52:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.12, 19.33, 21.24 [13:54:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.82, 21.86, 23.79 [13:56:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.25, 22.59, 23.89 [13:56:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.55, 20.86, 21.62 [14:06:27] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.75, 19.05, 20.16 [14:08:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.82, 21.00, 21.60 [14:10:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.64, 21.34, 20.87 [14:12:25] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.14, 20.92, 17.57 [14:12:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.02, 23.78, 21.82 [14:14:12] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.96, 23.87, 22.91 [14:14:23] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 19.95, 19.76, 17.53 [14:16:12] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.70, 25.80, 23.74 [14:16:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.20, 21.95, 18.82 [14:17:47] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.73, 19.33, 16.47 [14:17:51] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.35, 21.24, 17.33 [14:18:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 14.56, 18.84, 18.06 [14:19:48] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 17.31, 20.05, 17.38 [14:21:12] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.88, 21.11, 18.82 [14:23:10] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 16.69, 20.16, 18.78 [14:23:19] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.64, 21.75, 17.94 [14:23:44] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 29.95, 25.02, 19.94 [14:25:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.36, 22.90, 18.77 [14:25:44] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 26.58, 22.45, 18.89 [14:27:04] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.16, 20.96, 19.56 [14:27:37] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 16.96, 22.68, 20.33 [14:28:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.29, 23.49, 20.30 [14:29:05] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.25, 23.56, 19.78 [14:29:43] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 17.48, 21.20, 19.38 [14:30:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 23.03, 22.66, 20.36 [14:31:00] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.31, 23.80, 20.92 [14:31:30] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 16.25, 19.29, 19.42 [14:33:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 22.22, 23.76, 21.01 [14:34:51] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 17.35, 23.08, 21.12 [14:34:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 28.59, 24.90, 21.72 [14:35:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.11, 25.30, 21.90 [14:35:40] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.17, 23.49, 20.79 [14:37:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [14:37:39] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.27, 23.15, 21.06 [14:38:51] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 16.35, 22.77, 22.12 [14:39:38] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.77, 23.96, 21.61 [14:40:37] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.10, 24.11, 21.98 [14:41:37] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 20.61, 23.39, 21.74 [14:42:47] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.10, 21.77, 21.68 [14:43:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 21.99, 22.44, 22.17 [14:44:45] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 17.62, 20.29, 21.18 [14:46:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.07, 23.59, 22.66 [14:47:35] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.10, 23.92, 22.34 [14:48:41] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.08, 22.88, 22.08 [14:48:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 17.75, 22.93, 23.11 [14:52:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.94, 24.45, 23.55 [14:53:33] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 17.33, 21.68, 22.17 [14:54:34] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.62, 22.03, 22.03 [14:55:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 24.35, 22.01, 19.97 [14:55:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 27.61, 22.96, 21.70 [14:57:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 19.59, 20.30, 19.54 [14:58:30] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.09, 22.40, 22.10 [14:58:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.63, 23.21, 22.23 [14:59:40] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 21.40, 23.22, 22.25 [15:00:28] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.06, 22.09, 22.09 [15:00:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 19.29, 23.70, 23.78 [15:01:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.34, 22.58, 20.73 [15:01:29] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.29, 23.54, 22.58 [15:01:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.36, 24.43, 22.83 [15:03:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.95, 24.94, 21.82 [15:03:28] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 19.18, 22.14, 22.22 [15:03:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 18.50, 21.79, 22.03 [15:04:24] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 26.52, 22.56, 22.14 [15:05:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 19.76, 22.33, 21.22 [15:05:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.80, 22.75, 22.30 [15:06:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 16.44, 22.44, 22.77 [15:06:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.81, 24.09, 23.67 [15:08:19] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 22.96, 23.92, 22.88 [15:09:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 15.07, 18.08, 19.74 [15:10:17] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.53, 25.61, 23.61 [15:13:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 18.85, 23.48, 23.29 [15:14:13] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.59, 23.12, 23.12 [15:16:11] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.64, 24.72, 23.68 [15:17:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 22.59, 22.18, 20.89 [15:17:23] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.60, 21.07, 20.97 [15:17:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 26.56, 22.72, 22.80 [15:18:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 14.87, 21.16, 23.18 [15:19:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 18.06, 21.67, 22.47 [15:20:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.29, 23.28, 22.47 [15:20:35] PROBLEM - wiki.theluxuryelevator.com - LetsEncrypt on sslhost is WARNING: WARNING - Certificate 'wiki.theluxuryelevator.com' expires in 15 day(s) (Tue 11 Jun 2024 03:03:37 PM GMT +0000). [15:20:47] [02ssl] 07WikiTideSSLBot pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/ssl/compare/f53dbe0716ef...7b13c2f12212 [15:20:48] [02ssl] 07WikiTideSSLBot 037b13c2f - Bot: Update SSL cert for wiki.theluxuryelevator.com [15:20:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.19, 22.17, 23.24 [15:22:04] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 18.78, 23.81, 23.88 [15:22:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.38, 21.34, 22.81 [15:24:02] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.33, 24.32, 24.06 [15:26:00] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.51, 22.74, 23.50 [15:26:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 16.03, 22.09, 22.61 [15:27:19] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.47, 23.30, 22.68 [15:29:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 27.10, 23.05, 21.58 [15:29:18] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.13, 24.17, 23.10 [15:29:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.55, 22.46, 22.26 [15:31:03] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 18.30, 21.07, 21.03 [15:31:17] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.80, 23.16, 22.92 [15:31:39] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 16.52, 19.86, 21.33 [15:33:03] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 12.46, 17.91, 19.88 [15:35:15] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 25.90, 22.69, 22.58 [15:36:54] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.58, 20.42, 21.06 [15:37:14] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.69, 21.79, 22.32 [15:37:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [15:38:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 23.13, 21.88, 21.56 [15:39:39] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 19.35, 19.10, 20.36 [15:39:45] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 17.48, 17.54, 20.09 [15:40:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 15.86, 18.68, 20.15 [15:42:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 17.68, 18.46, 20.15 [15:46:54] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 22.60, 22.20, 21.37 [15:47:40] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 27.67, 21.93, 20.63 [15:48:31] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 19.00, 21.13, 20.69 [15:48:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.31, 20.67, 20.13 [15:49:09] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 12.31, 16.58, 19.53 [15:49:40] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 21.05, 20.98, 20.41 [15:50:28] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.52, 22.09, 21.04 [15:50:32] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.04, 22.24, 20.74 [15:50:54] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 13.81, 17.61, 19.71 [15:52:25] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 18.17, 20.53, 20.60 [15:52:32] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.89, 22.64, 21.12 [15:53:40] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 12.69, 17.75, 19.31 [15:54:23] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 14.65, 18.60, 19.90 [15:54:32] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 17.07, 20.24, 20.40 [16:01:27] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 24.96, 21.67, 20.78 [16:03:21] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 17.18, 20.66, 20.58 [16:05:15] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 16.64, 19.25, 20.06 [16:07:55] [02ssl] 07WikiTideSSLBot pushed 031 commit to 03master [+1/-0/±1] 13https://github.com/miraheze/ssl/compare/7b13c2f12212...ca42a0ad23af [16:07:57] [02ssl] 07WikiTideSSLBot 03ca42a0a - Bot: Add SSL cert for wiki.ventistudio.fr [16:10:27] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 16.01, 20.64, 23.47 [16:12:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [16:14:27] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.33, 22.49, 23.57 [16:18:45] RECOVERY - wiki.theluxuryelevator.com - LetsEncrypt on sslhost is OK: OK - Certificate 'wiki.theluxuryelevator.com' will expire on Sat 24 Aug 2024 02:20:41 PM GMT +0000. [16:19:49] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 28.61, 23.24, 20.78 [16:20:00] !log [macfan@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php /srv/mediawiki/1.41/extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=hebwiki --logwiki=metawiki --ignorestatus Ora_&_D Ora (END - exit=32512) [16:20:05] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:20:18] !log [macfan@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php /srv/mediawiki/1.41/extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=hebwiki --logwiki=metawiki --ignorestatus Ora & D Ora (END - exit=32512) [16:20:25] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:20:45] MacFan4000: what does it say [16:21:11] !log [macfan@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php /srv/mediawiki/1.41/extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=hebwiki --logwiki=metawiki --ignorestatus Ora & D Ora (END - exit=32512) [16:21:23] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:21:34] ERROR: Argument is required! [16:22:10] MacFan4000: have you put the username in "" [16:22:16] Is log just removing that [16:22:19] There's a space [16:22:31] yes [16:22:41] !log [macfan@mwtask181] sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php /srv/mediawiki/1.41/extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=hebwiki --logwiki=metawiki --ignorestatus Ora_&_D Ora (END - exit=32512) [16:22:54] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:23:45] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 22.70, 22.98, 21.31 [16:23:52] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.64, 20.56, 18.84 [16:25:29] !log sudo -u www-data php /srv/mediawiki/1.41/maintenance/run.php /srv/mediawiki/1.41/extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=hebwiki --logwiki=metawiki --ignorestatus "Ora_&_D" Ora [16:25:36] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:25:42] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 24.76, 23.54, 21.72 [16:26:30] for whatever reason mwscript removes quotation marks from the actual command that it runs, (doesn't preserve them) [16:26:37] That worked [16:27:00] MacFan4000: you need to double them [16:27:03] So '" [16:27:10] ah [16:27:16] It should warn you though [16:27:37] Doesn't it have a confirm prompt [16:27:52] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 18.28, 22.11, 23.87 [16:27:59] That unblocked it [16:28:05] yes but doesn't warn about quoting [16:28:16] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.21, 22.67, 23.92 [16:28:38] MacFan4000: ye you should carefully check the confirm prompt [16:28:49] With stuff like quotes [16:29:01] Any reason why the rename failed in the first place [16:29:38] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.67, 23.45, 22.24 [16:29:44] most likely due to CU patches not being applied, that's usually what it is [16:29:47] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.40, 23.32, 24.09 [16:29:49] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 17.71, 19.03, 18.89 [16:30:03] MacFan4000: can you apply them? [16:30:14] Also why are they missing to the point it's usual? [16:30:21] Can't we rerun the patch [16:30:48] no idea why it happend, but have seen it on multiple wikis [16:32:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [16:32:12] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.63, 21.98, 23.17 [16:33:23] MacFan4000: thats not good [16:33:38] Surely we can work out a way to test if the patch is applied and if not reapply it [16:33:39] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 17.70, 22.35, 23.72 [16:33:43] Which patch is it MacFan4000 [16:35:31] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 10.75, 17.36, 20.16 [16:35:35] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.81, 24.06, 24.20 [16:36:08] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 18.11, 21.46, 22.85 [16:36:15] Two of them: patch-cu_private_event-def.sql, and patch-cu_changes-add-cuc_only_for_read_old.sql [16:37:31] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 13.73, 20.38, 22.86 [16:40:15] MacFan4000: both should support IF NOT EXISTS [16:40:20] And be safe to run twice [16:43:08] !log [trollpastawiki]> DELETE FROM categorylinks WHERE cl_from=8; [16:43:14] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:43:59] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 13.20, 16.18, 19.94 [16:45:14] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 12.55, 15.67, 19.69 [16:52:59] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.50, 20.46, 19.67 [16:53:11] !log [alex@mwtask181] starting deploy of {'versions': '1.41', 'upgrade_extensions': 'Purge'} to all [16:53:18] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:53:25] !log [alex@mwtask181] finished deploy of {'versions': '1.41', 'upgrade_extensions': 'Purge'} to all - SUCCESS in 13s [16:53:31] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:53:47] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.84, 19.89, 19.65 [16:54:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.09, 21.37, 20.14 [16:55:45] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 17.58, 19.02, 19.36 [16:57:52] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 20.82, 18.30, 15.77 [16:58:56] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 16.56, 20.00, 19.94 [16:59:38] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.20, 21.88, 20.56 [16:59:49] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 13.19, 16.01, 15.22 [17:02:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.14, 20.99, 20.39 [17:04:56] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 14.75, 18.65, 19.61 [17:05:32] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 24.24, 21.84, 20.89 [17:07:30] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.80, 22.17, 21.18 [17:13:24] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.21, 22.93, 21.59 [17:15:22] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 19.87, 21.39, 21.18 [17:23:13] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.15, 18.65, 20.20 [17:29:32] [02mw-config] 07redbluegreenhat opened pull request 03#5576: T11421: Prevent categories from showing up on Special:ShortPages for … - 13https://github.com/miraheze/mw-config/pull/5576 [17:29:43] [02mw-config] 07redbluegreenhat edited pull request 03#5576: T11421: Prevent categories from showing up on Special:ShortPages for tuscriaturaswiki - 13https://github.com/miraheze/mw-config/pull/5576 [17:29:52] [02mw-config] 07redbluegreenhat edited pull request 03#5576: T11421: Prevent categories from showing up on Special:ShortPages for tuscriaturaswiki - 13https://github.com/miraheze/mw-config/pull/5576 [17:30:26] miraheze/mw-config - redbluegreenhat the build passed. [17:30:36] [02mw-config] 07redbluegreenhat synchronize pull request 03#5576: T11421: Prevent categories from showing up on Special:ShortPages for tuscriaturaswiki - 13https://github.com/miraheze/mw-config/pull/5576 [17:31:30] miraheze/mw-config - redbluegreenhat the build passed. [17:31:58] [02mw-config] 07redbluegreenhat closed pull request 03#5576: T11421: Prevent categories from showing up on Special:ShortPages for tuscriaturaswiki - 13https://github.com/miraheze/mw-config/pull/5576 [17:32:01] [02mw-config] 07redbluegreenhat pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mw-config/compare/be5ca9fb2ccf...d839822af3c8 [17:32:02] [02mw-config] 07redbluegreenhat 03d839822 - T11421: Prevent categories from showing up on Special:ShortPages for tuscriaturaswiki (#5576) [17:32:52] !log [@mwtask181] starting deploy of {'config': True} to all [17:32:54] miraheze/mw-config - redbluegreenhat the build passed. [17:32:57] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:33:05] !log [@mwtask181] finished deploy of {'config': True} to all - SUCCESS in 12s [17:33:10] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:35:09] [02mw-config] 07redbluegreenhat opened pull request 03#5577: fix dbname - 13https://github.com/miraheze/mw-config/pull/5577 [17:35:17] [02mw-config] 07redbluegreenhat closed pull request 03#5577: fix dbname - 13https://github.com/miraheze/mw-config/pull/5577 [17:35:20] [02mw-config] 07redbluegreenhat pushed 031 commit to 03master [+0/-0/±1] 13https://github.com/miraheze/mw-config/compare/d839822af3c8...6cbcb139d501 [17:35:22] [02mw-config] 07redbluegreenhat 036cbcb13 - fix dbname (#5577) [17:36:00] miraheze/mw-config - redbluegreenhat the build passed. [17:36:14] miraheze/mw-config - redbluegreenhat the build passed. [17:37:42] !log [@mwtask171] starting deploy of {'config': True} to all [17:38:07] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:38:10] !log [@mwtask171] finished deploy of {'config': True} to all - SUCCESS in 28s [17:38:21] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:50:58] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.18, 20.33, 18.18 [17:52:58] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 14.81, 17.96, 17.56 [17:53:42] !log [@test151] starting deploy of {'config': True} to test151 [17:53:43] !log [@test151] finished deploy of {'config': True} to test151 - SUCCESS in 0s [17:53:47] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:53:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:02:22] !log [@mwtask181] starting deploy of {'config': True} to all [18:02:27] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:02:33] !log [@mwtask181] finished deploy of {'config': True} to all - SUCCESS in 11s [18:02:38] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:25:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 18.94, 20.93, 18.83 [18:28:58] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.93, 19.78, 18.35 [18:29:16] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.56, 22.16, 19.37 [18:29:49] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 24.24, 19.22, 15.60 [18:30:58] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.73, 23.03, 19.71 [18:31:43] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.48, 23.66, 20.45 [18:31:48] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 22.40, 21.07, 16.77 [18:32:45] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 26.87, 23.16, 18.39 [18:33:33] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.26, 21.98, 17.86 [18:33:48] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 18.14, 19.76, 16.80 [18:34:05] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 23.27, 21.66, 17.24 [18:34:05] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.18, 18.78, 14.96 [18:34:43] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 20.68, 22.54, 18.78 [18:36:41] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.09, 24.54, 19.94 [18:38:00] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 15.45, 19.46, 17.39 [18:39:33] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 19.25, 23.36, 20.18 [18:40:05] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 18.64, 21.24, 17.65 [18:41:33] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 13.56, 19.99, 19.33 [18:42:34] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 10.28, 20.00, 19.85 [18:42:58] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 9.61, 22.10, 22.43 [18:43:01] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 10.82, 22.63, 23.03 [18:43:18] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 9.55, 21.10, 22.33 [18:44:05] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 5.76, 15.02, 16.20 [18:44:58] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 5.93, 16.54, 20.34 [18:45:14] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 6.51, 16.11, 20.36 [18:46:56] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 7.08, 13.82, 19.28 [19:16:15] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 26.76, 22.23, 19.02 [19:18:40] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.47, 22.37, 18.81 [19:19:39] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.95, 20.26, 17.36 [19:21:37] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 27.13, 22.87, 18.68 [19:22:19] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 22.93, 19.05, 15.94 [19:24:16] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 18.78, 19.59, 16.58 [19:26:05] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 19.38, 21.30, 17.61 [19:28:26] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 18.80, 20.62, 16.98 [19:28:27] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 28.08, 21.25, 16.35 [19:30:04] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 27.53, 23.42, 18.96 [19:30:05] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 28.80, 23.52, 19.13 [19:30:20] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 19.16, 20.01, 17.17 [19:33:36] PROBLEM - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.andreijiroh.uk.eu.org All nameservers failed to answer the query. [19:34:09] PROBLEM - mw151 Current Load on mw151 is CRITICAL: LOAD CRITICAL - total load average: 27.00, 22.00, 18.43 [19:36:17] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 20.85, 22.66, 19.33 [19:40:12] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 26.57, 23.24, 20.21 [19:41:46] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 16.76, 22.89, 22.24 [19:43:14] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 28.78, 22.33, 18.94 [19:43:43] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 24.29, 23.05, 22.32 [19:46:05] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 21.68, 23.16, 21.29 [19:46:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [19:48:03] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 26.27, 23.87, 21.73 [19:54:49] PROBLEM - cp27 Varnish Backends on cp27 is CRITICAL: 2 backends are down. mw181 mw182 [20:00:48] RECOVERY - cp27 Varnish Backends on cp27 is OK: All 19 backends are healthy [20:10:58] PROBLEM - mw161 Current Load on mw161 is WARNING: LOAD WARNING - total load average: 13.68, 20.91, 23.90 [20:11:33] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 12.07, 19.25, 23.77 [20:11:34] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 12.41, 18.20, 22.61 [20:11:48] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 13.69, 19.07, 23.17 [20:12:05] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 9.21, 16.23, 22.38 [20:15:34] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 7.06, 12.14, 19.12 [20:15:58] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 12.37, 18.06, 23.73 [20:16:05] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 8.50, 12.10, 19.28 [20:16:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [20:16:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 9.35, 16.55, 23.54 [20:16:58] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 6.31, 11.51, 18.73 [20:16:58] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 10.07, 16.48, 23.12 [20:17:33] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 9.25, 12.25, 19.11 [20:17:49] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 14.35, 15.67, 20.40 [20:20:58] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 9.33, 12.32, 19.88 [20:21:58] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 9.97, 12.81, 19.56 [20:22:56] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 10.18, 12.00, 19.20 [20:38:58] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.45, 21.97, 18.72 [20:39:58] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.67, 19.31, 17.70 [20:40:58] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 22.16, 22.03, 19.16 [20:41:58] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 24.24, 21.13, 18.56 [20:42:56] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 24.72, 20.31, 18.20 [20:42:58] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 28.81, 24.61, 20.46 [20:43:33] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.10, 19.14, 16.17 [20:43:58] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 21.50, 21.58, 19.08 [20:44:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 21.04, 20.04, 18.33 [20:45:33] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 14.39, 17.23, 15.83 [20:45:58] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 27.89, 23.82, 20.19 [20:46:56] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.43, 23.02, 19.62 [20:53:39] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 25.08, 22.93, 19.17 [20:54:05] PROBLEM - mw171 Current Load on mw171 is WARNING: LOAD WARNING - total load average: 20.57, 18.87, 16.24 [20:55:36] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 14.12, 18.96, 18.14 [20:55:49] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 23.50, 21.27, 17.71 [20:56:05] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 19.60, 19.68, 16.88 [20:56:21] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 21.44, 21.50, 18.68 [20:56:22] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 25.53, 22.83, 19.02 [20:57:48] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 17.07, 20.34, 17.86 [20:58:18] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 24.49, 22.40, 19.33 [21:00:15] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 23.72, 22.69, 19.79 [21:00:17] PROBLEM - mw152 Current Load on mw152 is WARNING: LOAD WARNING - total load average: 16.54, 22.39, 19.94 [21:01:28] PROBLEM - mw161 Current Load on mw161 is CRITICAL: LOAD CRITICAL - total load average: 26.27, 23.47, 20.27 [21:01:30] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [21:02:12] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 25.48, 22.97, 20.19 [21:03:39] RECOVERY - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is OK: SSL OK - wiki.andreijiroh.uk.eu.org reverse DNS resolves to cp36.wikitide.net - CNAME OK [21:04:09] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 15.32, 20.77, 19.79 [21:05:23] RECOVERY - mw161 Current Load on mw161 is OK: LOAD OK - total load average: 15.19, 20.32, 19.85 [21:05:48] PROBLEM - mw151 Current Load on mw151 is WARNING: LOAD WARNING - total load average: 21.96, 21.90, 19.25 [21:06:06] PROBLEM - mw162 Current Load on mw162 is CRITICAL: LOAD CRITICAL - total load average: 26.83, 23.30, 20.82 [21:06:10] PROBLEM - mw152 Current Load on mw152 is CRITICAL: LOAD CRITICAL - total load average: 25.21, 22.15, 20.45 [21:07:03] PROBLEM - mw171 Current Load on mw171 is CRITICAL: LOAD CRITICAL - total load average: 24.59, 22.57, 19.35 [21:07:49] RECOVERY - mw151 Current Load on mw151 is OK: LOAD OK - total load average: 15.88, 19.92, 18.85 [21:08:04] PROBLEM - mw162 Current Load on mw162 is WARNING: LOAD WARNING - total load average: 17.89, 22.45, 20.89 [21:08:07] RECOVERY - mw152 Current Load on mw152 is OK: LOAD OK - total load average: 12.84, 19.79, 19.89 [21:09:02] RECOVERY - mw171 Current Load on mw171 is OK: LOAD OK - total load average: 8.29, 17.08, 17.74 [21:09:58] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 8.26, 20.31, 23.93 [21:10:00] RECOVERY - mw162 Current Load on mw162 is OK: LOAD OK - total load average: 6.90, 16.74, 18.98 [21:10:56] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 6.03, 17.27, 22.36 [21:10:58] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 8.06, 18.21, 22.92 [21:11:30] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [21:13:58] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 7.57, 13.22, 20.16 [21:14:56] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 7.78, 11.91, 18.96 [21:14:58] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 7.45, 12.13, 19.31 [21:38:56] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 26.49, 20.06, 16.27 [21:39:58] PROBLEM - mw172 Current Load on mw172 is CRITICAL: LOAD CRITICAL - total load average: 25.88, 20.30, 17.47 [21:40:56] RECOVERY - mw181 Current Load on mw181 is OK: LOAD OK - total load average: 15.15, 17.50, 15.77 [21:40:58] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 29.20, 22.06, 17.63 [21:41:58] PROBLEM - mw172 Current Load on mw172 is WARNING: LOAD WARNING - total load average: 20.18, 20.81, 18.06 [21:42:58] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 12.08, 19.05, 17.15 [21:43:58] RECOVERY - mw172 Current Load on mw172 is OK: LOAD OK - total load average: 8.14, 15.93, 16.60 [22:05:41] PROBLEM - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.andreijiroh.uk.eu.org All nameservers failed to answer the query. [22:07:25] [Grafana] !tech FIRING: The mediawiki job queue has more than 500 unclaimed jobs https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [22:35:43] RECOVERY - wiki.andreijiroh.uk.eu.org - reverse DNS on sslhost is OK: SSL OK - wiki.andreijiroh.uk.eu.org reverse DNS resolves to cp37.wikitide.net - CNAME OK