[00:49:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.50, 22.66, 24.00 [00:51:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.54, 23.81, 24.26 [00:53:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 20.69, 22.73, 23.83 [01:05:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 28.39, 23.78, 23.27 [01:11:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 15.54, 22.33, 23.15 [01:13:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 27.60, 24.60, 23.88 [01:19:34] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.12, 22.09, 19.52 [01:23:28] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.77, 18.91, 18.83 [01:45:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 23.87, 22.22, 23.85 [01:51:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 29.04, 24.60, 24.20 [01:59:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 18.50, 22.86, 23.91 [02:03:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.16, 23.11, 23.70 [02:05:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.76, 23.09, 23.64 [02:07:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.06, 23.05, 23.50 [02:11:20] PROBLEM - mw181 Current Load on mw181 is WARNING: LOAD WARNING - total load average: 22.83, 23.68, 23.63 [02:13:20] PROBLEM - mw181 Current Load on mw181 is CRITICAL: LOAD CRITICAL - total load average: 25.73, 24.24, 23.83 [03:04:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.03, 3.45, 1.41 [03:04:32] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [03:05:32] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:06:20] [Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:06:26] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.066 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [03:07:34] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [03:08:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.26, 3.50, 1.97 [03:10:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.69, 3.65, 2.19 [03:11:20] [Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:11:54] PROBLEM - prometheus151 SSH on prometheus151 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [03:15:58] RECOVERY - prometheus151 SSH on prometheus151 is OK: SSH OK - OpenSSH_9.2p1 Debian-2+deb12u3 (protocol 2.0) [03:16:32] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.39, 3.44, 2.61 [03:18:32] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.97, 3.19, 2.64 [03:21:42] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 25.37, 21.62, 19.36 [03:23:38] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 20.92, 21.64, 19.66 [03:27:19] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.89, 3.72, 3.13 [03:27:32] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 19.08, 20.09, 19.46 [03:29:14] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.27, 4.18, 3.35 [03:31:09] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.52, 3.58, 3.21 [03:33:04] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.01, 4.22, 3.50 [03:34:59] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.42, 3.47, 3.31 [03:36:54] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.51, 3.01, 3.18 [03:39:49] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [03:40:48] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 3.55, 4.09, 3.62 [03:42:43] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.31, 3.50, 3.46 [03:43:55] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 7.222 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [03:44:37] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.98, 3.21, 3.34 [03:45:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:54:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.42, 4.75, 3.83 [03:55:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [03:56:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.29, 3.80, 3.58 [03:58:33] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.23, 5.13, 4.09 [04:02:32] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.77, 3.51, 3.68 [04:08:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 4.77, 3.73, 3.70 [04:12:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.93, 3.55, 3.68 [04:16:31] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.47, 2.80, 3.32 [04:21:21] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.07, 20.56, 19.12 [04:23:18] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.74, 19.73, 18.98 [04:27:00] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:32:00] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [04:34:32] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [04:35:09] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.33, 4.52, 3.69 [04:38:59] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.87, 3.38, 3.38 [04:42:40] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.069 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [04:42:50] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 1.90, 3.59, 3.54 [04:44:44] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.61, 4.40, 3.84 [04:50:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.48, 3.58, 3.66 [04:58:32] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.01, 4.17, 3.82 [05:00:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.60, 3.36, 3.56 [05:02:32] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 2.56, 2.92, 3.37 [05:05:01] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [05:06:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.02, 5.11, 4.16 [05:07:04] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 7.331 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [05:07:34] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:14:52] [02mediawiki-repos] 07Universal-Omega commented on pull request 03#31: T12394: Install UserProfileV2 - 13https://github.com/miraheze/mediawiki-repos/pull/31#issuecomment-2287872147 [05:15:25] [02mediawiki-repos] 07Universal-Omega edited a comment on pull request 03#31: T12394: Install UserProfileV2 - 13https://github.com/miraheze/mediawiki-repos/pull/31#issuecomment-2287872147 [05:18:18] PROBLEM - ns2 Puppet on ns2 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. [05:19:25] [02mediawiki-repos] 07songnguxyz commented on pull request 03#31: T12394: Install UserProfileV2 - 13https://github.com/miraheze/mediawiki-repos/pull/31#issuecomment-2287876474 [05:20:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.80, 3.63, 3.94 [05:22:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 7.69, 5.13, 4.45 [05:25:15] PROBLEM - ping6 on cp26 is CRITICAL: PING CRITICAL - Packet loss = 28%, RTA = 189.43 ms [05:26:32] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.35, 3.72, 3.99 [05:27:34] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:28:51] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 23.08, 20.71, 18.25 [05:30:32] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.98, 4.37, 4.13 [05:30:51] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 15.09, 18.63, 17.80 [05:32:33] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.08, 3.73, 3.93 [05:35:33] RECOVERY - ping6 on cp26 is OK: PING OK - Packet loss = 0%, RTA = 189.17 ms [05:40:32] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.39, 3.70, 3.74 [05:42:32] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 2.67, 3.09, 3.51 [05:46:31] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 6.54, 4.38, 3.90 [05:51:06] PROBLEM - ping6 on cp26 is CRITICAL: PING CRITICAL - Packet loss = 16%, RTA = 188.46 ms [05:54:20] [Grafana] FIRING: There has been a rise in the MediaWiki exception rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [05:55:53] PROBLEM - prometheus151 PowerDNS Recursor on prometheus151 is CRITICAL: CRITICAL - Plugin timed out while executing system call [05:57:17] RECOVERY - ping6 on cp26 is OK: PING OK - Packet loss = 0%, RTA = 188.19 ms [05:57:47] RECOVERY - prometheus151 PowerDNS Recursor on prometheus151 is OK: DNS OK: 0.065 seconds response time. wikitide.net returns 2602:294:0:b13::110,2602:294:0:b23::112,38.46.223.205,38.46.223.206 [05:59:20] [Grafana] FIRING: Some MediaWiki Appservers are running out of PHP-FPM workers. https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1[Grafana] RESOLVED: MediaWiki Exception Rate https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:01:32] PROBLEM - ping6 on cp26 is CRITICAL: PING CRITICAL - Packet loss = 28%, RTA = 186.98 ms [06:04:20] [Grafana] RESOLVED: PHP-FPM Worker Usage High https://grafana.wikitide.net/d/GtxbP1Xnk?orgId=1 [06:09:49] RECOVERY - ping6 on cp26 is OK: PING OK - Packet loss = 0%, RTA = 187.13 ms [06:17:08] PROBLEM - ping6 on cp26 is CRITICAL: PING CRITICAL - Packet loss = 28%, RTA = 188.62 ms [06:19:31] PROBLEM - mw182 Current Load on mw182 is CRITICAL: LOAD CRITICAL - total load average: 26.59, 21.98, 19.42 [06:22:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.26, 3.04, 3.81 [06:23:18] RECOVERY - ping6 on cp26 is OK: PING OK - Packet loss = 0%, RTA = 187.82 ms [06:23:25] PROBLEM - mw182 Current Load on mw182 is WARNING: LOAD WARNING - total load average: 21.24, 23.20, 20.64 [06:26:32] PROBLEM - prometheus151 Current Load on prometheus151 is CRITICAL: LOAD CRITICAL - total load average: 5.13, 3.82, 3.94 [06:27:18] RECOVERY - mw182 Current Load on mw182 is OK: LOAD OK - total load average: 18.14, 19.93, 19.84 [06:28:31] PROBLEM - prometheus151 Current Load on prometheus151 is WARNING: LOAD WARNING - total load average: 3.23, 3.50, 3.81 [06:32:31] RECOVERY - prometheus151 Current Load on prometheus151 is OK: LOAD OK - total load average: 1.54, 2.67, 3.40 [06:32:31] PROBLEM - ns2 NTP time on ns2 is UNKNOWN: check_ntp_time: Invalid hostname/address - time.cloudflare.comUsage: check_ntp_time -H [-4|-6] [-w ] [-c ] [-v verbose] [-o