[00:09:09] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.47, 3.88, 3.23 [00:14:14] PROBLEM - cp31 Current Load on cp31 is CRITICAL: CRITICAL - load average: 1.76, 2.10, 1.51 [00:15:07] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.38, 3.59, 3.40 [00:18:11] RECOVERY - cp31 Current Load on cp31 is OK: OK - load average: 1.04, 1.63, 1.45 [00:21:06] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.58, 4.15, 3.63 [00:37:54] PROBLEM - cp31 Current Load on cp31 is CRITICAL: CRITICAL - load average: 2.11, 2.54, 1.70 [00:39:01] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.64, 3.81, 3.87 [00:39:52] PROBLEM - cp31 Current Load on cp31 is WARNING: WARNING - load average: 1.02, 1.97, 1.60 [00:41:00] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.56, 4.23, 4.02 [00:41:49] RECOVERY - cp31 Current Load on cp31 is OK: OK - load average: 0.64, 1.53, 1.48 [01:18:49] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.18, 3.59, 3.97 [01:20:48] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.52, 4.35, 4.19 [01:24:47] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.98, 3.46, 3.87 [01:32:45] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.03, 3.45, 3.66 [01:34:44] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.83, 3.41, 3.61 [01:40:42] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.05, 3.34, 3.47 [01:42:42] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.33, 3.33, 3.45 [01:44:41] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.02, 3.20, 3.39 [01:52:38] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.95, 4.12, 3.64 [01:58:36] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.51, 3.51, 3.57 
[02:02:35] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.20, 3.72, 3.62 [02:04:34] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.20, 3.51, 3.55 [02:10:33] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.60, 4.51, 3.95 [02:15:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.69, 7.05, 5.95 [02:17:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.51, 6.46, 5.86 [02:20:13] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 9.26, 7.00, 5.68 [02:21:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 9.09, 8.18, 6.71 [02:22:11] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 6.12, 6.70, 5.74 [02:22:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 6.77, 8.47, 6.50 [02:23:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 4.62, 6.94, 6.44 [02:24:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 4.06, 6.99, 6.20 [02:25:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.60, 6.55, 6.36 [02:26:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.00, 6.05, 5.96 [02:29:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 7.79, 7.50, 6.76 [02:31:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.06, 6.61, 6.53 [02:57:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 7.16, 6.99, 6.03 [02:59:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.38, 6.00, 5.78 [03:04:18] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.89, 3.69, 3.99 [03:22:13] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.43, 3.92, 3.73 [03:22:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 7.11, 6.97, 5.46 [03:24:36] RECOVERY - mw9 Current Load on mw9 is OK: 
OK - load average: 5.23, 6.27, 5.39 [03:26:12] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.60, 3.85, 3.75 [03:28:12] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 6.19, 4.37, 3.93 [03:30:12] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.36, 3.87, 3.80 [03:46:12] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.35, 3.65, 3.63 [03:48:12] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.72, 3.88, 3.74 [03:58:34] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.85, 6.52, 5.25 [04:00:30] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.69, 6.07, 5.23 [04:03:12] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 7.64, 6.50, 4.88 [04:03:49] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.02, 6.57, 4.96 [04:04:12] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.79, 3.21, 3.40 [04:05:49] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 4.46, 5.72, 4.84 [04:07:12] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 3.97, 5.36, 4.80 [04:17:09] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.66, 3.55, 3.43 [04:21:19] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 7.59, 6.53, 5.42 [04:23:13] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 3.56, 5.32, 5.10 [04:27:06] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.95, 3.84, 3.58 [04:29:06] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.80, 3.42, 3.45 [04:31:05] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.07, 3.27, 3.39 [04:44:59] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.99, 3.32, 3.29 [04:46:59] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.91, 3.30, 3.29 [05:21:52] PROBLEM - 
mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.93, 3.54, 3.17 [05:25:51] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.44, 3.08, 3.07 [05:31:48] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.41, 3.59, 3.23 [05:33:47] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.90, 3.47, 3.24 [05:35:47] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.02, 3.62, 3.32 [05:37:46] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.10, 3.23, 3.20 [06:03:45] RECOVERY - db13 APT on db13 is OK: APT OK: 9 packages available for upgrade (0 critical updates). [06:08:50] RECOVERY - mem2 APT on mem2 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [06:09:51] RECOVERY - mail2 APT on mail2 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [06:13:44] RECOVERY - ns2 APT on ns2 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [06:14:20] RECOVERY - gluster3 APT on gluster3 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [06:22:31] RECOVERY - puppet3 APT on puppet3 is OK: APT OK: 4 packages available for upgrade (0 critical updates). [06:27:16] RECOVERY - cp20 APT on cp20 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [06:28:27] RECOVERY - bacula2 APT on bacula2 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [06:28:30] RECOVERY - graylog2 APT on graylog2 is OK: APT OK: 4 packages available for upgrade (0 critical updates). [06:35:39] RECOVERY - ns1 APT on ns1 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [06:37:09] RECOVERY - mem1 APT on mem1 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [06:41:58] RECOVERY - db11 APT on db11 is OK: APT OK: 9 packages available for upgrade (0 critical updates). 
[06:42:59] RECOVERY - phab2 APT on phab2 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [06:45:43] RECOVERY - gluster4 APT on gluster4 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [06:51:13] RECOVERY - mon2 APT on mon2 is OK: APT OK: 19 packages available for upgrade (0 critical updates). [06:53:20] RECOVERY - ldap2 APT on ldap2 is OK: APT OK: 1 packages available for upgrade (0 critical updates). [07:00:36] RECOVERY - db12 APT on db12 is OK: APT OK: 9 packages available for upgrade (0 critical updates). [07:08:31] PROBLEM - db12 Current Load on db12 is CRITICAL: CRITICAL - load average: 9.30, 11.04, 6.76 [07:10:30] PROBLEM - db12 Current Load on db12 is WARNING: WARNING - load average: 1.79, 7.64, 6.03 [07:12:29] RECOVERY - db12 Current Load on db12 is OK: OK - load average: 1.63, 5.69, 5.51 [07:21:31] PROBLEM - mw9 APT on mw9 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [07:22:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 12.73, 8.92, 5.51 [07:23:27] RECOVERY - mw9 APT on mw9 is OK: APT OK: 1 packages available for upgrade (0 critical updates). 
[07:24:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.33, 7.42, 5.38 [07:26:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.63, 6.32, 5.21 [07:29:24] !log [rhinos@mwtask1] sudo -u www-data php /srv/mediawiki/w/maintenance/purgeList.php --wiki=gratisdatawiki --all-namespaces --db-touch --delay=1 (END - exit=2) [07:29:32] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [07:29:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.64, 6.66, 4.96 [07:31:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.69, 5.89, 4.89 [07:34:28] [puppet] RhinosF1 opened pull request #2138: mwscript: mark purgeList.php long - https://git.io/JMV9n [07:34:38] Reception123: ^ [07:38:37] Why does Mobile.css not work here on Miraheze? [07:45:02] There's a task somewhere [07:45:33] [puppet] Reception123 closed pull request #2138: mwscript: mark purgeList.php long - https://git.io/JMV9n [07:45:35] [miraheze/puppet] Reception123 pushed 1 commit to master [+0/-0/±1] https://git.io/JMVQy [07:45:36] [miraheze/puppet] RhinosF1 b1fad71 - mwscript: mark purgeList.php long (#2138) [07:56:08] RhinosF1 🥲 Okay [07:56:21] It's really a pain in the ass [10:01:44] !log [rhinos@mwtask1] sudo -u www-data php /srv/mediawiki/w/maintenance/purgeList.php --wiki=gratisdatawiki --all-namespaces --db-touch --delay=1 --verbose (END - exit=0) [10:01:47] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [10:06:52] .in 9hours do 09:44:00 !log [urbanecm@mwmaint1002 ~]$ foreachwiki extensions/CheckUser/maintenance/fixTrailingSpacesInLogs.php for mh [10:06:52] RhinosF1: Okay, will remind at 2021-12-01 - 19:06:52GMT [10:48:25] PROBLEM - wiki.thedev.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.thedev.gq All nameservers failed to answer the query. 
[10:49:55] PROBLEM - wikidiffs.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/python3/di [10:49:55] ckages/dns/resolver.py", line 992, in query timeout = self._compute_timeout(start, lifetime) File "/usr/lib/python3/dist-packages/dns/resolver.py", line 799, in _compute_timeout raise Timeout(timeout=duration)dns.exception.Timeout: The DNS operation timed out after 30.00318169593811 seconds [10:50:07] PROBLEM - otcg.ml - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/dist-p [10:50:07] es/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query otcg.ml. 
IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [10:50:14] PROBLEM - wiki.apadotech.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/pyth [10:50:14] ist-packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query apadotech.ga. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [10:50:17] PROBLEM - matttest.mtwiki.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - matttest.mtwiki.cf All nameservers failed to answer the query. [10:51:25] PROBLEM - miraheze.ml - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - miraheze.ml All nameservers failed to answer the query. [10:51:39] PROBLEM - wikileague.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wikileague.cf All nameservers failed to answer the query. [10:51:47] PROBLEM - diamowiki.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - diamowiki.ga All nameservers failed to answer the query. [10:51:53] PROBLEM - miraheze.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - miraheze.gq All nameservers failed to answer the query. [10:53:45] PROBLEM - wiki-asterix.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki-asterix.cf All nameservers failed to answer the query. 
[10:54:09] PROBLEM - miraheze.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - miraheze.ga All nameservers failed to answer the query. [10:54:15] PROBLEM - ircwiki.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/dis [10:54:15] kages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query ircwiki.cf. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [10:56:48] PROBLEM - wikidiffs.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wikidiffs.ga All nameservers failed to answer the query. [10:56:57] PROBLEM - otcg.ml - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - otcg.ml All nameservers failed to answer the query. [10:58:37] PROBLEM - diamowiki.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/d [10:58:37] ackages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query diamowiki.ga. 
IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [10:58:57] PROBLEM - wikileague.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/ [10:58:57] packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wikileague.cf. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:00:51] PROBLEM - wiki-asterix.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python [11:00:51] t-packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wiki-asterix.cf. 
IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:01:17] PROBLEM - miraheze.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/di [11:01:17] ckages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query miraheze.ga. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:01:37] PROBLEM - pj-masks-info.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/pytho [11:01:37] st-packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query pj-masks-info.cf. 
IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:02:15] RECOVERY - wiki.thedev.gq - reverse DNS on sslhost is OK: SSL OK - wiki.thedev.gq reverse DNS resolves to cp21.miraheze.org - CNAME OK [11:02:31] PROBLEM - storytime.jdstroy.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - storytime.jdstroy.cf All nameservers failed to answer the query. [11:03:05] PROBLEM - wiki.konjitownmc.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.konjitownmc.cf All nameservers failed to answer the query. [11:03:15] PROBLEM - linkwiki.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - linkwiki.cf All nameservers failed to answer the query. [11:03:50] RECOVERY - otcg.ml - reverse DNS on sslhost is OK: SSL OK - otcg.ml reverse DNS resolves to cp20.miraheze.org - NS RECORDS OK [11:04:44] PROBLEM - memipedia.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - memipedia.ga All nameservers failed to answer the query. 
[11:04:46] PROBLEM - matttest.mtwiki.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/pyt [11:04:46] dist-packages/dns/resolver.py", line 992, in query timeout = self._compute_timeout(start, lifetime) File "/usr/lib/python3/dist-packages/dns/resolver.py", line 799, in _compute_timeout raise Timeout(timeout=duration)dns.exception.Timeout: The DNS operation timed out after 30.00181770324707 seconds [11:05:03] PROBLEM - miraheze.ml - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/di [11:05:04] ckages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query miraheze.ml. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:05:37] RECOVERY - diamowiki.ga - reverse DNS on sslhost is OK: SSL OK - diamowiki.ga reverse DNS resolves to cp20.miraheze.org - CNAME OK [11:06:34] PROBLEM - lakehub.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - lakehub.ga All nameservers failed to answer the query. [11:07:47] PROBLEM - wiki-asterix.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki-asterix.cf All nameservers failed to answer the query. 
[11:08:22] PROBLEM - miraheze.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - miraheze.ga All nameservers failed to answer the query. [11:08:49] PROBLEM - pj-masks-info.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - pj-masks-info.cf All nameservers failed to answer the query. [11:10:06] PROBLEM - wiki.konjitownmc.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/pyt [11:10:06] dist-packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wiki.konjitownmc.cf. IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS [11:10:06] ation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:10:29] PROBLEM - wiki.ripto.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.ripto.gq All nameservers failed to answer the query. [11:10:32] RECOVERY - wikidiffs.ga - reverse DNS on sslhost is OK: SSL OK - wikidiffs.ga reverse DNS resolves to cp20.miraheze.org - CNAME FLAT [11:11:23] PROBLEM - wiki.thedev.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.thedev.gq All nameservers failed to answer the query. 
[11:11:47] PROBLEM - data.memipedia.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/pytho [11:11:47] st-packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query data.memipedia.ga. IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:11:51] PROBLEM - miraheze.ml - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - miraheze.ml All nameservers failed to answer the query. [11:11:57] PROBLEM - wiki.apadotech.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.apadotech.ga All nameservers failed to answer the query. [11:12:48] PROBLEM - otcg.ml - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - otcg.ml All nameservers failed to answer the query. 
[11:13:56] PROBLEM - lakehub.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 116, in main rdns_hostname = get_reverse_dnshostname(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 101, in get_reverse_dnshostname resolved_ip_addr = str(dns_resolver.query(hostname, 'A [11:13:56] ) File "/usr/lib/python3/dist-packages/dns/resolver.py", line 992, in query timeout = self._compute_timeout(start, lifetime) File "/usr/lib/python3/dist-packages/dns/resolver.py", line 799, in _compute_timeout raise Timeout(timeout=duration)dns.exception.Timeout: The DNS operation timed out after 30.003780126571655 seconds [11:15:00] PROBLEM - ircwiki.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - ircwiki.cf All nameservers failed to answer the query. [11:15:22] PROBLEM - diamowiki.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - diamowiki.ga All nameservers failed to answer the query. [11:15:31] PROBLEM - miraheze.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/di [11:15:31] ckages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query miraheze.ga. 
IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:15:42] PROBLEM - wiki.ct777.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.ct777.cf All nameservers failed to answer the query. [11:16:56] PROBLEM - wiki.konjitownmc.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.konjitownmc.cf All nameservers failed to answer the query. [11:17:16] PROBLEM - linkwiki.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/python3/dis [11:17:16] kages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query linkwiki.cf. 
IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:17:29] PROBLEM - wiki.ripto.gq - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/ [11:17:29] packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query ripto.gq. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed ou [11:17:29] erver 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:17:48] paladox: ^ [11:17:58] They are all freenom domains [11:18:37] RECOVERY - wiki.thedev.gq - reverse DNS on sslhost is OK: SSL OK - wiki.thedev.gq reverse DNS resolves to cp20.miraheze.org - CNAME OK [11:18:52] PROBLEM - miraheze.ml - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/di [11:18:52] ckages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, 
errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query miraheze.ml. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:19:10] RECOVERY - memipedia.ga - reverse DNS on sslhost is OK: SSL OK - memipedia.ga reverse DNS resolves to cp21.miraheze.org - CNAME FLAT [11:19:12] PROBLEM - matttest.mtwiki.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - matttest.mtwiki.cf All nameservers failed to answer the query. [11:19:28] PROBLEM - gp.ct777.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - gp.ct777.cf All nameservers failed to answer the query. [11:19:42] RECOVERY - miraheze.gq - reverse DNS on sslhost is OK: SSL OK - miraheze.gq reverse DNS resolves to cp20.miraheze.org - NS RECORDS OK [11:19:42] PROBLEM - wikidiffs.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wikidiffs.ga All nameservers failed to answer the query. [11:20:52] PROBLEM - lakehub.ga - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - lakehub.ga All nameservers failed to answer the query. 
[11:22:22] !log downtime sslhost until 15:21 [11:22:26] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [11:22:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 6.46, 7.94, 5.53 [11:24:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.32, 6.51, 5.29 [11:26:50] PROBLEM - wikidiffs.ga - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/python3/di [11:26:51] ckages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wikidiffs.ga. IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation tim [11:26:51] t.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:31:24] RECOVERY - wiki.ripto.gq - reverse DNS on sslhost is OK: SSL OK - wiki.ripto.gq reverse DNS resolves to cp20.miraheze.org - CNAME OK [11:40:19] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 9.41, 7.45, 5.51 [11:40:45] RECOVERY - wikidiffs.ga - reverse DNS on sslhost is OK: SSL OK - wikidiffs.ga reverse DNS resolves to cp20.miraheze.org - CNAME FLAT [11:40:50] RECOVERY - wikileague.cf - reverse DNS on sslhost is OK: SSL OK - wikileague.cf reverse DNS resolves to cp20.miraheze.org - NS RECORDS OK [11:44:04] PROBLEM - wiki.thedev.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.thedev.gq All nameservers failed to answer the query. 
[11:46:08] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 7.12, 7.84, 6.35 [11:49:51] PROBLEM - wikileague.cf - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wikileague.cf All nameservers failed to answer the query. [11:50:06] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 12.24, 8.48, 6.85 [11:50:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 23.43, 12.71, 7.93 [11:50:38] PROBLEM - wiki.ripto.gq - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/python3/d [11:50:38] ackages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wiki.ripto.gq. 
IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:51:10] RECOVERY - wiki.thedev.gq - reverse DNS on sslhost is OK: SSL OK - wiki.thedev.gq reverse DNS resolves to cp20.miraheze.org - CNAME OK [11:54:00] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.82, 7.61, 6.87 [11:55:56] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 13.91, 10.96, 8.23 [11:56:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 4.66, 7.68, 7.22 [11:56:54] PROBLEM - wikileague.cf - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/ [11:56:54] packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wikileague.cf. IN NS: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [11:57:40] PROBLEM - wiki.ripto.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.ripto.gq All nameservers failed to answer the query. [12:00:27] PROBLEM - wiki.thedev.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.thedev.gq All nameservers failed to answer the query. 
[12:01:46] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.76, 7.72, 7.58 [12:02:59] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.18, 3.02, 2.59 [12:04:43] RECOVERY - wiki.ripto.gq - reverse DNS on sslhost is OK: SSL OK - wiki.ripto.gq reverse DNS resolves to cp20.miraheze.org - CNAME OK [12:04:58] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.07, 2.94, 2.61 [12:07:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.71, 5.73, 6.74 [12:08:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 6.05, 6.42, 6.74 [12:13:12] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 10.73, 8.08, 5.67 [12:14:13] PROBLEM - wiki.ripto.gq - reverse DNS on sslhost is CRITICAL: rDNS CRITICAL - wiki.ripto.gq All nameservers failed to answer the query. [12:14:31] PROBLEM - wiki.thedev.gq - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 79, in check_records cname = str(dns_resolver.query(hostname, 'CNAME')[0]) File "/usr/lib/python3/ [12:14:31] packages/dns/resolver.py", line 898, in query raise NoNameservers(request=request, errors=errors)dns.resolver.NoNameservers: All nameservers failed to answer the query wiki.thedev.gq. 
IN CNAME: Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered The DNS operation timed out.; Server 1.1.1.1 UDP port 53 answered SERVFAIL [12:14:39] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.07, 6.03, 4.89 [12:15:12] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 6.06, 7.24, 5.66 [12:16:34] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 6.17, 6.08, 5.05 [12:19:12] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 4.40, 6.08, 5.57 [12:20:56] RECOVERY - wiki.ripto.gq - reverse DNS on sslhost is OK: SSL OK - wiki.ripto.gq reverse DNS resolves to cp21.miraheze.org - CNAME OK [12:21:13] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 15.85, 12.57, 8.99 [12:21:18] RECOVERY - wiki.thedev.gq - reverse DNS on sslhost is OK: SSL OK - wiki.thedev.gq reverse DNS resolves to cp21.miraheze.org - CNAME OK [12:21:27] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 13.74, 9.34, 7.50 [12:25:21] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.13, 7.64, 7.24 [12:25:32] RECOVERY - wikileague.cf - reverse DNS on sslhost is OK: SSL OK - wikileague.cf reverse DNS resolves to cp20.miraheze.org - NS RECORDS OK [12:27:18] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 10.22, 8.24, 7.48 [12:28:42] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.12, 6.75, 5.88 [12:29:15] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.28, 7.89, 7.47 [12:30:39] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 6.55, 6.66, 5.95 [12:32:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 4.17, 6.24, 7.57 [12:33:08] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.02, 5.71, 6.67 [12:38:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.01, 4.83, 6.52 [12:52:36] PROBLEM 
- mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 9.12, 9.55, 7.65 [12:54:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.20, 7.86, 7.26 [12:58:28] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.32, 6.72, 5.80 [13:00:28] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 6.11, 6.46, 5.82 [13:01:43] PROBLEM - cp21 Current Load on cp21 is WARNING: WARNING - load average: 0.95, 1.86, 1.25 [13:02:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.04, 5.90, 6.69 [13:03:40] RECOVERY - cp21 Current Load on cp21 is OK: OK - load average: 0.37, 1.32, 1.12 [13:19:52] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.81, 7.51, 6.48 [13:20:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 11.99, 8.36, 6.83 [13:22:28] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.32, 6.88, 6.13 [13:22:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 7.54, 7.90, 6.85 [13:24:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 5.04, 6.71, 6.53 [13:25:42] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.71, 7.89, 7.11 [13:26:28] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 4.73, 6.17, 6.05 [13:27:38] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.43, 8.19, 7.32 [13:27:47] .op [13:27:48] Attempting to OP... 
[13:29:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.69, 7.11, 7.02 [13:31:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.10, 6.41, 6.77 [13:34:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 8.10, 6.75, 6.46 [13:43:17] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 9.21, 7.66, 6.57 [13:43:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 10.91, 8.93, 7.64 [13:44:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 6.42, 8.00, 7.64 [13:45:14] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 6.14, 7.14, 6.53 [13:47:12] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 4.52, 6.30, 6.30 [13:50:36] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 16.38, 10.59, 8.61 [13:51:05] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 10.97, 8.05, 6.94 [13:53:02] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 5.77, 7.40, 6.86 [13:53:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.45, 7.90, 7.95 [13:55:00] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.36, 6.51, 6.59 [13:55:40] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.90, 8.12, 8.00 [13:56:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.45, 7.51, 7.95 [14:00:25] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.82, 3.44, 3.16 [14:01:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.24, 7.58, 7.95 [14:02:24] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.30, 3.27, 3.12 [14:07:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 11.65, 8.24, 7.92 [14:10:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 5.35, 5.79, 6.69 [14:10:38] PROBLEM - mw8 Current Load on mw8 is 
CRITICAL: CRITICAL - load average: 8.17, 6.92, 6.46 [14:11:49] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 8.05, 6.76, 5.57 [14:12:36] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 6.70, 6.75, 6.45 [14:13:12] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 6.11, 7.35, 6.25 [14:13:49] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 3.65, 5.62, 5.30 [14:15:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 4.65, 7.01, 7.67 [14:16:29] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 6.44, 6.87, 6.58 [14:17:12] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 5.53, 6.54, 6.19 [14:17:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 9.71, 7.31, 7.65 [14:18:28] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.04, 7.25, 6.74 [14:19:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.93, 6.66, 7.37 [14:20:05] PROBLEM - mw9 APT on mw9 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [14:21:17] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 10.89, 10.04, 8.15 [14:21:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 12.61, 8.73, 8.03 [14:22:01] RECOVERY - mw9 APT on mw9 is OK: APT OK: 1 packages available for upgrade (0 critical updates). 
[14:22:28] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.74, 7.83, 7.11 [14:26:04] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 7.74, 6.44, 5.60 [14:26:28] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 9.21, 8.24, 7.40 [14:28:00] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 5.61, 6.12, 5.59 [14:28:53] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 21.40, 19.51, 18.05 [14:30:45] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.26, 7.88, 7.94 [14:30:52] RECOVERY - cloud4 Current Load on cloud4 is OK: OK - load average: 17.06, 18.59, 17.90 [14:32:28] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 4.90, 7.49, 7.54 [14:34:12] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.61, 3.73, 3.41 [14:35:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.59, 6.90, 7.92 [14:36:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 3.69, 5.29, 6.77 [14:38:28] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 4.27, 5.39, 6.60 [14:40:12] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 1.78, 2.98, 3.21 [14:41:38] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 5.21, 5.11, 6.72 [14:46:12] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.58, 3.61, 3.36 [14:48:12] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.81, 3.26, 3.25 [14:52:14] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.57, 3.85, 3.47 [14:54:13] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.73, 3.39, 3.35 [15:11:10] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.96, 3.41, 3.27 [15:21:01] RECOVERY - lakehub.ga - reverse DNS on sslhost is OK: SSL OK - lakehub.ga reverse DNS resolves to cp21.miraheze.org - NS RECORDS OK [15:21:39] PROBLEM - 
housing.wiki - reverse DNS on sslhost is WARNING: SSL WARNING - rDNS OK but records conflict. {'NS': ['ns1.dreamhost.com.', 'ns3.dreamhost.com.', 'ns2.dreamhost.com.'], 'CNAME': None} PROBLEM - wiki.3805.co.uk - reverse DNS on sslhost is WARNING: SSL WARNING - rDNS OK but records conflict. {'NS': ['ultra104.uk2.net.', 'ultra103.uk2.net.'], 'CNAME': '3805.miraheze.org.'} [15:21:39] PROBLEM - dreamsit.com.br - reverse DNS on sslhost is WARNING: SSL WARNING - rDNS OK but records conflict. {'NS': ['ns82.domaincontrol.com.', 'ns81.domaincontrol.com.'], 'CNAME': None} [15:21:40] PROBLEM - wiki.minkyu.kim - reverse DNS on sslhost is WARNING: SSL WARNING - rDNS OK but records conflict. {'NS': ['ns3.wordpress.com.', 'ns2.wordpress.com.', 'ns1.wordpress.com.'], 'CNAME': 'mk.miraheze.org.'} [15:21:40] PROBLEM - fr.gyaanipedia.com - reverse DNS on sslhost is WARNING: rDNS WARNING - reverse DNS entry for fr.gyaanipedia.com could not be found [15:21:40] PROBLEM - en.famepedia.org - LetsEncrypt on sslhost is CRITICAL: CRITICAL - Certificate 'en.famepedia.org' expires in 6 day(s) (Wed 08 Dec 2021 04:12:21 GMT +0000). PROBLEM - ambient.wiki - LetsEncrypt on sslhost is CRITICAL: CRITICAL - Certificate 'ambient.wiki' expires in 3 day(s) (Sun 05 Dec 2021 08:49:56 GMT +0000). 
PROBLEM - pa.gyaanipedia.com - reverse DNS on sslhost is WARNING: rDNS WARNING - reverse DNS entry for pa.gyaanipedia.com c [15:24:36] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 3.25, 6.91, 5.92 [15:25:06] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.17, 3.75, 3.51 [15:26:36] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 2.68, 5.54, 5.54 [15:31:04] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.83, 3.55, 3.53 [15:37:02] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 1.90, 2.84, 3.24 [15:44:28] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.49, 6.31, 4.96 [15:45:49] RhinosF1: https://phabricator.miraheze.org/T8364 what else is left to do on the task? [15:45:51] [url] ⚓ T8364 DNS failures for all freenom registered domains | phabricator.miraheze.org [15:46:28] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.52, 5.71, 4.89 [15:48:41] PROBLEM - cp30 Current Load on cp30 is CRITICAL: CRITICAL - load average: 2.01, 1.86, 1.16 [15:50:41] RECOVERY - cp30 Current Load on cp30 is OK: OK - load average: 0.82, 1.48, 1.10 [15:50:57] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.53, 3.89, 3.52 [15:52:56] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.53, 3.83, 3.55 [15:54:56] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.02, 3.85, 3.59 [15:55:11] JohnLewis: remove my downtime [15:55:20] And reset topic [15:55:47] Luckily no users complained [15:56:31] If anyone did, they can be directed to freenom :P [15:56:55] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.25, 3.79, 3.61 [15:57:39] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.79, 7.51, 6.23 [15:58:43] JohnLewis: https://phabricator.miraheze.org/T8364#168916 [15:58:44] [url] ⚓ T8364 DNS failures for all freenom registered domains | 
phabricator.miraheze.org [15:58:55] No idea what happened but not us [15:59:22] tbh, I don't think a task really achieved much anyway, I wouldn't have even opened a task [16:00:40] Tracking, a notice we're aware, central place we can send dupe tasks if one was made, reference for icinga history [16:00:54] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 1.96, 2.93, 3.30 [16:01:39] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.60, 6.77, 6.32 [16:04:44] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.56, 3.53, 3.47 [16:08:33] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 3.10, 3.31, 3.40 [16:11:07] mkay, I still wouldn't have made one regardless anyway [16:13:06] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.03, 6.06, 5.47 [16:15:06] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.37, 5.87, 5.48 [16:17:10] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.88, 3.72, 3.53 [16:18:55] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 10.82, 7.73, 6.52 [16:19:05] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.14, 3.24, 3.38 [16:19:50] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 14.55, 8.56, 6.78 [16:21:06] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 13.51, 9.47, 6.99 [16:22:37] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [16:22:41] PROBLEM - mw9 APT on mw9 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [16:24:44] RECOVERY - mw9 APT on mw9 is OK: APT OK: 1 packages available for upgrade (0 critical updates). 
[16:27:36] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [16:28:38] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.85, 4.22, 3.68 [16:40:05] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.58, 3.66, 3.82 [16:40:55] PROBLEM - mw13 Current Load on mw13 is CRITICAL: CRITICAL - load average: 8.46, 6.74, 5.58 [16:41:07] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.17, 7.47, 7.93 [16:41:31] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 9.04, 7.85, 6.66 [16:42:35] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 11.95, 7.95, 6.37 [16:43:29] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.26, 7.41, 6.63 [16:43:54] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.16, 3.87, 3.86 [16:45:49] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.03, 3.59, 3.77 [16:46:55] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 5.65, 7.22, 6.29 [16:47:25] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 5.55, 6.69, 6.54 [16:48:33] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 7.71, 7.81, 6.95 [16:48:36] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [16:48:55] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 5.97, 6.68, 6.19 [16:49:06] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 12.12, 9.16, 8.42 [16:52:33] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 4.39, 6.02, 6.42 [16:53:36] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [16:53:48] !log delete 5 old indices on graylog [16:53:56] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [16:54:46] RECOVERY - 
graylog2 Disk Space on graylog2 is OK: DISK OK - free space: / 173728 MB (24% inode=99%); [16:57:06] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 6.71, 7.53, 7.96 [17:03:31] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.54, 2.88, 3.35 [17:03:39] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMovM [17:03:41] [miraheze/puppet] paladox 603bd4a - graylog: Upgrade graylog to 4.2.1 [17:03:53] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMov9 [17:03:55] [miraheze/puppet] paladox f121906 - graylog: Upgrade mongodb to 4.4.10 [17:04:18] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMov5 [17:04:20] [miraheze/puppet] paladox 20c41c5 - graylog: Upgrade elasticsearch to 7.15.2 [17:08:40] [miraheze/mw-config] Universal-Omega pushed 1 commit to Universal-Omega-patch-1 [+0/-0/±1] https://git.io/JMofN [17:08:42] [miraheze/mw-config] Universal-Omega 841e470 - Remove Southparkfan from staffwiki [17:08:43] [mw-config] Universal-Omega created branch Universal-Omega-patch-1 - https://git.io/vbvb3 [17:08:45] [mw-config] Universal-Omega opened pull request #4242: Remove Southparkfan from staffwiki - https://git.io/JMofx [17:09:06] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.62, 5.73, 6.75 [17:09:43] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.36, 6.44, 7.70 [17:09:50] miraheze/mw-config - Universal-Omega the build passed. [17:12:31] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 5.09, 6.78, 7.95 [17:13:27] PROBLEM - graylog2 Puppet on graylog2 is CRITICAL: CRITICAL: Puppet has 2 failures. Last run 2 minutes ago with 2 failures. 
Failed resources (up to 3 shown): Service[elasticsearch],Elasticsearch_template[graylog-internal] [17:15:16] RECOVERY - graylog2 Current Load on graylog2 is OK: OK - load average: 1.62, 0.38, 0.13 [17:15:39] !log rebooted graylog2 (seems like processess were stuck? high load/cpu) [17:15:43] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:16:47] PROBLEM - graylog2 HTTPS on graylog2 is CRITICAL: HTTP CRITICAL: HTTP/1.1 502 Bad Gateway - 311 bytes in 0.012 second response time [17:17:26] RECOVERY - graylog2 Puppet on graylog2 is OK: OK: Puppet is currently enabled, last run 57 seconds ago with 0 failures [17:17:33] [mw-config] Universal-Omega deleted branch Universal-Omega-patch-1 - https://git.io/vbvb3 [17:17:34] [miraheze/mw-config] Universal-Omega pushed 1 commit to master [+0/-0/±1] https://git.io/JMoUr [17:17:36] [miraheze/mw-config] Universal-Omega 3809d73 - Remove Southparkfan from staffwiki (#4242) [17:17:37] [miraheze/mw-config] Universal-Omega deleted branch Universal-Omega-patch-1 [17:17:39] [mw-config] Universal-Omega closed pull request #4242: Remove Southparkfan from staffwiki - https://git.io/JMofx [17:18:31] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.79, 7.42, 7.79 [17:18:44] miraheze/mw-config - Universal-Omega the build passed. 
[17:19:16] PROBLEM - graylog2 Current Load on graylog2 is CRITICAL: CRITICAL - load average: 10.22, 5.74, 2.34 [17:19:29] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 9.79, 7.57, 7.47 [17:20:43] RECOVERY - graylog2 HTTPS on graylog2 is OK: HTTP OK: HTTP/1.1 200 OK - 1418 bytes in 0.087 second response time [17:21:39] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 9.14, 6.71, 6.41 [17:21:54] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 24.42, 21.78, 19.89 [17:23:33] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.70, 6.95, 6.54 [17:24:39] !log [@test3] starting deploy of {'config': True} to skip [17:24:40] !log [@test3] finished deploy of {'config': True} to skip - SUCCESS in 0s [17:24:45] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:24:51] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:25:27] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 6.66, 6.71, 6.49 [17:25:54] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 20.62, 21.98, 20.48 [17:27:41] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMokX [17:27:43] [miraheze/puppet] paladox eef09ea - matom: Upgrade to 4.6.1 [17:31:47] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.11, 6.86, 6.55 [17:32:29] !log rebooted mon2, high load [17:33:02] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [17:33:14] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.00, 6.24, 6.36 [17:34:25] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 13.69, 4.60, 1.64 [17:34:59] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 3.72, 6.74, 7.99 [17:36:51] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.88, 6.43, 7.80 
[17:38:01] ok : [RESOLVED] (mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [17:42:26] !log [@mw11] starting deploy of {'config': True} to all [17:42:31] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:42:46] !log [@mw11] finished deploy of {'config': True} to all - SUCCESS in 19s [17:42:53] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:43:26] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 7.88, 7.21, 6.12 [17:44:42] [miraheze/mw-config] Universal-Omega pushed 1 commit to Universal-Omega-patch-1 [+0/-0/±1] https://git.io/JMotb [17:44:43] [miraheze/mw-config] Universal-Omega 1460b35 - Disable DBPerformance logging [17:44:45] [mw-config] Universal-Omega created branch Universal-Omega-patch-1 - https://git.io/vbvb3 [17:44:46] [mw-config] Universal-Omega opened pull request #4243: Disable DBPerformance logging - https://git.io/JMotN [17:45:10] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.60, 5.26, 6.60 [17:45:25] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 6.48, 6.80, 6.10 [17:45:38] [mw-config] paladox closed pull request #4243: Disable DBPerformance logging - https://git.io/JMotN [17:45:39] [miraheze/mw-config] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMoqk [17:45:41] [miraheze/mw-config] Universal-Omega f74d426 - Disable DBPerformance logging (#4243) [17:45:42] [mw-config] paladox deleted branch Universal-Omega-patch-1 - https://git.io/vbvb3 [17:45:44] [miraheze/mw-config] paladox deleted branch Universal-Omega-patch-1 [17:45:47] miraheze/mw-config - Universal-Omega the build passed. 
[17:46:31] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 6.94, 6.30, 5.58 [17:46:32] !log [paladox@mw11] starting deploy of {'config': True} to all [17:46:36] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:46:39] !log [paladox@mw11] finished deploy of {'config': True} to all - SUCCESS in 6s [17:46:43] miraheze/mw-config - paladox the build passed. [17:46:44] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:48:24] paladox: I don't think that that did anything, it wouldn't have been pulled yet unless you did manually, otherwise it would need the `--pull=config` argument. [17:48:31] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 4.80, 5.91, 5.54 [17:48:34] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.76, 3.85, 3.66 [17:48:35] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 10.75, 7.11, 7.22 [17:48:40] i manually git pulled [17:48:46] paladox: just run puppet [17:48:52] Puppet does it automatically [17:48:59] yeh but it's slower [17:49:09] paladox: Alright thought you might've but wasn't sure. Thanks! 
[17:49:55] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 25.53, 22.17, 20.51 [17:50:22] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 9.50, 8.00, 6.77 [17:50:59] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 16.11, 9.04, 7.62 [17:51:53] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 21.41, 22.01, 20.67 [17:54:53] !log [@test3] starting deploy of {'config': True} to skip [17:54:55] !log [@test3] finished deploy of {'config': True} to skip - SUCCESS in 1s [17:55:06] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:55:11] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [17:58:14] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 6.65, 7.71, 7.43 [17:58:22] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 6.06, 7.45, 7.90 [17:59:52] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 24.12, 23.18, 21.60 [18:01:52] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 19.41, 22.06, 21.41 [18:02:11] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.27, 7.53, 7.40 [18:04:09] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.29, 7.43, 7.39 [18:04:39] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 4.36, 7.40, 7.98 [18:08:05] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 3.38, 5.50, 6.62 [18:10:06] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 5.17, 5.57, 6.73 [18:10:22] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.74, 4.24, 3.88 [18:12:27] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 12.33, 7.61, 7.55 [18:13:52] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 27.62, 24.96, 22.49 [18:14:22] PROBLEM - mon2 Current Load on mon2 is 
WARNING: WARNING - load average: 2.70, 3.70, 3.77 [18:14:24] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.75, 7.58, 7.58 [18:15:52] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 20.17, 22.84, 22.00 [18:19:52] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 34.68, 25.30, 22.90 [18:20:15] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 8.37, 7.26, 7.29 [18:20:22] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.54, 3.70, 3.69 [18:21:47] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 13.01, 8.56, 7.21 [18:22:15] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 4.81, 6.58, 7.06 [18:22:22] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.08, 3.51, 3.63 [18:22:31] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 10.08, 7.97, 6.38 [18:23:10] PROBLEM - cloud5 Current Load on cloud5 is WARNING: WARNING - load average: 22.76, 19.29, 16.89 [18:23:13] PROBLEM - ns2 GDNSD Datacenters on ns2 is CRITICAL: CRITICAL - 1 datacenter is down: 51.195.220.68/cpweb [18:23:55] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 7.99, 6.93, 6.01 [18:24:02] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [18:24:15] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.88, 5.79, 6.69 [18:25:09] RECOVERY - ns2 GDNSD Datacenters on ns2 is OK: OK - all datacenters are online [18:25:10] RECOVERY - cloud5 Current Load on cloud5 is OK: OK - load average: 18.51, 18.85, 17.02 [18:25:42] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.53, 7.15, 6.95 [18:25:53] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 5.88, 6.60, 6.00 [18:26:31] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 4.42, 7.22, 6.58 [18:27:39] 
RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.53, 6.16, 6.61 [18:28:15] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 10.73, 7.57, 7.15 [18:28:31] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 3.18, 5.91, 6.18 [18:29:02] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [18:29:52] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 18.07, 22.85, 23.36 [18:30:15] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.01, 6.80, 6.91 [18:32:15] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.27, 6.12, 6.67 [18:34:22] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 1.79, 2.85, 3.32 [18:38:22] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.69, 3.36, 3.42 [18:41:55] !log puppet3: upgrade puppet-agent puppetdb puppetdb-termini puppetserver [18:42:00] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:42:22] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.38, 4.09, 3.66 [18:44:22] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.87, 3.98, 3.67 [18:45:10] PROBLEM - cloud4 Puppet on cloud4 is CRITICAL: CRITICAL: Failed to apply catalog, zero resources tracked by Puppet. It might be a dependency cycle. 
[18:45:12] !log upgrade puppet-agent everywhere [18:45:24] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:45:52] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 35.80, 23.84, 22.18 [18:46:22] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.86, 4.87, 4.05 [18:47:10] PROBLEM - cloud5 Current Load on cloud5 is CRITICAL: CRITICAL - load average: 35.75, 24.03, 17.67 [18:48:52] !log mon2: upgrade icinga2 [18:48:57] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:49:23] !log mon2: upgrade icingaweb2 [18:49:32] RECOVERY - cloud5 Current Load on cloud5 is OK: OK - load average: 15.25, 19.84, 16.99 [18:49:32] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [18:51:47] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 19.58, 23.27, 22.85 [18:51:54] !log mon2: upgrade grafana [18:51:58] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:01:18] RECOVERY - cloud4 Current Load on cloud4 is OK: OK - load average: 14.69, 17.38, 19.88 [19:05:44] PROBLEM - graylog2 Current Load on graylog2 is WARNING: WARNING - load average: 0.33, 0.71, 3.80 [19:06:54] RhinosF1: do 09:44:00 !log [urbanecm@mwmaint1002 ~]$ foreachwiki extensions/CheckUser/maintenance/fixTrailingSpacesInLogs.php for mh [19:07:00] [miraheze/mw-config] Universal-Omega pushed 1 commit to master [+0/-0/±1] https://git.io/JMoRT [19:07:02] [miraheze/mw-config] Universal-Omega 6879d81 - Fix RandomGameUnit loading [19:08:07] miraheze/mw-config - Universal-Omega the build passed. [19:08:17] JohnLewis: fyi I'm gonna run the script to clean up an old bug in CU.
It should be no-op but if you see errors then shout at me [19:08:21] I'll run it later tonight [19:09:38] RECOVERY - graylog2 Current Load on graylog2 is OK: OK - load average: 0.48, 0.61, 3.08 [19:12:07] !log [@mw11] starting deploy of {'config': True} to all [19:12:10] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:12:14] !log [@mw11] finished deploy of {'config': True} to all - SUCCESS in 7s [19:12:18] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:14:15] RECOVERY - cloud4 Puppet on cloud4 is OK: OK: Puppet is currently enabled, last run 1 minute ago with 0 failures [19:15:55] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.74, 3.27, 3.94 [19:19:13] Latest post on [[CN]] asking about MediaWiki 1.37 rollout [19:19:13] https://meta.miraheze.org/wiki/CN [19:19:14] https://meta.miraheze.org/wiki/CN [19:19:17] [url] Community noticeboard - Miraheze Meta | meta.miraheze.org [19:21:18] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 24.31, 20.70, 18.87 [19:21:25] PROBLEM - gluster3 Current Load on gluster3 is WARNING: WARNING - load average: 5.48, 4.58, 3.97 [19:23:12] RECOVERY - cloud4 Current Load on cloud4 is OK: OK - load average: 13.75, 18.31, 18.24 [19:23:23] RECOVERY - gluster3 Current Load on gluster3 is OK: OK - load average: 1.83, 3.62, 3.70 [19:24:47] !log [@test3] starting deploy of {'config': True} to skip [19:24:48] !log [@test3] finished deploy of {'config': True} to skip - SUCCESS in 0s [19:24:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:24:56] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [19:33:35] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.90, 3.88, 3.78 [19:36:32] I'll answer there later if no one does but basically we'll try to do it in the next few weeks but there's some extension blockers that need resolving before that [19:36:40] 
We've almost finished extension testing [19:37:58] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.63, 6.76, 5.37 [19:38:06] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 9.38, 8.18, 6.51 [19:38:58] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 7.17, 7.16, 5.94 [19:40:02] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 5.27, 7.06, 6.29 [19:40:58] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 5.66, 6.72, 5.94 [19:41:27] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.48, 3.95, 3.89 [19:41:55] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 6.58, 6.54, 5.57 [19:41:58] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 4.91, 6.62, 6.24 [19:43:25] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.11, 4.14, 3.97 [19:45:20] I figured the rollout was soon, but that someone from SRE is more able to offer an elegant answer [19:45:33] From the start that is [19:47:21] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.50, 3.77, 3.89 [19:49:19] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.28, 3.90, 3.92 [19:52:01] PROBLEM - mw8 Current Load on mw8 is WARNING: WARNING - load average: 7.21, 6.87, 6.13 [19:52:22] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 9.65, 9.20, 6.75 [19:53:34] PROBLEM - lcn.zfc.id.lv - reverse DNS on sslhost is WARNING: Traceback (most recent call last): File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 148, in main() File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 129, in main records = check_records(args.hostname) File "/usr/lib/nagios/plugins/check_reverse_dns.py", line 66, in check_records nameserversans = dns_resolver.query(root_domain, 'NS') File "/usr/lib/python3/ [19:53:34] packages/dns/resolver.py", line 1002, in query raise 
NXDOMAIN(qnames=qnames_to_try, responses=nxdomain_responses)dns.resolver.NXDOMAIN: None of DNS query names exist: zfc.id.lv., zfc.id.lv. [19:53:57] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 5.96, 6.46, 6.06 [19:54:19] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 5.75, 7.83, 6.54 [20:00:28] RECOVERY - lcn.zfc.id.lv - reverse DNS on sslhost is OK: SSL OK - lcn.zfc.id.lv reverse DNS resolves to cp21.miraheze.org - CNAME OK [20:02:19] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.42, 6.20, 6.39 [20:09:55] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 8.62, 8.13, 6.90 [20:13:46] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 5.66, 7.23, 6.84 [20:15:42] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 3.28, 5.91, 6.41 [20:21:20] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.16, 6.79, 5.68 [20:23:20] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 4.34, 5.92, 5.50 [20:23:29] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.97, 7.26, 6.28 [20:25:28] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 5.94, 6.71, 6.19 [20:30:34] Someone posted, I'll just relay the gist to the CN [20:30:36] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 6.47, 7.57, 6.66 [20:32:36] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.26, 6.47, 6.37 [20:33:02] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 7.90, 6.70, 6.24 [20:34:58] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 6.08, 6.79, 6.35 [20:37:02] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 7.69, 6.69, 5.97 [20:39:00] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 6.24, 6.45, 5.97 [20:43:25] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 9.18, 7.80, 6.84 [20:44:19] PROBLEM - mw9 Current 
Load on mw9 is WARNING: WARNING - load average: 7.47, 7.80, 6.39 [20:45:19] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 4.92, 6.74, 6.57 [20:46:19] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 4.76, 6.75, 6.18 [21:00:28] PROBLEM - cp31 Stunnel Http for mw13 on cp31 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:00:39] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [21:00:44] PROBLEM - gluster4 Current Load on gluster4 is CRITICAL: CRITICAL - load average: 11.45, 5.63, 3.20 [21:00:49] PROBLEM - ns2 GDNSD Datacenters on ns2 is CRITICAL: CRITICAL - 6 datacenters are down: 198.244.148.90/cpweb, 2001:41d0:801:2000::4c25/cpweb, 2001:41d0:801:2000::1b80/cpweb, 149.56.140.43/cpweb, 149.56.141.75/cpweb, 2607:5300:201:3100::929a/cpweb [21:00:57] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 35.92, 31.66, 22.70 [21:01:10] PROBLEM - ns1 GDNSD Datacenters on ns1 is CRITICAL: CRITICAL - 6 datacenters are down: 51.195.220.68/cpweb, 2001:41d0:801:2000::4c25/cpweb, 2001:41d0:801:2000::1b80/cpweb, 149.56.140.43/cpweb, 149.56.141.75/cpweb, 2607:5300:201:3100::929a/cpweb [21:01:32] PROBLEM - cp30 Stunnel Http for mw13 on cp30 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:01:42] PROBLEM - cp20 Stunnel Http for mw10 on cp20 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:01:43] PROBLEM - mw13 MediaWiki Rendering on mw13 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [21:01:43] PROBLEM - gluster3 Current Load on gluster3 is WARNING: WARNING - load average: 5.21, 5.59, 3.82 [21:01:55] PROBLEM - cp30 Varnish Backends on cp30 is CRITICAL: 2 backends are down. 
mw8 mw10 [21:02:00] PROBLEM - cloud5 Current Load on cloud5 is CRITICAL: CRITICAL - load average: 65.21, 43.44, 27.27 [21:02:12] paladox: JohnLewis Reception123 [21:02:31] RECOVERY - cp31 Stunnel Http for mw13 on cp31 is OK: HTTP OK: HTTP/1.1 200 OK - 15861 bytes in 3.919 second response time [21:02:31] Visible 503s. [21:03:27] RECOVERY - cp30 Stunnel Http for mw13 on cp30 is OK: HTTP OK: HTTP/1.1 200 OK - 15861 bytes in 0.372 second response time [21:03:38] RECOVERY - cp20 Stunnel Http for mw10 on cp20 is OK: HTTP OK: HTTP/1.1 200 OK - 15861 bytes in 0.011 second response time [21:03:41] RECOVERY - mw13 MediaWiki Rendering on mw13 is OK: HTTP OK: HTTP/1.1 200 OK - 20011 bytes in 0.143 second response time [21:03:50] RECOVERY - cp30 Varnish Backends on cp30 is OK: All 9 backends are healthy [21:04:49] RECOVERY - ns2 GDNSD Datacenters on ns2 is OK: OK - all datacenters are online [21:05:10] RECOVERY - ns1 GDNSD Datacenters on ns1 is OK: OK - all datacenters are online [21:05:38] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [21:05:40] RECOVERY - gluster3 Current Load on gluster3 is OK: OK - load average: 2.60, 4.46, 3.80 [21:06:43] PROBLEM - gluster4 Current Load on gluster4 is WARNING: WARNING - load average: 3.24, 5.43, 4.18 [21:08:43] RECOVERY - gluster4 Current Load on gluster4 is OK: OK - load average: 2.38, 4.58, 4.03 [21:08:48] PROBLEM - cloud4 Current Load on cloud4 is WARNING: WARNING - load average: 14.98, 23.40, 23.11 [21:09:55] CosmicAlpha: looks to have recovered [21:12:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.80, 3.53, 3.99 [21:16:04] !log depool mw8 [21:16:08] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [21:18:53] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 10.04, 7.96, 6.68 [21:20:02] PROBLEM - cp20 Varnish Backends on cp20 is CRITICAL: 1 backends are down. 
mw8 [21:20:17] PROBLEM - mw13 Current Load on mw13 is CRITICAL: CRITICAL - load average: 8.91, 7.32, 5.96 [21:20:19] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 6.88, 6.14, 5.22 [21:20:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.06, 3.71, 3.87 [21:20:49] RECOVERY - cloud4 Current Load on cloud4 is OK: OK - load average: 13.77, 17.55, 20.18 [21:20:52] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.24, 7.61, 6.70 [21:21:42] !log reimaging mw8 as bullseye (11) [21:21:48] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [21:21:56] PROBLEM - cloud5 Current Load on cloud5 is WARNING: WARNING - load average: 21.22, 22.44, 23.88 [21:22:19] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 6.13, 5.97, 5.26 [21:22:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.34, 3.65, 3.84 [21:22:49] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 6.27, 6.85, 6.27 [21:22:51] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 8.51, 8.06, 6.98 [21:23:07] paladox: obviously every repo will be blank when imaged [21:23:20] RhinosF1: what do you mean [21:23:41] paladox: nothing in /srv/mediawiki will exist [21:23:50] oh ok [21:23:53] what do i do? [21:24:13] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 5.17, 6.83, 6.12 [21:24:19] PROBLEM - mw8 JobRunner Service on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:24:20] PROBLEM - mw8 php-fpm on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:24:24] PROBLEM - cp31 Stunnel Http for mw8 on cp31 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. 
[21:24:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.02, 4.16, 4.00 [21:24:31] You can do deploy-mediawiki --world --config --landing --errorpages --ignore-time --servers=mw8 paladox [21:24:38] Which should push everything [21:24:40] PROBLEM - mw8 NTP time on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:24:41] ok [21:24:42] PROBLEM - mw8 APT on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:24:42] PROBLEM - ping6 on mw8 is CRITICAL: PING CRITICAL - Packet loss = 100% [21:24:46] PROBLEM - mw8 MediaWiki Rendering on mw8 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [21:24:47] Oh --l10n too paladox [21:24:51] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 6.33, 6.69, 6.27 [21:24:51] PROBLEM - cp20 Stunnel Http for mw8 on cp20 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:24:52] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.08, 7.51, 6.90 [21:24:55] PROBLEM - ping4 on mw8 is CRITICAL: PING CRITICAL - Packet loss = 100% [21:24:59] PROBLEM - cp30 Stunnel Http for mw8 on cp30 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:03] PROBLEM - mw8 Puppet on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:03] PROBLEM - mw8 conntrack_table_size on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:07] ok [21:25:08] Or just leave it for me to write some docs because I don't think we did [21:25:09] PROBLEM - mw8 Check Gluster Clients on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:14] PROBLEM - mw8 SSH on mw8 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [21:25:22] PROBLEM - cp21 Stunnel Http for mw8 on cp21 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. 
[21:25:23] PROBLEM - mw8 ferm_active on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:25] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:26] PROBLEM - cp21 Varnish Backends on cp21 is CRITICAL: 1 backends are down. mw8 [21:25:27] But in short deploy everything but with --servers=mw8 [21:25:41] PROBLEM - mw8 PowerDNS Recursor on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:25:49] PROBLEM - cp30 Varnish Backends on cp30 is CRITICAL: 1 backends are down. mw8 [21:25:58] PROBLEM - cp31 Varnish Backends on cp31 is CRITICAL: 1 backends are down. mw8 [21:26:01] PROBLEM - mw8 HTTPS on mw8 is CRITICAL: CRITICAL - Socket timeout after 10 seconds [21:26:10] PROBLEM - mw8 Disk Space on mw8 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [21:26:50] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 5.46, 6.77, 6.71 [21:26:58] RECOVERY - ping4 on mw8 is OK: PING OK - Packet loss = 0%, RTA = 2.41 ms [21:28:08] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 6.44, 6.67, 6.23 [21:30:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.23, 3.90, 3.97 [21:31:56] RECOVERY - cloud5 Current Load on cloud5 is OK: OK - load average: 16.77, 17.29, 20.38 [21:32:00] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 6.86, 6.98, 6.47 [21:32:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.45, 4.16, 4.06 [21:32:46] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 6.84, 6.81, 6.72 [21:32:50] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 9.35, 8.28, 7.03 [21:32:52] RECOVERY - mw8 SSH on mw8 is OK: SSH OK - OpenSSH_8.4p1 Debian-5 (protocol 2.0) [21:34:44] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 5.97, 6.37, 6.57 [21:34:50] PROBLEM - mw11 Current Load on mw11 is 
WARNING: WARNING - load average: 6.53, 7.46, 6.87 [21:35:56] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 6.38, 6.77, 6.51 [21:36:50] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 4.73, 6.56, 6.62 [21:37:15] RECOVERY - ping6 on mw8 is OK: PING OK - Packet loss = 0%, RTA = 4.66 ms [21:38:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.03, 3.81, 3.97 [21:42:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.16, 4.05, 4.02 [21:45:25] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JModF [21:45:27] [miraheze/puppet] paladox 1ecc760 - mw8: Switch php versions to 7.4 [21:46:02] RECOVERY - mw8 Disk Space on mw8 is OK: DISK OK - free space: / 14698 MB (77% inode=90%); [21:46:26] RECOVERY - mw8 APT on mw8 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [21:46:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 3.48, 3.86, 3.96 [21:46:41] PROBLEM - cp30 Stunnel Http for mw8 on cp30 is WARNING: HTTP WARNING: HTTP/1.1 404 Not Found - 322 bytes in 0.234 second response time [21:46:47] PROBLEM - mw8 conntrack_table_size on mw8 is UNKNOWN: NRPE: Unable to read output [21:46:47] PROBLEM - mw8 Puppet on mw8 is UNKNOWN: NRPE: Unable to read output [21:46:47] PROBLEM - cp31 Stunnel Http for mw8 on cp31 is WARNING: HTTP WARNING: HTTP/1.1 404 Not Found - 322 bytes in 0.230 second response time [21:46:53] RECOVERY - mw8 Check Gluster Clients on mw8 is OK: PROCS OK: 1 process with args '/usr/sbin/glusterfs' [21:46:57] RECOVERY - mw8 NTP time on mw8 is OK: NTP OK: Offset -0.0002363324165 secs [21:47:00] PROBLEM - cp20 Stunnel Http for mw8 on cp20 is WARNING: HTTP WARNING: HTTP/1.1 404 Not Found - 322 bytes in 0.005 second response time [21:47:06] PROBLEM - mw8 ferm_active on mw8 is UNKNOWN: NRPE: Unable to read output [21:47:32] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 0.72, 0.72, 0.38
[21:47:32] PROBLEM - cp21 Stunnel Http for mw8 on cp21 is WARNING: HTTP WARNING: HTTP/1.1 404 Not Found - 322 bytes in 0.010 second response time [21:47:33] RECOVERY - mw8 PowerDNS Recursor on mw8 is OK: DNS OK: 0.124 seconds response time. miraheze.org returns 198.244.148.90,2001:41d0:801:2000::1b80,2001:41d0:801:2000::4c25,51.195.220.68 [21:47:44] RECOVERY - mw8 HTTPS on mw8 is OK: HTTP OK: HTTP/1.1 301 Moved Permanently - 541 bytes in 0.020 second response time [21:48:07] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'ignoretime': True} to mw8 [21:48:11] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [21:48:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 5.76, 4.41, 4.14 [21:48:42] RECOVERY - mw8 conntrack_table_size on mw8 is OK: OK: nf_conntrack is 0 % full [21:48:43] PROBLEM - mw8 Puppet on mw8 is CRITICAL: CRITICAL: Puppet has 2 failures. Last run 39 seconds ago with 2 failures. Failed resources (up to 3 shown): Package[php-luasandbox],Package[php7.4-dba] [21:48:47] RECOVERY - mw8 JobRunner Service on mw8 is OK: PROCS OK: 1 process with args 'redisJobRunnerService' [21:49:02] RECOVERY - mw8 ferm_active on mw8 is OK: OK ferm input default policy is set [21:49:21] paladox: puppet failing ^ [21:49:28] i'm aware [21:50:25] PROBLEM - mw8 MediaWiki Rendering on mw8 is WARNING: HTTP WARNING: HTTP/1.1 404 Not Found - 251 bytes in 0.039 second response time [21:50:50] RECOVERY - mw8 php-fpm on mw8 is OK: PROCS OK: 33 processes with command name 'php-fpm7.4' [21:52:38] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [21:57:35] PROBLEM - test3 APT on test3 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). 
[21:58:08] (I know it's slow for single server but it's rarely used, I'll add some fixes during the v2 rewrite) [21:58:10] PROBLEM - cloud5 APT on cloud5 is CRITICAL: APT CRITICAL: 2 packages available for upgrade (1 critical updates). [21:58:29] PROBLEM - mw12 APT on mw12 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [21:59:46] PROBLEM - mail2 APT on mail2 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [22:00:50] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 9.87, 7.44, 6.13 [22:01:24] [miraheze/puppet] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMoNQ [22:01:26] [miraheze/puppet] paladox 563881d - base: Install python-is-python3 on debian bullseye+ [22:02:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.53, 3.54, 3.93 [22:02:50] PROBLEM - graylog2 APT on graylog2 is CRITICAL: APT CRITICAL: 2 packages available for upgrade (1 critical updates). [22:04:28] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 3.90, 3.89, 4.02 [22:04:37] RECOVERY - mw8 Puppet on mw8 is OK: OK: Puppet is currently enabled, last run 36 seconds ago with 0 failures [22:05:50] !log [paladox@mw11] DEPLOY ABORTED: Canary check failed for mw8 [22:05:58] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:06:29] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 8.20, 6.66, 5.76 [22:06:41] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'ignoretime': True} to mw8 [22:06:48] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:06:50] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 5.78, 7.36, 6.59 [22:07:12] paladox: what was the error? Just canary or?
[22:07:20] ssh key [22:07:26] well [22:07:29] known_host [22:07:32] !log [paladox@mw11] DEPLOY ABORTED: Canary check failed for mw8 [22:07:36] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:07:40] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'ignoretime': True} to mw8 [22:07:43] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:07:51] paladox: ah course! [22:08:04] presume i don't need to do l10n so i dropped it to speed things up [22:08:33] you could have known_host automatically generated [22:08:50] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 10.84, 8.57, 7.12 [22:09:08] PROBLEM - mw13 Current Load on mw13 is CRITICAL: CRITICAL - load average: 9.56, 7.42, 5.88 [22:09:15] paladox: yes you do [22:09:30] Or there'll be no l10n files on the remote host [22:09:42] E.g. mw8 [22:09:43] oh [22:10:04] And re known hosts, explain and I can add that [22:10:07] why is it so slow, what is it doing seeing as it's not copying the files [22:10:29] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 7.97, 7.70, 6.42 [22:10:35] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'ignoretime': True} to mw8 [22:10:44] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:10:53] paladox: the prep steps to make sure everything is up to date that should be [22:11:00] oh [22:11:10] could we just do --update [22:11:12] V2 will be faster [22:11:40] Without --ignore-time might work [22:12:19] PROBLEM - mw8 APT on mw8 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates).
[22:12:29] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 9.45, 8.55, 6.90 [22:12:41] We do --update normally in the rsync call [22:13:06] PROBLEM - mwtask1 APT on mwtask1 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [22:13:22] PROBLEM - mw11 APT on mw11 is CRITICAL: CHECK_NRPE STATE CRITICAL: Socket timeout after 10 seconds. [22:13:28] It's because you always currently have to deploy to the deployment server first [22:13:35] That will be fixed in v2 [22:16:14] PROBLEM - cloud4 APT on cloud4 is CRITICAL: APT CRITICAL: 2 packages available for upgrade (1 critical updates). [22:20:30] [miraheze/GlobalNewFiles] Universal-Omega pushed 1 commit to master [+0/-1/±0] https://git.io/JMopS [22:20:32] [miraheze/GlobalNewFiles] Universal-Omega 843754b - Fix mistake made when doing CI [22:20:38] PROBLEM - db12 APT on db12 is CRITICAL: APT CRITICAL: 9 packages available for upgrade (1 critical updates). [22:20:46] PROBLEM - cloud3 APT on cloud3 is CRITICAL: APT CRITICAL: 2 packages available for upgrade (1 critical updates). [22:21:14] paladox: https://phabricator.miraheze.org/T8369 [22:21:16] [url] ⚓ T8369 explicitly selected deploys should not imply canary | phabricator.miraheze.org [22:21:47] PROBLEM - mw13 APT on mw13 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [22:21:56] PROBLEM - cloud5 Current Load on cloud5 is CRITICAL: CRITICAL - load average: 24.72, 21.49, 19.45 [22:22:06] PROBLEM - mw9 APT on mw9 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [22:22:24] ack [22:23:22] [puppet] Universal-Omega opened pull request #2139: Remove tideways_xhprof for PHP 7.3 - https://git.io/JMohY [22:24:08] PROBLEM - mw10 APT on mw10 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates).
[22:24:38] PROBLEM - puppet3 APT on puppet3 is CRITICAL: APT CRITICAL: 1 packages available for upgrade (1 critical updates). [22:25:56] PROBLEM - cloud5 Current Load on cloud5 is WARNING: WARNING - load average: 20.39, 21.84, 20.15 [22:27:03] miraheze/GlobalNewFiles - Universal-Omega the build passed. [22:29:56] RECOVERY - cloud5 Current Load on cloud5 is OK: OK - load average: 17.89, 20.20, 19.90 [22:29:57] !log [paladox@mw11] DEPLOY ABORTED: Canary check failed for mw8 [22:30:02] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:30:29] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 5.82, 7.27, 7.99 [22:31:42] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'ignoretime': True} to mw8 [22:31:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:32:18] paladox: Am I OK to push two extension updates (Nuke and PageForms) to 1.36, or should I wait for you to finish that? If you want me to wait will do. [22:32:34] yeh, you can go ahead [22:32:58] PROBLEM - mw13 Current Load on mw13 is WARNING: WARNING - load average: 5.38, 6.51, 7.84 [22:33:00] alright, thanks! [22:34:41] paladox: https://phabricator.miraheze.org/T8370 too [22:34:42] [url] ⚓ T8370 Generate known_hosts for deploy tool automatically | phabricator.miraheze.org [22:34:44] [miraheze/mediawiki] Universal-Omega pushed 1 commit to REL1_36 [+0/-0/±1] https://git.io/JMojP [22:34:46] [miraheze/mediawiki] Universal-Omega 0e6af9c - Update Nuke [22:34:47] paladox: why error now?
[22:35:05] permission errors [22:35:12] ugrh [22:35:15] urgh [22:35:55] --force might make more sense if you're watching because it'll carry on and just warn you about errors [22:36:00] rather than halting [22:36:34] like each error causes me to wait 10-20m [22:36:45] this is time wasting, the tool is slower than our previous system [22:37:05] for new images it's slow [22:37:36] [miraheze/mediawiki] Universal-Omega pushed 1 commit to REL1_36 [+0/-0/±1] https://git.io/JMojp [22:37:38] [miraheze/mediawiki] Universal-Omega 83f7ad9 - Update PageForms [22:38:14] if it hadn't been for known_hosts, it wouldn't have been too bad [22:38:43] paladox: 4 -rw------- 1 root root 2886 Dec 1 22:06 known_hosts [22:38:57] it needs to be readable by www-data surely [22:40:29] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 5.84, 5.80, 6.75 [22:42:28] PROBLEM - mon2 Current Load on mon2 is WARNING: WARNING - load average: 2.81, 3.35, 3.98 [22:43:18] !log [paladox@mw11] DEPLOY ABORTED: Canary check failed for mw8 [22:43:24] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:43:25] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 [22:43:29] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:43:30] GRRRRRRRRRRRRRR [22:43:47] !log [paladox@mw11] finished deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 - FAIL: [2, 0, 2, 2, 2, 0, 2, 2, 2, 2, 2, 2, 2, 2, 2] in 21s [22:43:48] paladox: why don't you try landing first?
[22:43:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:43:56] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 [22:43:59] as that should be extremely fast [22:43:59] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:44:02] or i can look [22:44:19] PROBLEM - mw8 MediaWiki Rendering on mw8 is CRITICAL: HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 241 bytes in 0.006 second response time [22:44:25] as long as the fingerprint is SHA256:WndbJVc0pm36kgJrDL1vv3TAVZAwHGPotY31orGLcWw [22:44:58] RECOVERY - mw13 Current Load on mw13 is OK: OK - load average: 5.77, 5.80, 6.66 [22:45:05] RECOVERY - cp21 Stunnel Http for mw8 on cp21 is OK: HTTP OK: HTTP/1.1 200 OK - 15864 bytes in 0.110 second response time [22:45:23] Failed to add the host to the list of known hosts (/var/www/.ssh/known_hosts). [22:45:39] !log [paladox@mw11] finished deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 - FAIL: [2, 0, 2, 2, 2, 2, 0, 0, 2, 0, 0, 0, 0, 0, 0] in 103s [22:45:39] as i said [22:45:49] RECOVERY - cp30 Varnish Backends on cp30 is OK: All 9 backends are healthy [22:45:51] 4 -rw------- 1 root root 2886 Dec 1 22:06 known_hosts [22:45:52] RECOVERY - cp31 Stunnel Http for mw8 on cp31 is OK: HTTP OK: HTTP/1.1 200 OK - 15850 bytes in 0.321 second response time [22:45:56] !log update mathoid on test3 [22:45:58] RECOVERY - cp31 Varnish Backends on cp31 is OK: All 9 backends are healthy [22:45:58] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:46:03] it needs to be readable by www-data [22:46:04] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:46:08] and writable [22:46:11] works now [22:46:20] RECOVERY - mw8 MediaWiki Rendering on mw8 is OK: HTTP OK: HTTP/1.1 200 OK - 
19998 bytes in 0.108 second response time [22:46:29] !log root@mw11:/home/paladox# chown www-data:www-data /var/www/.ssh/known_hosts [22:46:32] RECOVERY - cp30 Stunnel Http for mw8 on cp30 is OK: HTTP OK: HTTP/1.1 200 OK - 15850 bytes in 0.304 second response time [22:46:32] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:46:59] RECOVERY - cp20 Stunnel Http for mw8 on cp20 is OK: HTTP OK: HTTP/1.1 200 OK - 15858 bytes in 0.006 second response time [22:47:25] RECOVERY - cp21 Varnish Backends on cp21 is OK: All 9 backends are healthy [22:47:37] paladox: is the fingerprint i gave you for 8 right [22:47:38] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [22:48:19] RECOVERY - mw8 APT on mw8 is OK: APT OK: 0 packages available for upgrade (0 critical updates). [22:49:59] No, it's changed key [22:50:30] paladox: that is the key my laptop just gave me [22:50:38] Oh [22:50:50] guess i'm looking in the wrong place [22:51:43] ok, yeh that's correct [22:51:47] found /usr/local/bin/gen_fingerprints [22:52:13] !log repooled mw8 [22:52:17] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:52:49] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 5.48, 5.66, 7.97 [22:52:54] paladox: i don't see a deploy that's passed [22:53:04] !log [paladox@mw11] finished deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 - FAIL: [2, 0, 2, 2, 2, 2, 0, 0, 2, 0, 0, 0, 0, 0, 0] in 103s [22:53:07] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:53:17] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 [22:53:21] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:57:34] RECOVERY - cp20 Varnish Backends on cp20 is 
OK: All 9 backends are healthy [22:58:51] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 11.23, 7.17, 7.70 [22:59:09] !log change maximum amount of days we keep logs in graylog from 30 to 20 [22:59:13] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [22:59:36] !log delete 5 indices in graylog [22:59:40] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:00:02] !log [@mw11] starting deploy of {'l10nupdate': True} to all [23:00:03] !log [@test3] starting deploy of {'l10nupdate': True} to skip [23:00:06] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:00:10] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:00:28] RECOVERY - mon2 Current Load on mon2 is OK: OK - load average: 2.90, 3.09, 3.40 [23:01:53] !log rhinos@mw11:/srv/mediawiki/w$ sudo -u www-data rm cb7d668.diff [23:01:57] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:02:01] !log rhinos@mw11:/srv/mediawiki/w$ sudo -u www-data rm cb7d668.diff.zip [23:02:05] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:02:34] oh f [23:02:39] l10nupdate kicked in [23:02:53] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 6.62, 7.06, 7.62 [23:04:51] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 8.66, 7.68, 7.78 [23:05:48] !log killed l10nupdate on mw11 for servers=all to avoid conflicting with mw8 reimage [23:05:52] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:06:07] PROBLEM - cloud5 Current Load on cloud5 is CRITICAL: CRITICAL - load average: 27.18, 28.42, 22.14 [23:06:29] PROBLEM - mw10 Current Load on mw10 is CRITICAL: CRITICAL - load average: 8.33, 8.19, 6.70 [23:06:49] PROBLEM - mw11 Current Load on mw11 is WARNING: WARNING - load average: 6.55, 7.92, 7.91 [23:07:59] !log [paladox@mw11] finished deploy of {'config': True, 'world': True, 
'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 - FAIL: [0, 0, 0, 0, 0, 0, 0, 0, 0, 65280, 0, 0, 5888, 5888, 0] in 881s [23:08:01] why is it all perm errors [23:08:03] PROBLEM - cloud5 Current Load on cloud5 is WARNING: WARNING - load average: 16.37, 23.73, 21.15 [23:08:04] on mw11 [23:08:04] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:08:29] PROBLEM - mw10 Current Load on mw10 is WARNING: WARNING - load average: 6.83, 7.79, 6.74 [23:08:40] paladox: paste the errrors? [23:09:11] stuff like [23:09:12] rsync: [receiver] open "/srv/mediawiki/ErrorPages/.git/objects/2e/6b76afbb1200777ba33ba10a7b0794d651dd41" failed: Permission denied (13) [23:09:15] i've chowned it now [23:09:21] but why hasn't this been picked up? [23:09:28] !log [paladox@mw11] starting deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 [23:09:31] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:09:59] RECOVERY - cloud5 Current Load on cloud5 is OK: OK - load average: 11.15, 19.56, 19.93 [23:10:06] k [23:10:29] RECOVERY - mw10 Current Load on mw10 is OK: OK - load average: 3.53, 6.13, 6.25 [23:11:09] paladox: if it's just 1 repo [23:11:21] you can just select that one [23:11:35] you don't have to run for everything again [23:17:13] !log [paladox@mw11] finished deploy of {'config': True, 'world': True, 'landing': True, 'errorpages': True, 'l10n': True, 'force': True, 'ignoretime': True} to mw8 - FAIL: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 5888, 5888, 0] in 464s [23:17:16] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:17:44] what [23:17:47] it's the same error [23:17:48] rsync: [receiver] open "/srv/mediawiki/ErrorPages/.git/objects/d6/0c86325cfe7c14ccbf4593ca9184be5ed12f63" failed: Permission denied (13) [23:17:51] RhinosF1: ^ [23:18:06] urgh [23:18:15] paladox: paste the full 
thing [23:18:49] RECOVERY - mw11 Current Load on mw11 is OK: OK - load average: 4.14, 5.46, 6.48 [23:18:56] that is the full thing [23:19:12] also failed for landing [23:19:45] no write perm [23:20:19] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 8.54, 7.47, 5.88 [23:20:46] !log rm -rf .git from mediawiki/(landing|ErrorPages) [23:20:49] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:21:19] !log [rhinos@mw11] starting deploy of {'landing': True, 'errorpages': True, 'force': True, 'ignoretime': True} to mw8 [23:21:22] !log [rhinos@mw11] finished deploy of {'landing': True, 'errorpages': True, 'force': True, 'ignoretime': True} to mw8 - SUCCESS in 2s [23:21:25] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:21:28] paladox: done [23:21:29] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:21:34] thanks [23:22:19] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 5.75, 6.75, 5.81 [23:22:29] paladox: am i okay to run puppet on mw8 [23:22:33] yeh [23:24:00] paladox: should be good [23:24:07] thanks [23:24:52] paladox: how do you get host key for every server [23:25:06] so we can automate known_hosts [23:25:38] we do that here https://github.com/miraheze/puppet/blob/d4573b12e6f1b6525800dd34f10f90316866d4fb/modules/salt/manifests/init.pp#L32 [23:25:39] [url] puppet/init.pp at d4573b12e6f1b6525800dd34f10f90316866d4fb · miraheze/puppet · GitHub | github.com [23:28:33] [miraheze/CreateWiki] paladox pushed 1 commit to paladox-patch-1 [+0/-0/±1] https://git.io/JMKk6 [23:28:34] [miraheze/CreateWiki] paladox 22a9979 - Fix "Trying to access array offset on value of type null" [23:28:36] [CreateWiki] paladox created branch paladox-patch-1 - https://git.io/vpJTL [23:28:37] [CreateWiki] paladox opened pull request #263: Fix "Trying to access array offset on value of type null" - https://git.io/JMKki [23:29:38]
[miraheze/CreateWiki] paladox pushed 1 commit to paladox-patch-1 [+0/-0/±1] https://git.io/JMKk9 [23:29:40] [miraheze/CreateWiki] paladox 333db36 - Update CreateWikiJson.php [23:29:41] [CreateWiki] paladox synchronize pull request #263: Fix "Trying to access array offset on value of type null" - https://git.io/JMKki [23:29:48] PROBLEM - test3 Current Load on test3 is CRITICAL: CRITICAL - load average: 7.01, 3.96, 1.93 [23:31:47] RECOVERY - test3 Current Load on test3 is OK: OK - load average: 1.93, 3.05, 1.83 [23:33:12] !log [@test3] finished deploy of {'l10nupdate': True} to skip - SUCCESS in 1990s [23:33:15] Logged the message at https://meta.miraheze.org/wiki/Tech:Server_admin_log [23:33:30] miraheze/CreateWiki - paladox the build has errored. [23:37:50] miraheze/CreateWiki - paladox the build passed. [23:40:21] [miraheze/CreateWiki] paladox pushed 1 commit to paladox-patch-1 [+0/-0/±1] https://git.io/JMKI9 [23:40:23] [miraheze/CreateWiki] paladox 60ab2e6 - Update CreateWikiJson.php [23:40:24] [CreateWiki] paladox synchronize pull request #263: Fix "Trying to access array offset on value of type null" - https://git.io/JMKki [23:41:38] PROBLEM - graylog2 Current Load on graylog2 is CRITICAL: CRITICAL - load average: 11.50, 5.58, 2.72 [23:42:19] PROBLEM - mw9 Current Load on mw9 is CRITICAL: CRITICAL - load average: 13.03, 8.51, 6.42 [23:42:36] PROBLEM - mw12 Current Load on mw12 is CRITICAL: CRITICAL - load average: 23.65, 11.60, 7.04 [23:42:48] PROBLEM - cloud4 Current Load on cloud4 is CRITICAL: CRITICAL - load average: 26.21, 21.43, 17.49 [23:43:20] PROBLEM - mw8 Current Load on mw8 is CRITICAL: CRITICAL - load average: 8.36, 6.93, 5.17 [23:43:38] alerting : [FIRING:1] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [23:44:19] PROBLEM - mw9 Current Load on mw9 is WARNING: WARNING - load average: 4.84, 6.89, 6.08 [23:44:48] PROBLEM - cloud4 Current Load on cloud4 is
WARNING: WARNING - load average: 18.94, 20.52, 17.66 [23:45:20] RECOVERY - mw8 Current Load on mw8 is OK: OK - load average: 3.71, 5.68, 4.93 [23:46:19] RECOVERY - mw9 Current Load on mw9 is OK: OK - load average: 2.93, 5.49, 5.66 [23:48:17] miraheze/CreateWiki - paladox the build passed. [23:48:37] PROBLEM - mw12 Current Load on mw12 is WARNING: WARNING - load average: 4.12, 7.14, 6.66 [23:48:38] ok : [RESOLVED] (PHP-FPM Worker Usage High mediawiki) https://grafana.miraheze.org/d/dsHv5-4nz/mediawiki [23:48:48] RECOVERY - cloud4 Current Load on cloud4 is OK: OK - load average: 17.46, 19.06, 17.70 [23:50:36] RECOVERY - mw12 Current Load on mw12 is OK: OK - load average: 3.10, 5.64, 6.16 [23:51:38] PROBLEM - graylog2 Current Load on graylog2 is WARNING: WARNING - load average: 0.79, 3.23, 3.59 [23:53:38] RECOVERY - graylog2 Current Load on graylog2 is OK: OK - load average: 0.76, 2.41, 3.24 [23:53:44] [miraheze/CreateWiki] paladox pushed 1 commit to paladox-patch-1 [+0/-0/±1] https://git.io/JMKtz [23:53:46] [miraheze/CreateWiki] paladox 592d15e - Update CreateWikiJson.php [23:53:47] [CreateWiki] paladox synchronize pull request #263: Fix "Trying to access array offset on value of type null" - https://git.io/JMKki [23:54:10] [CreateWiki] paladox closed pull request #263: Fix "Trying to access array offset on value of type null" - https://git.io/JMKki [23:54:11] [miraheze/CreateWiki] paladox pushed 1 commit to master [+0/-0/±1] https://git.io/JMKtV [23:54:13] [miraheze/CreateWiki] paladox 52121d5 - Fix "Trying to access array offset on value of type null" (#263) [23:54:14] [CreateWiki] paladox deleted branch paladox-patch-1 - https://git.io/vpJTL [23:54:16] [miraheze/CreateWiki] paladox deleted branch paladox-patch-1 [23:55:50] PROBLEM - test3 Current Load on test3 is WARNING: WARNING - load average: 3.65, 2.48, 1.45 [23:57:48] RECOVERY - test3 Current Load on test3 is OK: OK - load average: 1.00, 1.85, 1.34
[23:58:32] PROBLEM - mon2 Current Load on mon2 is CRITICAL: CRITICAL - load average: 4.05, 3.43, 3.03 [23:58:50] PROBLEM - mw11 Current Load on mw11 is CRITICAL: CRITICAL - load average: 10.11, 7.68, 5.51 [23:59:34] [miraheze/mediawiki] paladox pushed 1 commit to REL1_36 [+0/-0/±1] https://git.io/JMKqq [23:59:35] [miraheze/mediawiki] paladox 8aab6ea - Update CreateWiki