[09:44:13] Heads-up that I'm going to be upgrading superset at 10:00 UTC, so it'll be unavailable for a while. You can still use superset-next if you need it while the work is going on. [09:48:14] <3 thanks a lot btullis!!! [10:15:12] <_joe_> btullis: thanks [10:18:25] hi all back from vacation, catching up on emails but please bing if there is something youd like me to take a look at first [10:19:10] hi jbond welcome back [10:19:18] welcome back jbond :D [10:19:42] thanks :) [10:19:55] welcome back! [10:23:11] jbond: o/ [10:24:58] elukey: is thats a raised hand or a high five :) [10:26:20] jbond: the latter, welcome back :D [10:26:39] in that case o/ and thanks :) [11:00:34] superset version 1.5.3 is in production now - please let me know if anything doesn't seem right to you and I'll look into it. [11:01:37] <3 [11:02:15] I'll apply the changes I did on -next to improve the filters, thanks a lot! [11:03:00] Great!You're very welcome. [11:11:36] jbond: welcome! [11:17:20] thanks kwakuofori [11:18:26] moritzm: re T326942, https://gerrit.wikimedia.org/r/c/operations/puppet/+/882602 will prevent from puppet writing invalid configs to acme-chief [11:18:27] T326942: Ci check for acme-chief changes - https://phabricator.wikimedia.org/T326942 [11:19:06] nice, I'll review in a bit [11:30:24] {done with the filters changes} [11:30:48] volans: ak, thanks. [11:33:51] whose is snapshot1004? it's started emailing every minute to say its sad... [11:37:13] Emperor: snapshot1014? [11:37:24] snapshot1014 is a Host being setup by Core Platform SREs (insetup::core_platform) [11:37:44] volans: that's the one, yes, sorry I can't type today [11:41:56] * Emperor doesn't know who Core Platform is :-/ [11:42:35] what became Platform [11:42:56] but I think it might be of interest to apergos mainly [11:43:13] Oh, yes, platform engineering, duh. [11:48:49] it's mine and I don't know why it's doing that [11:49:08] it's not fully set up and has been in that state for some time (pending both new dumpsdata hosts being ready) [11:49:14] well [11:49:20] "mine", anyways, you get the point [11:52:13] Emperor: ^^ [11:53:36] apergos: thanks for the update; weird it's only just started spamming us all :-/ [11:54:14] I mean it has been in the 'not ready' situation for some time [11:54:21] I don't know about the spam, that is weird all right [11:54:36] I wonder what changed [11:55:57] maybe if I apply role dumps testbed to it, it will shut up (because it will have all the directories and files and such [11:55:59] ) [11:58:05] but 15 should spam us the same way and it isn't. [11:58:10] any thoughts? [11:58:21] Emperor: ^ [12:00:21] snapshot1015 is certainly missing the last_run_summary.yaml file too [12:01:57] apergos: snapshot1015 has '# Puppet Name: prometheus_puppet_agent_stats' in prometheus crontab, and then not the job itself, cf snapshot1014 which has that Puppet header and then the cron job itself [12:02:28] why is there that difference? (since I have not done things to either host and they both have the same role applied) [12:02:30] prometheus_puppet_agent_stats should nowadays be a systemd timer, not a cron job [12:02:55] taavi: it's cron that's doing the running (and the spamming) here [12:03:13] apergos: I could try deleting that line in crontab and re-running puppet-agent to see if it gets put back? [12:03:39] also it's supposed to run as root, not as prometheus [12:04:06] I just report what is in crontab, not what _should_ be there :) [12:04:18] I would try deleting it and seeing what happens [12:05:12] doing so [12:05:24] now re-running puppet [12:05:30] crossing fingers [12:05:45] maybe the cleanup didn't get done completely on the one host [12:06:10] well, puppet hasn't put the crontab entry back, so I think we're good now [12:06:31] * Emperor waits for an "haha lol, you were wrong" email from cron in a minute [12:11:49] waiting for the sweet sound of spam silence myself... [12:23:24] silence is golden, that must have been the fix, tyvm! [14:27:33] akosiaris: unless you object ill take over 881872 (xihua) there are a few CI things i wanted to fix as well [14:30:01] jbond: no objections, go ahead [14:30:11] thanks! [14:30:14] great thanks [14:59:57] <_joe_> !incidents [14:59:57] 3266 (UNACKED) db1105 (paged)/MariaDB Replica SQL: s2 (paged) [15:00:03] <_joe_> !ack 3266 [15:00:04] 3266 (ACKED) db1105 (paged)/MariaDB Replica SQL: s2 (paged) [15:01:03] <_joe_> !incidents [15:01:04] 3266 (ACKED) db1105 (paged)/MariaDB Replica SQL: s2 (paged) [15:01:04] 3267 (UNACKED) db1170 (paged)/MariaDB Replica SQL: s7 (paged) [15:01:10] <_joe_> !ack 3267 [15:01:10] 3267 (ACKED) db1170 (paged)/MariaDB Replica SQL: s7 (paged) [15:02:32] <_joe_> btw oncall people ^^ [15:02:49] <_joe_> this is much handier to me than that damn application [15:03:03] !incidents [15:03:04] 3266 (ACKED) db1105 (paged)/MariaDB Replica SQL: s2 (paged) [15:03:04] 3267 (ACKED) db1170 (paged)/MariaDB Replica SQL: s7 (paged) [15:03:10] neat! [15:04:39] <_joe_> there's also !resolve [15:04:48] <_joe_> but I never tested if it works