[04:51:24] <marostegui>	 Amir1: you finished with the old s8 master?
[07:12:26] <Emperor>	 Would anyone care to 👀 and maybe +1 https://gerrit.wikimedia.org/r/c/operations/puppet/+/1037558 please? I know PCC is unhappy, but j.hathaway (who has been very helpful!) is of the view that this is a bug in PCC not the CR and that it'd be more useful to merge this and look at fixing PCC in due course rather than blocking on it.
[07:13:14] <Emperor>	 The change is a starter-for-ten on RGW (i.e. S3) setup for apus
[07:13:47] <Emperor>	 thanks arnaud.b :)
[07:13:54] <arnaudb>	 my pleasure!
[07:24:25] <jinxer-wm>	 FIRING: SystemdUnitFailed: envoyproxy.service on moss-fe1002:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[07:27:39] <Emperor>	 let me downtime that host for a bit, it's in dev
[07:34:21] <Emperor>	 [yes, puppet is happy, but has left me with an empty /etc/envoy/envoy.yaml so more work needed here...]
[08:04:46] <Emperor>	 Hm, fixed by running sudo /usr/local/sbin/build-envoy-config -c /etc/envoy which puppet should have done for me.
[08:26:50] <jynus>	 have you ever met: "resize2fs: Invalid argument While checking for on-line resizing support"
[08:30:40] <Emperor>	 sounds like the FS doesn't support it? [I'd lazily strace to see what actually got EINVAL]
[09:11:51] <Amir1>	 marostegui: sorry I just woke up. Yes. I'm done!
[09:13:51] <marostegui>	 Thanks!
[09:59:16] <Amir1>	 arnaudb: I will be done with s2 codfw in a day: https://orchestrator.wikimedia.org/web/cluster/alias/s2
[09:59:47] <Amir1>	 (the schema change goes alphabetically) 
[10:01:26] <Amir1>	 Can I pick the old s3 master in codfw? Are you done, marostegui and arnaudb?
[10:04:24] <arnaudb>	 ok for me!
[10:51:12] <marostegui>	 Amir1: I'm not doing anything with it
[10:51:24] <Amir1>	 awesome
[13:31:12] <Emperor>	 Couple of tiny but useful apus hiera updates if anyone's feeling kind, please? https://gerrit.wikimedia.org/r/c/operations/puppet/+/1037792 and https://gerrit.wikimedia.org/r/c/operations/puppet/+/1037791
[13:33:04] <Emperor>	 [PCC still busted on moss-fe1002]
[13:42:26] <Emperor>	 arnaud.b: thanks :)
[13:43:14] <arnaudb>	 anytime!
[13:49:25] <jinxer-wm>	 FIRING: SystemdUnitFailed: logrotate.service on moss-be1001:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed
[14:30:19] <topranks>	 arnaudb, Emporer: thanks for the input on the eqiad switch upgrade task and creating all the sub-tasks 
[14:30:31] <topranks>	 Emperor: even, damn I always spell that wrong :) 
[14:30:47] <arnaudb>	 :D
[14:31:03] <topranks>	 I'll propose to do them at 15:00 UTC, which is 4pm for me in Ireland, and 11am Eastern in the US 
[14:31:14] <topranks>	 if that works for you guys?
[14:31:37] <topranks>	 in terms of the tasks you created I started editing the description in them but I'm not sure that was right 
[14:31:54] <topranks>	 do you want to use those tasks just to track the actions you need to take for hosts you guys manage?
[14:32:15] <arnaudb>	 oh no it was mostly to take a first inventory 
[14:32:17] <topranks>	 if so I will create new, per-rack tasks (assigned to me) to track the actual network switch upgrades, and make those children of them 
[14:32:34] <arnaudb>	 you can edit at will, don't worry :D
[14:32:40] <topranks>	 ok, I've tried to do the inventory as best I can on the google sheet 
[14:33:06] <topranks>	 cool no probs, basically I need a "master" task for each rack which should be assigned to me to do the actual upgrade if that makes sense 
[14:33:45] <topranks>	 so I don't know whether to add to your tasks to turn them into that, or create my own set and make your ones children of that 
[14:33:47] <topranks>	 either works for me 
[14:42:55] <Emperor>	 topranks: I don't mind - if you roll them in together we don't have to remember to separately close ours
[14:43:04] <arnaudb>	 exactly!
[14:43:27] <topranks>	 ok guys thanks, yeah that sounds good to me, don't want to overdo it with a million tasks 
[14:43:30] <topranks>	 cheers :)
[15:00:09] <Emperor>	 topranks: time-wise, that's fine with me except for when it clashes with the staff meeting, which I'd rather not miss if poss (but obviously if that's the only good time I can live with it!)
[15:04:02] <topranks>	 They're all planned for Tuesdays and Thursdays so with any luck that won't happen, but I'll double-check 
[15:04:29] <topranks>	 sorry staff meeting rather than SRE meeting - good call 
[15:04:56] <topranks>	 Maybe 14:00 UTC is better in that case to avoid it 
[15:08:41] <Emperor>	 that's good for me
[15:11:07] <topranks>	 cool thanks I'll do that 
[15:13:55] <Emperor>	 ta
[15:14:25] <jinxer-wm>	 RESOLVED: SystemdUnitFailed: logrotate.service on moss-be1001:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed