[09:39:00] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (fgiunchedi) Thank you for starting this @Dzahn ! I agree re: evaluating whether or not the alert itself is useful nowadays. As a general...
[13:12:31] Beta-Cluster-Infrastructure: Replace deployment-prometheus02 - https://phabricator.wikimedia.org/T324782 (TheresNoTime) Open→Resolved
[15:51:43] GitLab (Infrastructure), SRE, Traffic, serviceops-collab, Patch-For-Review: Deprecate and disable port 80 for one-off sites under canonical domains - https://phabricator.wikimedia.org/T238720 (BCornwall)
[15:53:41] GitLab (Infrastructure), SRE, Traffic, serviceops-collab, Patch-For-Review: Deprecate and disable port 80 for one-off sites under canonical domains - https://phabricator.wikimedia.org/T238720 (BCornwall) In progress→Resolved I went ahead and struck lists off of the.... list since it s...
[16:05:19] Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn)
[16:08:08] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn)
[16:09:23] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) @Aklapper SRE would be happy about advice from you...
[16:17:06] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) @Ladsgroup How do you feel about https://github.co...
[16:25:33] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) The current tags that the bot adds the SRE tag to...
[17:16:01] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Aklapper) >>! In T334294#8765358, @Dzahn wrote: > @Aklapp...
[17:19:28] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn) Thank you @fgiunchedi ! Yes, makes sense to me. I will start by questioning the etherpad check. It was added back in T82936 whic...
[17:22:41] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn)
[17:39:31] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn)
[18:39:07] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn) cc: @Arnoldokoth for the 2 VRTS-related checks (clam AV monitoring). This could be considered part of the general "improve VRTS m...
[18:43:14] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn) cc: @brennen Turns out I added process monitoring of `phd` back in T957 :) (https://gerrit.wikimedia.org/r/c/operations/puppet/+/1...
[18:46:50] Release-Engineering-Team (Radar), serviceops-collab: sre-collab/releng: convert or remove all nrpe::monitor_service checks - https://phabricator.wikimedia.org/T334250 (Dzahn) cc: @hashar @colewhite @fgiunchedi I see we already have T233089 "Export zuul metrics to Prometheus". Seems like that would be th...
[18:48:04] Continuous-Integration-Infrastructure, Release-Engineering-Team (Seen), SRE, observability, Patch-For-Review: Export zuul metrics to Prometheus - https://phabricator.wikimedia.org/T233089 (Dzahn) If we had this we could maybe remove the Icinga process checks we have for zuul and zuul-merger (...
[20:31:34] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Ladsgroup) The switch from herald to maint bot was done b...
[21:11:04] GitLab (Project Migration), Release-Engineering-Team (Priority Backlog 📥), Abstract Wikipedia team, Anti-Harassment, and 16 others: Migrate PipelineLib repos to GitLab - https://phabricator.wikimedia.org/T332953 (thcipriani)
[21:21:58] Beta-Cluster-Infrastructure: There is a link issue In the test wiki... - https://phabricator.wikimedia.org/T334309 (datawow)
[21:25:05] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) Thank you for all the details @Ladsgroup , it's mu...
[21:36:00] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Ladsgroup) >>! In T334294#8765857, @Dzahn wrote: > Thank...
[21:40:31] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) >>! In T334294#8765862, @Ladsgroup wrote: > defini...
[21:46:25] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) {F36942751} see lower left corner in the "new proj...
[21:47:40] Phabricator, Release-Engineering-Team (Radar), phabricator maintenance bot, SRE, serviceops-collab: phabricator maintenance bot should not add the SRE tag to (certain) subteam tasks any more - https://phabricator.wikimedia.org/T334294 (Dzahn) {F36942753}
[21:59:52] Release-Engineering-Team (Priority Backlog 📥), serviceops, Patch-For-Review, Release Pipeline (Blubber): Buildkit erroring with "cannot reuse body, request must be retried" upon multi-platform push - https://phabricator.wikimedia.org/T322453 (dduvall) Open→Resolved a:dduvall Confirmed...
[22:32:50] GitLab, Release-Engineering-Team: Error when using mutli arch build on gitlab with blubber and kokkuri - https://phabricator.wikimedia.org/T334254 (dduvall) @dcausse thanks for filing this. Yes, I've been trying to get Blubber's multiplatform image published for a while now, but the problem there was re...
[22:34:43] GitLab, Release-Engineering-Team (Priority Backlog 📥): Error when using mutli arch build on gitlab with blubber and kokkuri - https://phabricator.wikimedia.org/T334254 (dduvall)
[22:49:19] GitLab (CI & Job Runners), Release-Engineering-Team (Priority Backlog 📥): Consider enabling distributed caching for GitLab runners - https://phabricator.wikimedia.org/T328516 (thcipriani)
[22:49:25] GitLab (CI & Job Runners), Release-Engineering-Team (Priority Backlog 📥), serviceops-collab, User-brennen: Provision untrusted instance-wide GitLab job runners to handle user-level projects and merge requests from forks - https://phabricator.wikimedia.org/T297426 (thcipriani)
[22:50:17] GitLab (CI & Job Runners), Release-Engineering-Team (Priority Backlog 📥), serviceops-collab, User-brennen: Provision untrusted instance-wide GitLab job runners to handle user-level projects and merge requests from forks - https://phabricator.wikimedia.org/T297426 (thcipriani) Open→Resolved
[22:51:18] GitLab (CI & Job Runners), Release-Engineering-Team (Priority Backlog 📥), serviceops-collab, User-brennen: Provision untrusted instance-wide GitLab job runners to handle user-level projects and merge requests from forks - https://phabricator.wikimedia.org/T297426 (thcipriani) There are instance-w...
[22:52:17] GitLab (CI & Job Runners), Release-Engineering-Team (Priority Backlog 📥): Consider enabling distributed caching for GitLab runners - https://phabricator.wikimedia.org/T328516 (thcipriani) Removed and close the previous parent task of this task. I believe this feature may be used for a subset of our runn...