[09:33:36] 10netops, 06Infrastructure-Foundations: Testing liberica with ncredir@eqiad - https://phabricator.wikimedia.org/T378453#10291794 (10ops-monitoring-bot) Cookbook cookbooks.sre.hosts.reimage was started by vgutierrez@cumin1002 for host lvs1013.eqiad.wmnet with OS bookworm [10:07:47] 10netops, 06Infrastructure-Foundations: Testing liberica with ncredir@eqiad - https://phabricator.wikimedia.org/T378453#10291866 (10ops-monitoring-bot) Cookbook cookbooks.sre.hosts.reimage started by vgutierrez@cumin1002 for host lvs1013.eqiad.wmnet with OS bookworm completed: - lvs1013 (**PASS**) - Downtime... [13:11:22] 10CAS-SSO, 06Infrastructure-Foundations: Migrate cloudinfra-idp-1 to Bookworm - https://phabricator.wikimedia.org/T379069 (10SLyngshede-WMF) 03NEW [13:11:28] 10CAS-SSO, 06Infrastructure-Foundations: Migrate cloudinfra-idp-1 to Bookworm - https://phabricator.wikimedia.org/T379069#10292484 (10SLyngshede-WMF) p:05Triage→03Medium [13:36:40] 10netops, 06Infrastructure-Foundations, 06SRE: Top-of-rack 'MoveServersUplinks' Netbox scripts doesn't clean up the old trunk port - https://phabricator.wikimedia.org/T375216#10292564 (10ayounsi) I added some logging (`self.log_info(f"{interface} {interface.enabled} {interface.untagged_vlan} {interface.tagge... [13:58:43] 10netops, 06Infrastructure-Foundations, 06SRE: Top-of-rack 'MoveServersUplinks' Netbox scripts doesn't clean up the old trunk port - https://phabricator.wikimedia.org/T375216#10292651 (10ayounsi) Another point, after running the script, the changelog on a problematic interface shows 3 changes (for that inter... [15:45:02] jhathaway: if I remember properly, you are in charge of the Puppet Catalogue Compiler workers. I have switched them to use Java 17 (for those that are on bullseye) https://gerrit.wikimedia.org/g/cloud/instance-puppet/+/bdaffdd284cd801d21c442fe7f1e4b953daa706d%5E!/#F0 [15:45:10] that is required by the newer Jenkins version [15:45:24] which I have upgraded some minutes ago ) [15:45:29] thanks hashar [17:11:17] 07Puppet, 10MW-on-K8s, 10Observability-Alerting, 10SRE Observability (FY2024/2025-Q2): Clean up "git repo needs merge" checks - https://phabricator.wikimedia.org/T370530#10293583 (10lmata) [17:11:46] 10Packaging, 06Infrastructure-Foundations, 13Patch-For-Review, 10SRE Observability (FY2024/2025-Q2): upgrade prometheus-ipmi-exporter to 1.8.0 - https://phabricator.wikimedia.org/T368088#10293589 (10lmata) [18:07:51] 10netbox, 06Infrastructure-Foundations: Netbox: ImportPuppetDB uses wrong netmask for some hosts - https://phabricator.wikimedia.org/T378751#10293881 (10cmooney) 05Open→03Resolved a:03cmooney [18:47:39] 10netops, 10fundraising-tech-ops, 06Infrastructure-Foundations: Test prototype fundraising pybal replacement based on haproxy + anycast-healthchecker. - https://phabricator.wikimedia.org/T373942#10293980 (10Jgreen) 05In progress→03Open [19:40:31] 10netops, 06DC-Ops, 10fundraising-tech-ops, 06Infrastructure-Foundations, and 2 others: Frack eqiad network upgrade: design, installation and configuration - https://phabricator.wikimedia.org/T377381#10294144 (10cmooney) >>! In T377381#10274577, @Jclark-ctr wrote: > @cmooney fyi i have 10x of the 100g gre... [20:54:37] 10netops, 10SRE-tools, 06Infrastructure-Foundations, 06SRE, 13Patch-For-Review: Setup zero touch provisioning (ZTP) for network devices - https://phabricator.wikimedia.org/T336485#10294334 (10cmooney) I used the cookbook to provision the two new frack switches in eqiad this evening. Mostly it worked ok,... [21:53:25] FIRING: SystemdUnitFailed: wmf_auto_restart_prometheus-memcached-exporter.service on idp1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed