[01:16:11] 10netops, 10Infrastructure-Foundations, 10Metrics, 10SRE: replace check_ripe_atlas Python script with a check_prometheus backed by atlasexporter data - https://phabricator.wikimedia.org/T251155 (10lmata) [01:16:21] 10netops, 10Infrastructure-Foundations, 10Metrics, 10SRE: add traceroute measurements to RIPE Atlas prometheus data - https://phabricator.wikimedia.org/T251156 (10lmata) [01:16:56] 10Traffic, 10Metrics, 10SRE, 10Performance-Team (Radar), 10Sustainability (Incident Followup): Document and/or improve navigation of the various HTTP frontend Grafana dashboards - https://phabricator.wikimedia.org/T253655 (10lmata) [03:38:42] 10netops, 10Infrastructure-Foundations, 10Logging, 10SRE: Provision plaintext syslog collectors in esams/ulsfo/eqsin - https://phabricator.wikimedia.org/T243065 (10lmata) [06:17:06] 10netops, 10Infrastructure-Foundations, 10SRE: Lumen eqiad-codfw link down - https://phabricator.wikimedia.org/T288218 (10ayounsi) 05Open→03Resolved Back up Friday 6th, around 10am. Reason was fibercut due to a fire. [08:51:56] (VarnishTrafficDrop) firing: 52% GET drop in text@ during the past 30 minutes - https://grafana.wikimedia.org/d/000000180/varnish-http-requests?viewPanel=6 - https://alerts.wikimedia.org [08:56:56] (VarnishTrafficDrop) firing: (2) 14% GET drop in text@ during the past 30 minutes - https://grafana.wikimedia.org/d/000000180/varnish-http-requests?viewPanel=6 - https://alerts.wikimedia.org [09:01:56] (VarnishTrafficDrop) resolved: (2) 64% GET drop in text@ during the past 30 minutes - https://grafana.wikimedia.org/d/000000180/varnish-http-requests?viewPanel=6 - https://alerts.wikimedia.org [20:04:02] 10Traffic, 10SRE, 10serviceops: Unexpected upload speed to commons - https://phabricator.wikimedia.org/T288481 (10aborrero) [20:58:22] 10Traffic, 10SRE, 10serviceops: Unexpected upload speed to commons - https://phabricator.wikimedia.org/T288481 (10aborrero)