[00:07:46] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11113548 (10Ladsgroup) To be sure update category membership is the culprit, I went through all slow write queries reordered by the master around the time of th... [00:11:51] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11113549 (10Ladsgroup) Specifically these edits seemed to be the main reason: https://commons.wikimedia.org/w/index.php?title=Special:Contributions/Yac%C3%A0wot... [01:39:44] FIRING: [2x] NodeTextfileStale: Stale textfile for acmechief1002:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [01:49:44] FIRING: [7x] NodeTextfileStale: Stale textfile for cloudnet2007-dev:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [01:57:44] FIRING: [14x] NodeTextfileStale: Stale textfile for durum1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [01:58:44] FIRING: [56x] NodeTextfileStale: Stale textfile for cp1101:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [01:59:02] FIRING: [8x] NodeTextfileStale: Stale textfile for lvs1016:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - 
https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [01:59:43] FIRING: [14x] NodeTextfileStale: Stale textfile for ncredir1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [02:03:44] FIRING: [14x] NodeTextfileStale: Stale textfile for doh1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [02:03:44] FIRING: [56x] NodeTextfileStale: Stale textfile for cp1100:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [02:04:43] FIRING: [15x] NodeTextfileStale: Stale textfile for lvs3008:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [02:10:43] FIRING: [16x] NodeTextfileStale: Stale textfile for dns1004:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [02:14:25] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11113595 (10Zache) @Ladsgroup : Just FYI, from the Cat-a-lot code side, the user was using a pre-August 18, 2024 version of Cat-a-lot which didn't have the thro... 
[02:39:03] 06Traffic, 10MediaWiki-Platform-Team (Radar), 13Patch-For-Review: [Rollout Phase 1] Implement redirect-less mobile routing and enable for wikitech.wikimedia.org - https://phabricator.wikimedia.org/T401595#11113621 (10Krinkle) >>! In T401595#11113593, @gerritbot wrote: > Change #1181310 had a related patch se... [05:39:44] FIRING: [2x] NodeTextfileStale: Stale textfile for acmechief1002:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [05:49:44] FIRING: [7x] NodeTextfileStale: Stale textfile for cloudnet2007-dev:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [05:57:44] FIRING: [14x] NodeTextfileStale: Stale textfile for durum1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [05:58:44] FIRING: [56x] NodeTextfileStale: Stale textfile for cp1101:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [05:58:58] FIRING: [8x] NodeTextfileStale: Stale textfile for lvs1016:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [05:59:43] FIRING: [14x] NodeTextfileStale: Stale textfile for ncredir1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - 
https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [06:03:44] FIRING: [14x] NodeTextfileStale: Stale textfile for doh1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [06:03:44] FIRING: [56x] NodeTextfileStale: Stale textfile for cp1100:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [06:04:43] FIRING: [15x] NodeTextfileStale: Stale textfile for lvs3008:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [06:10:43] FIRING: [16x] NodeTextfileStale: Stale textfile for dns1004:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:13:36] 06Traffic, 10envoy, 06serviceops, 06SRE, 13Patch-For-Review: Upgrade Envoy to v1.26.8 and drop buster - https://phabricator.wikimedia.org/T402584#11113754 (10MoritzMuehlenhoff) We also have 237 baremetal hosts with Envoy, how shall we handle these? We could e.g. add a profile parameter $use_future to pro... 
[07:33:41] 06Traffic, 10envoy, 06serviceops, 06SRE, 13Patch-For-Review: Upgrade Envoy to v1.26.8 and drop buster - https://phabricator.wikimedia.org/T402584#11113776 (10hashar) I have updated the [[ https://integration.wikimedia.org/ci/job/helm-lint/ | helm-lint ]] job to the new image :) [07:36:54] 06Traffic, 06SRE, 13Patch-For-Review, 10WE4.2 Bot detection (WE4.2 hCaptcha account creation trial): hCaptcha: Ensure GeoIP and WMF-Uniq cookies are removed in proxied requests - https://phabricator.wikimedia.org/T402713#11113808 (10kostajh) [07:54:28] FIRING: [15x] NodeTextfileStale: Stale textfile for lvs3008:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:54:29] FIRING: [14x] NodeTextfileStale: Stale textfile for ncredir1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:54:29] RESOLVED: [2x] NodeTextfileStale: Stale textfile for acmechief1002:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:54:57] RESOLVED: [7x] NodeTextfileStale: Stale textfile for cloudnet2007-dev:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:55:50] RESOLVED: [16x] NodeTextfileStale: Stale textfile for dns1004:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - 
https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:58:25] RESOLVED: [14x] NodeTextfileStale: Stale textfile for durum1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:59:47] RESOLVED: [56x] NodeTextfileStale: Stale textfile for cp1100:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [07:59:52] RESOLVED: [56x] NodeTextfileStale: Stale textfile for cp1101:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [08:00:06] RESOLVED: [8x] NodeTextfileStale: Stale textfile for lvs1016:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [08:00:30] RESOLVED: [14x] NodeTextfileStale: Stale textfile for doh1001:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [08:00:47] RESOLVED: [15x] NodeTextfileStale: Stale textfile for lvs3008:9100 - https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [08:00:50] RESOLVED: [14x] NodeTextfileStale: Stale textfile for ncredir1001:9100 - 
https://wikitech.wikimedia.org/wiki/Prometheus#Stale_file_for_node-exporter_textfile - https://grafana.wikimedia.org/d/knkl4dCWz/node-exporter-textfile - https://alerts.wikimedia.org/?q=alertname%3DNodeTextfileStale [08:24:43] FIRING: [10x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [08:29:43] FIRING: [30x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [08:34:43] RESOLVED: [48x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [12:12:40] 06Traffic, 10Page Content Service, 06Wikipedia-Android-App-Backlog, 10Content-Transform-Team (Work In Progress): [[2025 Coeur d'Alene shooting]] showing old version in Android app - https://phabricator.wikimedia.org/T398243#11114589 (10A_smart_kitten) >>! In T398243#11094672, @Jgiannelos wrote: > This... [12:20:58] vgutierrez: Happy Monday, regarding the thumb sizes rate limiting, this can give a pretty good idea of what sizes are being requested [12:20:58] select cache_status, split(split(uri_path, '/')[7], 'px-')[0] as thumbsize, count(*) as hitcount from wmf.webrequest where webrequest_source = 'upload' and year = 2025 and month = 8 and day = 20 and http_status = 200 and uri_path like '/wikipedia/%/thumb/%' group by split(split(uri_path, '/')[7], 'px-')[0], cache_status order by hitcount desc limit 500; [12:21:24] If there is a ticket, I can post the sizes, etc. 
[12:33:29] 06Traffic: Consider rate limiting non-standard thumbnail sizes - https://phabricator.wikimedia.org/T402792 (10Vgutierrez) 03NEW [12:33:36] Amir1: ^^ [12:34:07] 06Traffic: Consider rate limiting non-standard thumbnail sizes - https://phabricator.wikimedia.org/T402792#11114636 (10Vgutierrez) p:05Triage→03Medium [12:34:19] Thanks! [12:41:24] Amir1: BTW an initial check suggests that the mobile version of the wikis uses some thumbnail sizes outside the standard sizes [12:42:10] Amir1: stuff like https://upload.wikimedia.org/wikipedia/commons/thumb/5/55/WMA_button2b.png/34px-WMA_button2b.png [12:43:34] 34x and 17x appear as the 3rd and 4th most requested thumbnails according to turnilo in the last day [12:43:39] *px [12:44:41] that's actually not coming from mobile [12:45:14] it's people hard-coding the url to the thumb in common.css [12:45:33] mobile as in en.m.wikipedia.org [12:45:49] given it's the top 1 referer for that thumbnail [12:47:34] vgutierrez: it's this https://global-search.toolforge.org/?q=%22WMA_button2b.png%22&namespaces=8&title= [12:47:57] I can get someone with global rights to fix it [12:48:18] yeah.. we would need to bump those two to 20px and 40px respectively [12:48:24] yup [12:48:45] we could probably start with stuff bigger than the biggest standard size used by MW [12:48:55] so >960px [12:59:23] 06Traffic: Consider rate limiting non-standard thumbnail sizes - https://phabricator.wikimedia.org/T402792#11114747 (10Ladsgroup) I asked for someone with global interface admin rights to change wikimini atlas size. 
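Editor's note: the width extraction in the query at 12:20:58 and the ">960px" starting point suggested at 12:48:55 can be sketched in Python as below. This is only an illustration, not the rule Traffic deployed. The function names and the constant are hypothetical; the 960 value is taken from the conversation, and the path layout is assumed to match the `/wikipedia/%/thumb/%` pattern in the query.

```python
from typing import Optional

# Assumption: 960 is the largest standard thumbnail width, as stated in the chat.
LARGEST_STANDARD_WIDTH = 960


def thumb_width(uri_path: str) -> Optional[int]:
    """Mirror split(split(uri_path, '/')[7], 'px-')[0] from the Hive query.

    Returns None when the path is too short or the prefix is not a plain
    number (for example multi-page thumbnails with a 'page1-' prefix).
    """
    parts = uri_path.split("/")
    if len(parts) < 8:
        return None
    prefix = parts[7].split("px-")[0]
    if prefix.isascii() and prefix.isdigit():
        return int(prefix)
    return None


def exceeds_largest_standard(uri_path: str) -> bool:
    """True if the requested width is above the assumed largest standard size."""
    width = thumb_width(uri_path)
    return width is not None and width > LARGEST_STANDARD_WIDTH


if __name__ == "__main__":
    example = "/wikipedia/commons/thumb/5/55/WMA_button2b.png/34px-WMA_button2b.png"
    print(thumb_width(example), exceeds_largest_standard(example))
```

The 34px example from the log is non-standard but would not be caught by this first step, which is consistent with the plan of fixing the hard-coded URL separately and starting the limit only above the largest standard size.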
[13:18:17] vgutierrez: Bartosz is fixing the wikiminiatlas [13:20:06] thx [13:25:37] 10netops, 06Infrastructure-Foundations, 06SRE: Investigate using BGP addpath for unicast IBGP spine/leaf pods - https://phabricator.wikimedia.org/T402640#11114837 (10cmooney) [13:26:40] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11114838 (10TheDJ) There's reports that this breaks command line download of mediawiki tarballs via https://releases.wikimedia.org/mediawiki/1.44/ That se... [13:28:43] FIRING: [13x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [13:33:43] FIRING: [36x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [13:33:57] uh [13:38:43] RESOLVED: [48x] HaproxyKafkaSocketDroppedMessages: Unexpected rate of dropped messages from HaproxyKafka - https://wikitech.wikimedia.org/wiki/HAProxyKafka#HaproxyKafkaSocketDroppedMessages - https://alerts.wikimedia.org/?q=alertname%3DHaproxyKafkaSocketDroppedMessages [13:49:11] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11114961 (10Bugreporter) >>! In T400119#11114838, @TheDJ wrote: > There's reports that this breaks command line download of mediawiki tarballs via https://... 
[14:04:56] 10netops, 10Ganeti, 06Infrastructure-Foundations: magru: move sandbox vlan to routed Ganeti - https://phabricator.wikimedia.org/T402372#11115030 (10ayounsi) [14:10:49] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11115049 (10TheDJ) Yeah getting the swagger spec via `curl https://api.wikimedia.org/core/v1/wikipedia/en/search/page?q=earth&limit=10` also no longer work... [14:49:05] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11115266 (10Bugreporter) curl/wget should still be rate limited with 1/s. [14:56:25] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11115304 (10Vgutierrez) [14:57:20] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11115308 (10Vgutierrez) [14:57:59] 06Traffic, 06SRE, 13Patch-For-Review, 07User-notice: Block traffic from user-agents not honoring our policy - https://phabricator.wikimedia.org/T400119#11115313 (10Vgutierrez) [15:13:04] 06Traffic, 06DC-Ops, 10ops-magru: planned power redundancy depreciation 2025-09-20 @ 18:00 GMT to 2025-09-21 @ 21:00 GMT - https://phabricator.wikimedia.org/T402818 (10RobH) 03NEW p:05Triage→03Medium [16:19:49] 06Traffic, 10envoy, 06serviceops, 06SRE: Upgrade Envoy to v1.26.8 and drop buster - https://phabricator.wikimedia.org/T402584#11115783 (10RLazarus) >>! In T402584#11113754, @MoritzMuehlenhoff wrote: > We also have 237 baremetal hosts with Envoy, how shall we handle these? We could e.g. add a profile parame... 
[17:12:22] 06Traffic: Consider rate limiting non-standard thumbnail sizes - https://phabricator.wikimedia.org/T402792#11116049 (10matmarex) It looks like WikiMiniAtlas is maintained in a GitHub repo, I proposed a patch: https://github.com/dschwen/wikiminiatlas/pull/42 and I can copy it over to the wikis once it's accepted. [19:09:12] 06Traffic, 10Maps, 06SRE: Allow Wikimedia Maps usage on  - https://phabricator.wikimedia.org/T402846 (10GuidoSP) 03NEW Closing this task as invalid due to missing information. [19:56:11] 06Traffic, 06Fundraising Tech - Chaos Crew, 06Fundraising-Backlog, 10MediaWiki-extensions-CentralNotice, and 3 others: ESI test string is still shipped by CentralNotice - https://phabricator.wikimedia.org/T400472#11116690 (10Ejegg) I've given the config change a C+1 - I believe it's not supposed to get a C... [22:34:41] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11117263 (10Ladsgroup) >>! In T402749#11113595, @Zache wrote: > @Ladsgroup : Just FYI, from the Cat-a-lot code side, the user was using a pre-August 18, 2024 ve... [22:54:25] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11117358 (10JJMC89) >>! In T402749#11117263, @Ladsgroup wrote: >>>! In T402749#11113595, @Zache wrote: >> @Ladsgroup : Just FYI, from the Cat-a-lot code side, t... [23:29:40] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11117413 (10Ladsgroup) >>! In T402749#11117358, @JJMC89 wrote: >>>! In T402749#11117263, @Ladsgroup wrote: >>>>! In T402749#11113595, @Zache wrote: >>> @Ladsgro... [23:33:40] 06Traffic, 06Commons, 06DBA, 06SRE: Unable to save edits or delete pages on Commons – database lag - https://phabricator.wikimedia.org/T402749#11117423 (10Josve05a) >>! 
In T402749#11117413, @Ladsgroup wrote: > [...] Maybe someone should mention it to them? There is https://commons.wikimedia.org/wiki/U...