[07:56:42] Data-Engineering, Traffic: Request for a new request dataset for caching research - https://phabricator.wikimedia.org/T401331#11200325 (yazhuoz) Hi there! Just wanted to follow up and check if there have been any updates on this request. I’d be happy to provide any additional context or clarifications if...
[08:51:12] Data-Engineering, Data-Engineering-Radar, Data-Platform-SRE (2025.09.05 - 2025.09.26): Do performance testing of a big Hadoop Table hosted by Ceph - https://phabricator.wikimedia.org/T381416#11200577 (BTullis) a:BTullis
[08:53:38] Data-Engineering, cloud-services-team, Community-Tech, Data-Services, and 2 others: Unexpected error "Subquery returns more than 1 row" on wiki replicas - https://phabricator.wikimedia.org/T404473#11200603 (fnegri)
[11:34:24] Data-Engineering: Requesting Kerberos access for sd - https://phabricator.wikimedia.org/T405219 (SD0001) NEW
[13:34:35] Data-Engineering, Traffic: Request for a new request dataset for caching research - https://phabricator.wikimedia.org/T401331#11201482 (ssingh) >>! In T401331#11200325, @yazhuoz wrote: > Hi there! Just wanted to follow up and check if there have been any updates on this request. I’d be happy to provide a...
[14:13:21] Data-Engineering, Data-Engineering-Radar, CirrusSearch, Discovery-Search (2025.09.05 - 2025.09.26), and 2 others: SUP: Serde w/o RowTypeInfo - https://phabricator.wikimedia.org/T404597#11201635 (Ottomata) Let's chat! I think I need more context.
[14:15:03] !log restarting the hadoop-yarn-resourcemanager.service on an-master1003 and then an-master1004 for T404871
[14:15:06] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[14:15:07] T404871: Add resource preemption to Hadoop Yarn scheduler - https://phabricator.wikimedia.org/T404871
[14:30:18] Data-Engineering: Requesting Kerberos access for sd - https://phabricator.wikimedia.org/T405219#11201694 (Ottomata)
[14:30:35] Data-Engineering: Requesting Kerberos access for sd - https://phabricator.wikimedia.org/T405219#11201698 (Ottomata)
[14:31:02] Data-Engineering, Data-Platform-SRE, SRE-Access-Requests: Requesting Kerberos access for sd - https://phabricator.wikimedia.org/T405219#11201701 (Ottomata) Approved.
[14:45:20] Data-Engineering, Data-Engineering-Radar, Data-Platform-SRE, Infrastructure-Foundations, Patch-For-Review: proposal: allow analytics-admins to also trigger puppet runs - https://phabricator.wikimedia.org/T404630#11201768 (LSobanski) Approved in the I/F meeting.
[14:45:34] Data-Engineering, Data-Engineering-Radar, Data-Platform-SRE, Infrastructure-Foundations, Patch-For-Review: proposal: allow analytics-admins to also trigger puppet runs - https://phabricator.wikimedia.org/T404630#11201769 (CDanis) Open→Resolved approved, and merged
[14:48:00] +%analytics-admins ALL = NOPASSWD: /usr/local/sbin/puppet-run
[14:56:18] (CR) Ottomata: [C:+1] "Nit about comment docs, +1 otherwise!" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180) (owner: Santiago Faci)
[15:10:22] Data-Engineering (Q1 FY25/26 July 1st - September 30th): Deploy mediawiki-event-enrichment Flink jobs running 1.20 - https://phabricator.wikimedia.org/T401725#11201915 (Ottomata) a:Ottomata→tchin
[15:21:30] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Event-Platform, MW-1.45-notes (1.45.0-wmf.18; 2025-09-09), Patch-For-Review: Update event-producing tools to overwrite `meta.dt` - https://phabricator.wikimedia.org/T376026#11201991 (Ottomata) I still see 2 offenders: ` {"name":"eventgate-...
[15:26:34] (PS5) Santiago Faci: Added `agent.ua_string` as a possible source when parsing user agent [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180)
[15:32:25] (CR) Ottomata: [C:+1] Added `agent.ua_string` as a possible source when parsing user agent [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180) (owner: Santiago Faci)
[15:35:57] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Event-Platform, MW-1.45-notes (1.45.0-wmf.18; 2025-09-09), Patch-For-Review: Update event-producing tools to overwrite `meta.dt` - https://phabricator.wikimedia.org/T376026#11202076 (Ottomata) https://gitlab.wikimedia.org/repos/search-platf...
[15:41:37] (CR) Santiago Faci: "Documentation has been updated according to the change" [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180) (owner: Santiago Faci)
[15:53:40] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Event-Platform, MW-1.45-notes (1.45.0-wmf.18; 2025-09-09), Patch-For-Review: Update event-producing tools to overwrite `meta.dt` - https://phabricator.wikimedia.org/T376026#11202203 (Ottomata) Wow, I think I found the page_change offender....
[15:57:07] (CR) Ottomata: [C:+2] Added `agent.ua_string` as a possible source when parsing user agent [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180) (owner: Santiago Faci)
[15:58:18] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Experimentation Lab: NEW/CHANGE FEATURE REQUEST: make available the centralauth.globaluser table in Data Lake - https://phabricator.wikimedia.org/T389666#11202272 (Ottomata) Scheduled for this week's DPE ops week refinery train deployment. https://...
[16:12:43] (Merged) jenkins-bot: Added `agent.ua_string` as a possible source when parsing user agent [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1186049 (https://phabricator.wikimedia.org/T385180) (owner: Santiago Faci)
[17:18:41] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Event-Platform, Patch-For-Review: Add schema diffing support to jsonschema-tools and run diff in CI - https://phabricator.wikimedia.org/T321850#11202622 (Ottomata) Submitted MRs to add a Reviewing schema changes section to the READMEs of both p...
[17:46:23] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Movement-Insights (FY25-26 H1): event.editattemptstep is not logging some revisions that appear in mediawiki_history - https://phabricator.wikimedia.org/T394961#11202715 (Ottomata) Open→Declined I'm going to be bold and decline this tic...
[19:55:13] Data-Engineering, CirrusSearch, DPE-Mediawiki-Content, Discovery-Search (2025.09.05 - 2025.09.26), and 2 others: Source the CirrusSearch index dumps from hadoop instead of a MW maintenance script - https://phabricator.wikimedia.org/T366248#11203073 (EBernhardson) What do we think is the right way...
[20:18:18] Data-Engineering (Q1 FY25/26 July 1st - September 30th), MediaWiki-Page-derived-data, OKR-Work: Global Editor Metrics - Data Pipeline - https://phabricator.wikimedia.org/T405039#11203105 (Ottomata)
[20:25:56] Data-Engineering (Q1 FY25/26 July 1st - September 30th), MediaWiki-Page-derived-data, OKR-Work: Global Editor Metrics - Data Pipeline - https://phabricator.wikimedia.org/T405039#11203135 (Ottomata) We'd like to generate a few days of `editor_metrics_per_user_per_page_daily` data ASAP in order to bett...
[20:26:24] Data-Engineering (Q1 FY25/26 July 1st - September 30th), MediaWiki-Page-derived-data, OKR-Work: Global Editor Metrics - Data Pipeline - https://phabricator.wikimedia.org/T405039#11203138 (Ottomata) @mforns ! From [[ https://datahub.wikimedia.org/dataset/urn:li:dataset:(urn:li:dataPlatform:hive,wmf.p...
[20:28:59] Data-Engineering (Q1 FY25/26 July 1st - September 30th), Essential-Work, Patch-For-Review: [Data Quality] Implement wiki completeness check for MediaWiki History - https://phabricator.wikimedia.org/T365203#11203144 (Ahoelzl) a:mforns→Snwachukwu