[02:20:51] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11527522 (10Ottomata) >> This is awkward (as usual) since page.is_redirect exists but redirectness is a proper... [02:28:26] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11527524 (10Ottomata) > I think it would be useful to explicitly model the current rendering of a page apart f... [02:41:08] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11527543 (10Ottomata) Also relevant: {T360794}. Years ago we intended the [[ https://gitlab.wikimedia.org/rep... [05:47:47] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop ar_sha1 from archive table in wmf production - https://phabricator.wikimedia.org/T411163#11527660 (10Marostegui) [05:47:49] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop rev_sha1 from revision table in wmf production - https://phabricator.wikimedia.org/T411164#11527661 (10Marostegui) [05:48:28] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop ar_sha1 from archive table in wmf production - https://phabricator.wikimedia.org/T411163#11527662 (10Marostegui) [05:48:49] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop ar_sha1 from archive table in wmf production - https://phabricator.wikimedia.org/T411163#11527667 (10Marostegui) [05:51:46] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop rev_sha1 from revision table in wmf production - https://phabricator.wikimedia.org/T411164#11527668 (10Marostegui) [06:16:22] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527692 (10ops-monitoring-bot) Cookbook cookbooks.sre.wikireplicas.update-views run by marostegui: Started updating wiki replica views [06:21:15] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527696 (10ops-monitoring-bot) Cookbook cookbooks.sre.wikireplicas.update-views started by marostegui completed: - an-redacteddb1001.eqiad.wmnet (**PAS... [06:21:21] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527697 (10ops-monitoring-bot) Cookbook cookbooks.sre.wikireplicas.update-views run by marostegui: Started updating wiki replica views [06:25:03] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527699 (10Marostegui) @fnegri @Tacsipacsi I've recreated the views on the pending section: s4 - as the schema change went through there already. [06:27:08] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527702 (10ops-monitoring-bot) Cookbook cookbooks.sre.wikireplicas.update-views started by marostegui completed: - an-redacteddb1001.eqiad.wmnet (**PAS... [07:14:50] (03CR) 10Joal: [C:03+1] "LGTM! Thanks" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1227377 (https://phabricator.wikimedia.org/T396031) (owner: 10Xcollazo) [08:48:20] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11527862 (10brouberol) I can take care of spinning up the airflow instance if required @BTullis. [08:55:03] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11527881 (10elukey) @brouberol yeah let's do it if you have time! [09:07:16] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Add il_target_id to imagelinks table in wmf production - https://phabricator.wikimedia.org/T413525#11527903 (10Tacsipacsi) >>! In T413525#11527699, @Marostegui wrote: > @fnegri @Tacsipacsi I've recreated the views on the pending section: s4 - as the s... [09:30:04] 06Data-Engineering, 06SRE, 10SRE-Access-Requests, 13Patch-For-Review: Requesting access to analytics-privatedata-users for kareid - https://phabricator.wikimedia.org/T413364#11527946 (10Dzahn) a:05thcipriani→03None [09:59:07] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11528045 (10BTullis) >>! In T402512#11527862, @brouberol wrote: > I can take care of spinning up the airflow instance if required... [10:01:20] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11528049 (10brouberol) Sure thing. I'd need a couple of details. from you @elukey, namely the defaut team name DAGs would be labe... [10:06:43] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Change delete selection for SLO metric - https://phabricator.wikimedia.org/T414779 (10APizzata-WMF) 03NEW [10:13:55] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic, 13Patch-For-Review: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11528106 (10elukey) @brouberol I'd say team name "sre" and the root wikimedia email as starter, then later... [10:31:48] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic, 13Patch-For-Review: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11528160 (10MoritzMuehlenhoff) For logging into the instance we can use cn=ops,ou=groups,dc=wikimedia,dc=org [10:54:16] 06Data-Engineering (Q2 FY25/26 October 1st - December 31th): Test the dbt+skein approach to running dbt Spark jobs in K8s - https://phabricator.wikimedia.org/T414784 (10amastilovic) 03NEW [10:57:00] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11528222 (10daniel) > So while we may want to associate links with a revision, ideally we'd have some other st... [11:00:52] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11528223 (10daniel) > I like this idea. I'm not really sure how to start though. Let's keep it simple and use... [13:00:44] (03CR) 10Xcollazo: [C:03+2] Drop DDL HQL for mediawiki_wikitext_* tables. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1227377 (https://phabricator.wikimedia.org/T396031) (owner: 10Xcollazo) [13:01:06] (03CR) 10Xcollazo: [V:03+2 C:03+2] Drop DDL HQL for mediawiki_wikitext_* tables. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1227377 (https://phabricator.wikimedia.org/T396031) (owner: 10Xcollazo) [13:02:07] 06Data-Engineering, 10CirrusSearch, 06Data-Platform-SRE, 10DPE-Mediawiki-Content, and 4 others: Source the CirrusSearch index dumps from hadoop instead of a MW maintenance script - https://phabricator.wikimedia.org/T366248#11528624 (10pfischer) 05Open→03Resolved [13:07:10] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Fix reconcile bug where user_id is not being populated correctly. - https://phabricator.wikimedia.org/T411803#11528647 (10xcollazo) >There are still reconcile events being ingested, but the majority are taken care of. The long tail is done, and at the en... [13:09:32] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 10DPE-Mediawiki-Content: Wait till november wmf_raw.mediawiki_slots sqoop table is available, and apply origin_rev_id fix to mw_content tables - https://phabricator.wikimedia.org/T407237#11528649 (10xcollazo) a:03xcollazo [13:45:01] 06Data-Engineering, 10CirrusSearch, 10DPE-Mediawiki-Content, 10MediaWiki-Page-derived-data, and 4 others: Source the CirrusSearch index dumps from hadoop instead of a MW maintenance script - https://phabricator.wikimedia.org/T366248#11528752 (10Gehel) [13:55:00] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Update MediaWiki Content History SLO draft for SRE review - https://phabricator.wikimedia.org/T401892#11528821 (10xcollazo) Final Asana status of this completed work at https://app.asana.com/0/1210776717300332/progress/1212800339589065. [14:27:48] 06Data-Engineering (Q2 FY25/26 October 1st - December 31th): Test the dbt+skein approach to running dbt Spark jobs in K8s - https://phabricator.wikimedia.org/T414784#11529040 (10Ottomata) [14:27:57] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 06Movement-Insights, 06Data-Platform-SRE (2026.01.05 - 2026.01.23), 07Essential-Work, 13Patch-For-Review: Run dbt from Airflow - https://phabricator.wikimedia.org/T410268#11529041 (10Ottomata) [14:39:53] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11529086 (10Ottomata) > We introduced a "render ID" into mediawiki core a while ago, for this reason. But it'S... [14:40:49] 06Data-Engineering, 06Data-Persistence, 10GlobalBlocking, 07Essential-Work, and 3 others: Change type of gb_address to VARBINARY - https://phabricator.wikimedia.org/T414211#11529090 (10Dreamy_Jazz) [14:47:21] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 06Research, 10Event-Platform, 13Patch-For-Review: Implement stream of HTML content on mw.page_change event - https://phabricator.wikimedia.org/T360794#11529115 (10Ottomata) @JMonton-WMF, based on a conversation I'm having with @daniel in T331399#115... [15:09:44] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Switch Refine DAG priority weights to absolute - https://phabricator.wikimedia.org/T414810 (10Antoine_Quhen) 03NEW [15:12:57] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11529227 (10Isaac) Just to add a point about the rendering being separate from the revision (I didn't know abo... [15:13:53] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11529228 (10Ottomata) QQ: are templatelinks part of the current canonical rendering of the current revision? [15:14:41] 06Data-Engineering, 06Machine-Learning-Team, 10Event-Platform: Create new mediawiki.page_links_change stream based on fragment/mediawiki/state/change/page - https://phabricator.wikimedia.org/T331399#11529229 (10Ottomata) > you have to make sure that both revisions that you're compared were rendered at the sa... [15:35:38] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Migrate refinery-drop-older-than from refinery to Airflow - https://phabricator.wikimedia.org/T414815 (10Antoine_Quhen) 03NEW [15:50:35] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 13Patch-For-Review: Publish Dumps 2 to dumps.wikimedia.org and provide only monthly dumps - https://phabricator.wikimedia.org/T414389#11529388 (10xcollazo) 05Open→03In progress p:05Triage→03High a:03xcollazo [16:26:56] 06Data-Engineering, 06Data-Platform-SRE, 06ServiceOps new, 10ServiceOps-good-first-task, and 2 others: DRY kafka broker declaration in helmfiles - https://phabricator.wikimedia.org/T253058#11529623 (10MLechvien-WMF) [16:28:02] 06Data-Engineering, 06Data-Platform-SRE, 06ServiceOps new, 10ServiceOps-Datastores, and 3 others: DRY kafka broker declaration in helmfiles - https://phabricator.wikimedia.org/T253058#11529628 (10MLechvien-WMF) [16:29:44] 06Data-Engineering, 06Infrastructure-Foundations, 06Traffic, 13Patch-For-Review: Export development_network_probe data to Puppet servers for CDN deployment - https://phabricator.wikimedia.org/T402512#11529638 (10elukey) Next steps: * DP to create the Airflow SRE instance. * Me and DP to configure the rsy... [16:33:57] 06Data-Engineering, 06Data-Platform-SRE, 06ServiceOps new, 10ServiceOps-Datastores, and 3 others: DRY kafka broker declaration in helmfiles - https://phabricator.wikimedia.org/T253058#11529661 (10MLechvien-WMF) a:03brouberol @brouberol bringing back this task as we're going through Serviceops backlog.... [16:36:22] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th), 07OKR-Work: SDS 2.2.6 Improve experiment event data data lake management - https://phabricator.wikimedia.org/T414105#11529672 (10mpopov) Thank you so much for investigating that and proposing a short term solution! Once I added the partition pushdown to... [17:25:09] 06Data-Engineering: Backfill `user_central_id` on wmf_content.mediawiki_content_* tables - https://phabricator.wikimedia.org/T414832 (10xcollazo) 03NEW [17:26:27] 06Data-Engineering (Q2 FY25/26 October 1st - December 31th), 10MediaWiki-Page-derived-data, 07OKR-Work: Add user_central_id to mediawiki_content_history_v1 (and mediawiki_content_current_v1) - https://phabricator.wikimedia.org/T406515#11529933 (10xcollazo) 05Open→03Resolved I am closing this ticket a... [19:39:35] 06Data-Engineering (Q3 FY25/26 January 1st - March 31th): Publish Dumps 2 to dumps.wikimedia.org and provide only monthly dumps - https://phabricator.wikimedia.org/T414389#11530302 (10xcollazo) >>! In T414389#11529535, @CodeReviewBot wrote: > xcollazo **merged** https://gitlab.wikimedia.org/repos/data-engineerin...