[03:24:10] 06Data-Engineering, 10ContentTranslation, 06Language and Product Localization, 10Metrics Platform, 13Patch-For-Review: Update WMF-deployed extensions to use mw.config checks instead of manual m-dot URL hacks - https://phabricator.wikimedia.org/T390923#10706564 (10Krinkle) [03:24:12] 06Data-Engineering, 10ContentTranslation, 06Language and Product Localization, 10Metrics Platform, 13Patch-For-Review: Update WMF-deployed extensions to use mw.config checks instead of manual m-dot URL hacks - https://phabricator.wikimedia.org/T390923#10706565 (10Krinkle) p:05Triage→03High [03:29:28] 10Data-Engineering (Q3 2025 January 1st - March 31th): Assess data platform implications for RFC m. domain name unification - https://phabricator.wikimedia.org/T389696#10706571 (10Krinkle) [03:29:43] 06Data-Engineering, 06SRE, 06Traffic-Icebox, 10MobileFrontend (Tracking): RFC: Remove m-dot subdomain, serve mobile and desktop variants through the same URL - https://phabricator.wikimedia.org/T214998#10706572 (10Krinkle) [03:30:46] 10Data-Engineering (Q3 2025 January 1st - March 31th): Assess data platform implications for RFC m. domain name unification - https://phabricator.wikimedia.org/T389696#10706574 (10Krinkle) [03:32:38] 10Data-Engineering (Q3 2025 January 1st - March 31th): Update webrequest refinery to support access_method="mobile web" without m-dot domain - https://phabricator.wikimedia.org/T389696#10706575 (10Krinkle) [03:49:31] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10MobileFrontend, 06Traffic: Add ismobile attribute to X-Analytics header - https://phabricator.wikimedia.org/T390924 (10Krinkle) 03NEW [03:50:01] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10MobileFrontend, 06Traffic: Add ismobile attribute to X-Analytics header - https://phabricator.wikimedia.org/T390924#10706593 (10Krinkle) [06:52:05] 10Data-Engineering (Q3 2025 January 1st - March 31th): Update webrequest and unique devices pipelines to derive access_method without m-dot domain - https://phabricator.wikimedia.org/T389696#10706663 (10Krinkle) [08:06:45] !log Rerun failed webrequest-job with lower error threshold - traffic spike [08:06:47] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [08:31:22] 06Data-Engineering, 06Data-Engineering-Radar, 06Traffic, 10MediaWiki-Platform-Team (Radar), 13Patch-For-Review: New software: haproxykafka - https://phabricator.wikimedia.org/T370668#10706866 (10Fabfur) [08:32:08] 06Data-Engineering, 06Traffic, 10DPE HAProxy Migration: Add HAproxy termination field to webrequest - https://phabricator.wikimedia.org/T387454#10706872 (10Fabfur) [08:32:10] 06Data-Engineering, 06Data-Engineering-Radar, 06Traffic, 10MediaWiki-Platform-Team (Radar), 13Patch-For-Review: New software: haproxykafka - https://phabricator.wikimedia.org/T370668#10706873 (10Fabfur) [08:32:26] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Traffic, 10DPE HAProxy Migration: Make webrequest_frontend being ingested using the in-data `dt` field - https://phabricator.wikimedia.org/T388397#10706874 (10Fabfur) [08:32:28] 06Data-Engineering, 06Data-Engineering-Radar, 06Traffic, 10MediaWiki-Platform-Team (Radar), 13Patch-For-Review: New software: haproxykafka - https://phabricator.wikimedia.org/T370668#10706875 (10Fabfur) [08:33:09] 06Data-Engineering, 06Data-Engineering-Radar, 06Traffic, 10MediaWiki-Platform-Team (Radar), 13Patch-For-Review: New software: haproxykafka - https://phabricator.wikimedia.org/T370668#10706878 (10Fabfur) [12:10:44] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10MobileFrontend, 06Traffic: Add ismobile attribute to X-Analytics header - https://phabricator.wikimedia.org/T390924#10707573 (10phuedx) > Other ideas? Preferences? IIRC Varnish is the decision maker in production – MobileFrontend simply responds to th... [12:40:14] 06Data-Engineering: Unable to save Jupyter Notebooks or start IPython kernel on stat1008 - https://phabricator.wikimedia.org/T390959 (10CDobbins) 03NEW [12:52:25] 10Data-Engineering (Q3 2025 January 1st - March 31th), 13Patch-For-Review: Enable Spark data lineage for all Airflow instances - https://phabricator.wikimedia.org/T386862#10707732 (10tchin) I think it actually is broken on main as well and it's just been silently failing. I opened a patch to add the port back in [13:54:10] (03PS3) 10Snwachukwu: 1.Add a closed flag to the project namespace map dataset 2. Add a whether to sqoop flag by checking if wikidb exists in cloud replica. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [14:15:03] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10MobileFrontend, 06Traffic: Add ismobile attribute to X-Analytics header - https://phabricator.wikimedia.org/T390924#10708135 (10Jdforrester-WMF) If this code is in MobileFrontend, it won't detect mobile hits to non-MF wikis like Wikifunctions, so that... [14:15:50] (03CR) 10Gmodena: [C:03+1] Add columns to data_quality_alerts to support inserting ResultKey (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1127967 (https://phabricator.wikimedia.org/T384962) (owner: 10TChin) [14:41:57] 10Data-Engineering (Q4 2025 April 1st - June 30th): Update webrequest and unique devices pipelines to derive access_method without m-dot domain - https://phabricator.wikimedia.org/T389696#10708281 (10Ahoelzl) [14:42:00] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10MediaWiki-DomainEvents, 10MW-Interfaces-Team (MWI-Roadmap): DomainEvents - [Hypothesis] WE5.2.6 Event Broadcasting Discovery & Design - https://phabricator.wikimedia.org/T384874#10708285 (10Ahoelzl) [14:42:04] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Commons-Impact-Metrics, 13Patch-For-Review: [CIM] Skewed ranking with the top Editors monthly API - https://phabricator.wikimedia.org/T370470#10708283 (10Ahoelzl) [14:43:22] 10Data-Engineering (Q4 2025 April 1st - June 30th): Airflow skips canary-event tasks - https://phabricator.wikimedia.org/T380836#10708306 (10Ahoelzl) [14:43:25] 10Data-Engineering (Q4 2025 April 1st - June 30th): Some search entries in wmf.webrequest have their query appended to their uri_path - https://phabricator.wikimedia.org/T383135#10708310 (10Ahoelzl) [14:43:27] 10Data-Engineering (Q4 2025 April 1st - June 30th), 13Patch-For-Review: [Refine Refactoring] Refine jobs should be scheduled by Airflow: deployment - https://phabricator.wikimedia.org/T369845#10708308 (10Ahoelzl) [14:44:48] 10Data-Engineering (Q4 2025 April 1st - June 30th): Switch webrequest dataset to feed from HAProxy instead of VarnishKafka - https://phabricator.wikimedia.org/T386177#10708320 (10Ahoelzl) [14:44:50] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work, 13Patch-For-Review: [Data Quality] Implement wiki completeness check for MediaWiki History - https://phabricator.wikimedia.org/T365203#10708322 (10Ahoelzl) [14:44:53] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work, 10Event-Platform: Gobblin-wmf Gitlab migration and maintenance - https://phabricator.wikimedia.org/T370368#10708324 (10Ahoelzl) [14:45:01] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 13Patch-For-Review: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10708326 (10Ahoelzl) [14:45:04] 10Data-Engineering (Q4 2025 April 1st - June 30th): Provide Data Engineering Q4 draft - https://phabricator.wikimedia.org/T387385#10708328 (10Ahoelzl) [14:45:07] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: List out all migration candidates for mediawiki_content_history - https://phabricator.wikimedia.org/T386757#10708330 (10Ahoelzl) [14:45:15] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Commons-Impact-Metrics: [Commons Impact Metrics] Add page wiki to the corresponding top endpoints - https://phabricator.wikimedia.org/T372805#10708332 (10Ahoelzl) [14:45:23] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content, 13Patch-For-Review: Implement alerting for wmf_content.mediawiki_content_history_v1 - https://phabricator.wikimedia.org/T384962#10708334 (10Ahoelzl) [14:45:27] 10Data-Engineering (Q4 2025 April 1st - June 30th), 13Patch-For-Review: Enable Spark data lineage for all Airflow instances - https://phabricator.wikimedia.org/T386862#10708336 (10Ahoelzl) [14:47:56] 06Data-Engineering, 10MobileFrontend, 06Traffic: Add ismobile attribute to X-Analytics header - https://phabricator.wikimedia.org/T390924#10708340 (10Ahoelzl) [14:48:19] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work: Support for 4.3.11 - webrequest based scraping detection - https://phabricator.wikimedia.org/T388721#10708346 (10Ahoelzl) [14:48:43] 06Data-Engineering, 10DPE-Mediawiki-Content: Add metrics for monthly reconciles - https://phabricator.wikimedia.org/T388439#10708351 (10Ahoelzl) [14:55:42] 10Data-Engineering (Q3 2025 January 1st - March 31th), 07Essential-Work: Migrate Gobblin to Airflow - https://phabricator.wikimedia.org/T388246#10708390 (10Ahoelzl) a:03amastilovic [14:55:59] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work: Migrate Gobblin to Airflow - https://phabricator.wikimedia.org/T388246#10708393 (10Ahoelzl) [14:59:25] 10Data-Engineering (Q4 2025 April 1st - June 30th): [Airflow-test] refine_to_hive_hourly_test.refine_hive_dataset.wait_for_gobblin_export is failing recurrently - https://phabricator.wikimedia.org/T382901#10708417 (10Ahoelzl) [15:02:22] 10Data-Engineering (Q4 2025 April 1st - June 30th): Handle Late-Arrived Events from Gobblin into Airflow triggered Refine - https://phabricator.wikimedia.org/T370665#10708448 (10Ahoelzl) [15:03:32] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: [Dumps 2] Investigate reasons for remaining inconsistencies - https://phabricator.wikimedia.org/T385112#10708449 (10Ahoelzl) [15:04:37] 10Data-Engineering (Q4 2025 April 1st - June 30th): test_produced_by_config SLA miss configured to be too small for upstream dataset run time - https://phabricator.wikimedia.org/T388861#10708455 (10Ahoelzl) [15:05:39] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Event-Platform: Some events in mediawiki.page_change.v1 refers to auth.wikimedia.org in meta.uri and meta.domain - https://phabricator.wikimedia.org/T388825#10708458 (10Ahoelzl) [15:06:03] 10Data-Engineering (Q4 2025 April 1st - June 30th): [Refine Refactoring] Refine Data Quality - late events, RefineMonitor refactor, etc. - https://phabricator.wikimedia.org/T377739#10708459 (10Ahoelzl) [15:07:21] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: When doing ADD COLUMN to a struct under a map, Iceberg fails to SELECT it - https://phabricator.wikimedia.org/T388793#10708467 (10Ahoelzl) [15:08:49] 06Data-Engineering: Figure sqoop changes to continue ingesting wbt tables to analytics cluster - https://phabricator.wikimedia.org/T390975 (10xcollazo) 03NEW [15:09:05] 06Data-Engineering: Figure sqoop changes to continue ingesting wbt tables to analytics cluster - https://phabricator.wikimedia.org/T390975#10708488 (10xcollazo) [15:11:44] 06Data-Engineering: Figure sqoop changes to continue ingesting wbt tables to analytics cluster - https://phabricator.wikimedia.org/T390975#10708511 (10xcollazo) 05Open→03Declined @JAllemandou mentions: >While we have the ability to sqoop those tables, we don't do it on a regular basis. The last snapshot... [15:14:03] 10Data-Engineering (Q4 2025 April 1st - June 30th): Move more of refine_hive_hourly dag logic into RefineConfiguration - https://phabricator.wikimedia.org/T375064#10708585 (10Ahoelzl) [15:14:46] (03CR) 10Xcollazo: [C:03+1] Add columns to data_quality_alerts to support inserting ResultKey [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1127967 (https://phabricator.wikimedia.org/T384962) (owner: 10TChin) [15:15:21] 10Data-Engineering (Q4 2025 April 1st - June 30th): Reduce `refine_to_hive_hourly` airflow task number - https://phabricator.wikimedia.org/T380856#10708605 (10Ahoelzl) a:03Antoine_Quhen [15:16:37] 10Data-Engineering (Q4 2025 April 1st - June 30th): [Refine DAG Improvement] Add Parameter to Reduce Spark Driver Logs in Skein Log Collection - https://phabricator.wikimedia.org/T381074#10708609 (10Ahoelzl) [15:17:36] 10Data-Engineering (Q4 2025 April 1st - June 30th): [Refine Simplification] Remove Schema Merging in Refine Process by Enforcing Backward Compatibility - https://phabricator.wikimedia.org/T381072#10708619 (10Ahoelzl) [15:19:26] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work: 8 new wikis missing from mediawiki_history - https://phabricator.wikimedia.org/T368788#10708637 (10Ahoelzl) [15:20:04] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10MediaWiki-Page-rename, 10Event-Platform, 07Wikimedia-production-error: InvalidArgumentException: No page moved from 'File:Hospital sign.svg' to 'File:MUTCD D9-2.svg' with ID 1208082 could be found - https://phabricator.wikimedia.org/T387695#10708638 (10... [15:20:15] 10Data-Engineering (Q4 2025 April 1st - June 30th), 06Trust and Safety Product Team, 13Patch-For-Review, 10Product-Analytics (Kanban): Add mediawiki_product_metrics_incident_reporting_system_interaction to the sanitization allowlist - https://phabricator.wikimedia.org/T384650#10708639 (10Ahoelzl) [15:20:36] 06Data-Engineering: [sqoop] Check if a sqoop needs changes when one mariadb field changes from int to smallint - https://phabricator.wikimedia.org/T383803#10708642 (10Ahoelzl) [15:20:38] 06Data-Engineering, 06Product-Analytics, 10Event-Platform: [Event Platform] Disable default collection of user agent for analytics streams - https://phabricator.wikimedia.org/T384964#10708644 (10Ahoelzl) [15:20:40] 06Data-Engineering: [Developer Experience] Implement CI hql Linting - https://phabricator.wikimedia.org/T360967#10708648 (10Ahoelzl) [15:20:42] 06Data-Engineering, 10DPE-Mediawiki-Content, 13Patch-For-Review: Modify code to dump all slots - https://phabricator.wikimedia.org/T384945#10708640 (10Ahoelzl) [15:20:44] 06Data-Engineering, 06Traffic, 07Essential-Work, 10Experimentation Lab Radar: Cookie % has been rejected because it is foreign and does not have the "Partitioned" attribute - https://phabricator.wikimedia.org/T375256#10708646 (10Ahoelzl) [15:20:45] 06Data-Engineering: Warning of mismatch in declarations of Webrequest schema - https://phabricator.wikimedia.org/T380916#10708650 (10Ahoelzl) [15:20:50] 06Data-Engineering, 10MediaWiki-DomainEvents, 06MW-Interfaces-Team, 05FY2024-25 KR 5.2 Simplify feature development, 07OKR-Work: Design and document new Domain Events feature in MediaWiki core - https://phabricator.wikimedia.org/T379959#10708652 (10Ahoelzl) [15:20:55] 06Data-Engineering: Find a way to make spark have an intermediate data materialization step before coalescing - https://phabricator.wikimedia.org/T379194#10708654 (10Ahoelzl) [15:20:59] 06Data-Engineering: [Placeholder] Clean Up Corresponding Hive Tables After Deprecating Older Stream Configs - https://phabricator.wikimedia.org/T368800#10708658 (10Ahoelzl) [15:21:04] 06Data-Engineering, 06Data-Platform-SRE, 06Movement-Insights: Fail Spark job or airflow task if unexpected number of output files - https://phabricator.wikimedia.org/T377006#10708656 (10Ahoelzl) [15:21:10] 06Data-Engineering, 06Experimentation Lab: Make jsonschema-tools merge values of enums when merging allOf - https://phabricator.wikimedia.org/T345317#10708660 (10Ahoelzl) [15:21:54] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Traffic: Migrate Benthos `webrequest_sampled_live` to feed from HAProxy data - https://phabricator.wikimedia.org/T390029#10708662 (10Ahoelzl) 05Open→03Resolved [15:21:55] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE HAProxy Migration: Update webrequest_frontend validation jobs to use `` instead of invalid dt - https://phabricator.wikimedia.org/T389797#10708664 (10Ahoelzl) 05Open→03Resolved [15:21:58] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Traffic, 10DPE HAProxy Migration: Make webrequest_frontend being ingested using the in-data `dt` field - https://phabricator.wikimedia.org/T388397#10708666 (10Ahoelzl) 05Open→03Resolved [15:22:02] 10Data-Engineering (Q3 2025 January 1st - March 31th): Investigate why the mw-content-history-reconcile-enrich Flink job failed. - https://phabricator.wikimedia.org/T387906#10708670 (10Ahoelzl) 05Open→03Resolved [15:22:06] 10Data-Engineering (Q3 2025 January 1st - March 31th): Fix service-utils metrics routing naming discrepancy - https://phabricator.wikimedia.org/T387824#10708671 (10Ahoelzl) 05In progress→03Resolved [15:22:10] 10Data-Engineering (Q3 2025 January 1st - March 31th): Provide a list of Phabricator tags relevant for Data-Engineering and use them on Data-Engineering board - https://phabricator.wikimedia.org/T386752#10708673 (10Ahoelzl) 05Open→03Resolved [15:22:14] 10Data-Engineering (Q3 2025 January 1st - March 31th), 13Patch-For-Review: [eventstreams] Fix event detail yaml error - https://phabricator.wikimedia.org/T386750#10708676 (10Ahoelzl) 05Open→03Resolved [15:22:18] 10Data-Engineering (Q3 2025 January 1st - March 31th): [Airflow Optimization] Reduce Overhead in Refine DAG by Precomputing Parameters - https://phabricator.wikimedia.org/T381073#10708680 (10Ahoelzl) 05Open→03Resolved [15:22:22] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE HAProxy Migration, 13Patch-For-Review: [HAProxy migration] HAProxy and VarnishKafka should produce compatible datasets - https://phabricator.wikimedia.org/T382571#10708679 (10Ahoelzl) 05Open→03Resolved [15:22:26] 10Data-Engineering (Q3 2025 January 1st - March 31th): Write documentation on usage of RestExternalTaskSensor - https://phabricator.wikimedia.org/T378000#10708683 (10Ahoelzl) 05Open→03Resolved [15:22:34] 10Data-Engineering (Q3 2025 January 1st - March 31th), 13Patch-For-Review: Migrate and re-deploy eventstreams using service-utils - https://phabricator.wikimedia.org/T361769#10708687 (10Ahoelzl) 05In progress→03Resolved [15:22:38] 10Data-Engineering (Q3 2025 January 1st - March 31th), 13Patch-For-Review: Publish Data Engineering maintained NodeJS packages to GitLab and use them in depender code - https://phabricator.wikimedia.org/T366612#10708685 (10Ahoelzl) 05Open→03Resolved [15:22:43] 10Data-Engineering (Q3 2025 January 1st - March 31th): Replace service runner with a simplified library to better support metrics and debugging: service-utils - https://phabricator.wikimedia.org/T360924#10708689 (10Ahoelzl) 05Open→03Resolved [15:22:47] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Event-Platform: Bug: jsonschema-tools generates non deterministic examples for date format fields - https://phabricator.wikimedia.org/T389881#10708690 (10Ahoelzl) 05Open→03Resolved [15:22:51] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Infrastructure-Foundations, 10netops: Update `netflow` retention strategy in Druid (too much data) - https://phabricator.wikimedia.org/T387839#10708693 (10Ahoelzl) 05Open→03Resolved [15:22:59] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Product-Analytics, 10Event-Platform: [BUG] eventgate-logging-external drops previously collected http request headers - https://phabricator.wikimedia.org/T387908#10708692 (10Ahoelzl) 05In progress→03Resolved [15:23:03] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE HAProxy Migration: [HAProxy migration] Compile expected migration delta, switch over plan and communicate - https://phabricator.wikimedia.org/T387750#10708695 (10Ahoelzl) 05Open→03Resolved [15:23:07] 10Data-Engineering (Q3 2025 January 1st - March 31th): HDFS capacity needs data engineering and platform users - https://phabricator.wikimedia.org/T384100#10708698 (10Ahoelzl) 05Open→03Resolved [15:23:11] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Language and Product Localization, 10MediaWiki-extensions-Translate, 06MW-Interfaces-Team, and 3 others: Intermittent JobQueueError due to "Unable to deliver all events: 500: Internal Server Error"... - https://phabricator.wikimedia.org/T386138#10708697 [15:23:17] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06SRE, 06Traffic, 13Patch-For-Review: Refine add_is_wmf_domain TransformFunction fails if no source field exists - https://phabricator.wikimedia.org/T383914#10708700 (10Ahoelzl) 05Open→03Resolved [15:23:25] 10Data-Engineering (Q3 2025 January 1st - March 31th), 07Essential-Work: Identify Internal Users of MediaWiki Wikitext Tables - https://phabricator.wikimedia.org/T383743#10708704 (10Ahoelzl) 05Open→03Resolved [15:23:29] 10Data-Engineering (Q3 2025 January 1st - March 31th), 07Essential-Work: Analyze Dumps Usage Through Apache Logs - https://phabricator.wikimedia.org/T383175#10708705 (10Ahoelzl) 05Open→03Resolved [15:23:33] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Product-Analytics, 10Event-Platform: Enable Event Platform streams to opt out of collecting User-Agent data - https://phabricator.wikimedia.org/T382173#10708706 (10Ahoelzl) 05Open→03Resolved [15:23:37] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Data Pipelines, 10Observability-Metrics, 07Essential-Work, and 2 others: Disable Data Platform Engineering generated graphite metrics and dashboards - https://phabricator.wikimedia.org/T372855#10708707 (10Ahoelzl) 05Open→03Resolved [15:23:41] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06Product-Analytics, 10Event-Platform: Migrate legacy metawiki schemas to Event Platform - https://phabricator.wikimedia.org/T259163#10708709 (10Ahoelzl) 05Open→03Resolved [15:23:45] 10Data-Engineering (Q3 2025 January 1st - March 31th), 06MediaWiki-Engineering, 10TemplateData, 10VisualEditor, and 4 others: PHP Unknown error: EventLoggingLegacyConverter: Failed proxying legacy EventLogging event query string to WMF Event Platform JSON... - https://phabricator.wikimedia.org/T383939#10708711 [15:23:55] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10EventStreams, 10Event-Platform: EventStreams: kafka key should be serialized as a string - https://phabricator.wikimedia.org/T373689#10708714 (10Ahoelzl) 05Open→03Resolved [15:23:59] 10Data-Engineering (Q3 2025 January 1st - March 31th), 07Essential-Work, 10Event-Platform, 13Patch-For-Review: Upgrade eventgate-wikimedia to node20 - https://phabricator.wikimedia.org/T383814#10708713 (10Ahoelzl) 05Open→03Resolved [15:24:03] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10MediaWiki-extensions-EventLogging, 10Event-Platform, 13Patch-For-Review: Decommission EventLogging backend components by migrating to MEP - https://phabricator.wikimedia.org/T238230#10708717 (10Ahoelzl) 05Open→03Resolved [15:42:00] 10Data-Engineering (Q4 2025 April 1st - June 30th): Migrate Gobblin job repository to GitLab - https://phabricator.wikimedia.org/T390247#10708956 (10Ahoelzl) →14Duplicate dup:03T370368 [15:42:01] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work, 10Event-Platform: Gobblin-wmf Gitlab migration and maintenance - https://phabricator.wikimedia.org/T370368#10708958 (10Ahoelzl) [15:44:10] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 13Patch-For-Review: [Data Quality] Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10708966 (10Ahoelzl) [15:45:00] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE HAProxy Migration: [HAProxy migration] Drop webrequest_deprecated dag & linked wmf_deprecated tables (202) - https://phabricator.wikimedia.org/T390752#10708968 (10Ahoelzl) p:05Triage→03High [15:46:15] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Commons-Impact-Metrics: Update Commons Impact Metrics to account for new File table - https://phabricator.wikimedia.org/T389800#10708982 (10Ahoelzl) p:05Triage→03High [15:46:56] 10Data-Engineering (Q4 2025 April 1st - June 30th): Analytics Cluster Dataset Usage Discovery Task - https://phabricator.wikimedia.org/T389903#10708987 (10Ahoelzl) p:05Triage→03Medium [15:47:12] 10Data-Engineering (Q4 2025 April 1st - June 30th): Handle Late-Arrived Events from Gobblin into Airflow triggered Refine - https://phabricator.wikimedia.org/T370665#10708989 (10Ahoelzl) p:05Medium→03High [15:49:35] 10Data-Engineering (Q4 2025 April 1st - June 30th): [Refine Refactoring] Refine Data Quality - late events, RefineMonitor refactor, etc. - https://phabricator.wikimedia.org/T377739#10709013 (10Ahoelzl) →14Duplicate dup:03T370665 [15:49:37] 10Data-Engineering (Q4 2025 April 1st - June 30th): Handle Late-Arrived Events from Gobblin into Airflow triggered Refine - https://phabricator.wikimedia.org/T370665#10709015 (10Ahoelzl) [15:49:54] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10Event-Platform: Update event-producing tools to overwrite `meta.dt` - https://phabricator.wikimedia.org/T376026#10709019 (10Ahoelzl) [16:05:35] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: List out all migration candidates for mediawiki_content_history - https://phabricator.wikimedia.org/T386757#10709113 (10Ahoelzl) Authoritative list of migration targets is here: https://docs.google.com/document/d/1Ov8FJkR96HSDquJh5Wq3... [16:10:39] 10Data-Engineering (Q4 2025 April 1st - June 30th): Switch webrequest dataset to feed from HAProxy instead of VarnishKafka - https://phabricator.wikimedia.org/T386177#10709159 (10Ahoelzl) Migration impact and documentation: https://wikitech.wikimedia.org/w/index.php?title=Data_Platform%2FData_Lake%2FTraffic%2FWe... [16:14:19] 06Data-Engineering, 10MediaWiki-Page-rename, 10Event-Platform, 07Wikimedia-production-error: InvalidArgumentException: No page moved from 'File:Hospital sign.svg' to 'File:MUTCD D9-2.svg' with ID 1208082 could be found - https://phabricator.wikimedia.org/T387695#10709177 (10Ahoelzl) [16:14:50] 06Data-Engineering, 10MediaWiki-Page-rename, 10Event-Platform, 07Wikimedia-production-error: InvalidArgumentException: No page moved from 'File:Hospital sign.svg' to 'File:MUTCD D9-2.svg' with ID 1208082 could be found - https://phabricator.wikimedia.org/T387695#10709191 (10Reedy) [16:27:19] (03PS4) 10Snwachukwu: 1.Add a closed flag to the project namespace map dataset 2. Add a whether to sqoop flag by checking if wikidb exists in cloud replica. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [16:29:21] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Epic: Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003 (10JAllemandou) 03NEW [16:30:39] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Epic: MAY-2025 Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003#10709278 (10JAllemandou) [16:36:09] 10Data-Engineering (Q4 2025 April 1st - June 30th): MAY-2025 Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003#10709302 (10JAllemandou) [16:38:55] (03CR) 10Snwachukwu: 1.Add a closed flag to the project namespace map dataset 2. Add a whether to sqoop flag by checking if wikidb exists in cloud replica. (031 comment) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [16:49:43] (03CR) 10Milimetric: "A couple of minor things, looks good otherwise. I'd recommend getting review from Marcel or someone else?" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [16:50:18] 06Data-Engineering: Remove sqoop code for wikibase term storage - https://phabricator.wikimedia.org/T391006 (10Ladsgroup) 03NEW [16:53:13] (03CR) 10Snwachukwu: "Thanks. I will call their attention to it." [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [17:05:49] (03PS5) 10Snwachukwu: 1.Add a closed flag to the project namespace map dataset 2. Add a whether to sqoop flag by checking if wikidb exists in cloud replica. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [17:15:45] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE HAProxy Migration: MAY-2025 Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003#10709480 (10JAllemandou) [17:16:30] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE HAProxy Migration: [HAProxy Migration ]MAY-2025 Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003#10709492 (10JAllemandou) [17:16:46] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE HAProxy Migration: [HAProxy migration] Drop webrequest_deprecated dag & linked wmf_deprecated tables (202) - https://phabricator.wikimedia.org/T390752#10709495 (10JAllemandou) →14Duplicate dup:03T391003 [17:16:48] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE HAProxy Migration: [HAProxy Migration ]MAY-2025 Delete `webrequest_deprecated` data and DAGs - https://phabricator.wikimedia.org/T391003#10709497 (10JAllemandou) [18:11:12] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10709849 (10Ladsgroup) I'm reviewing the s... [18:22:54] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10709898 (10Dreamy_Jazz) >>! In T390873#10... [18:27:00] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10709911 (10Dreamy_Jazz) Given the idea of... [18:33:44] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10709942 (10Dreamy_Jazz) Because we need t... [18:35:05] 06Data-Engineering, 06Traffic: Unable to save Jupyter Notebooks or start IPython kernel on stat1008 - https://phabricator.wikimedia.org/T390959#10709949 (10CDobbins) [18:36:03] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10709953 (10matej_suchanek) >>! In T390873... [18:38:35] 06Data-Engineering, 10AbuseFilter, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027 (10Dreamy_Jazz) 03NEW [18:42:49] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710034 (10Dreamy_Jazz) I've filed T39102... [18:45:07] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 4 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710051 (10Dreamy_Jazz) [18:45:57] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710061 (10Dreamy_Jazz) Removing #schema-... [18:53:02] (03CR) 10Mforns: 1.Add a closed flag to the project namespace map dataset 2. Add a whether to sqoop flag by checking if wikidb exists in cloud replica. (034 comments) [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1125184 (https://phabricator.wikimedia.org/T365203) (owner: 10Milimetric) [19:06:18] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710226 (10Ladsgroup) Something to note i... [19:11:32] 06Data-Engineering, 10AbuseFilter, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10710244 (10Ladsgroup) 😭 Do you feel like making a patch or reviewing it if I do it? [19:12:20] 06Data-Engineering, 10AbuseFilter, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10710248 (10Dreamy_Jazz) >>! In T391027#10710244, @Ladsgroup wrote: > 😭 > > Do you feel like making a patch or reviewing it if I do it? I'm happy to either make a patch or revi... [19:17:32] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710253 (10Dreamy_Jazz) >>! In T390873#10... [19:23:14] 06Data-Engineering, 10AbuseFilter, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10710283 (10Dreamy_Jazz) Looking at git blame, it seems that it was added in 2009 with the description "Update abusefilter.tables.sql" [[ https://gerrit.wikimedia.org/r/plugins/g... [19:24:16] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710284 (10Ladsgroup) >>! In T390873#1071... [19:25:25] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710296 (10Ladsgroup) It's a blob: ` `afl... [19:27:03] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710304 (10Dreamy_Jazz) Thanks. In which... [19:28:14] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710307 (10Dreamy_Jazz) [19:35:07] 06Data-Engineering, 10AbuseFilter, 06Data-Persistence, 10MediaWiki-extensions-IPReputation, and 3 others: AbuseFilter protected variables: Make it possible for protected variable values expire when the IP address expires - https://phabricator.wikimedia.org/T390873#10710316 (10Ladsgroup) Awesome! I stay sub... [19:38:09] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: [Dumps 2] Investigate reasons for remaining inconsistencies - https://phabricator.wikimedia.org/T385112#10710323 (10xcollazo) `2025-04-01` montly reconcile of all of wikitime just finished: ` presto:wmf_content> select count(1) as co... [19:52:39] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: [Dumps 2] Investigate reasons for remaining inconsistencies - https://phabricator.wikimedia.org/T385112#10710354 (10xcollazo) Are there inconsistencies that are carrying over from last couple reconciles? ` with inconsistencies_from_20... [20:07:31] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: [Dumps 2] Investigate reasons for remaining inconsistencies - https://phabricator.wikimedia.org/T385112#10710412 (10xcollazo) Looks like `labswiki` is a good candidate to drill in, as the amount of recurrent inconsistencies (`53037`)... [20:08:09] 06Data-Engineering, 10AbuseFilter, 13Patch-For-Review, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10710418 (10Dreamy_Jazz) a:03Ladsgroup [20:14:11] 10Data-Engineering (Q4 2025 April 1st - June 30th), 10DPE-Mediawiki-Content: [Dumps 2] Investigate reasons for remaining inconsistencies - https://phabricator.wikimedia.org/T385112#10710452 (10xcollazo) Doesn't seem to be coming from Flink, as the total amount of errors recently is low, which doesn't correlate... [20:56:00] 06Data-Engineering, 10AbuseFilter, 13Patch-For-Review, 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10710633 (10Dreamy_Jazz) Next is to create the #schema-change-in-production ticket. Should I leave that to you? [21:24:39] 10Data-Engineering (Q4 2025 April 1st - June 30th), 07Essential-Work: Remove sqoop code for wikibase term storage - https://phabricator.wikimedia.org/T391006#10710731 (10Ahoelzl) p:05Triage→03High [22:36:21] 06Data-Engineering, 06SRE, 06Traffic-Icebox, 10MobileFrontend (Tracking): RFC: Remove m-dot subdomain, serve mobile and desktop variants through the same URL - https://phabricator.wikimedia.org/T214998#10710939 (10toni.stoev) >>! In T214998#10676078, @bd808 wrote: > @toni.stoev Please read https://www.medi... [23:01:36] 06Data-Engineering, 06Data-Engineering-Radar, 06Experimentation Lab: WebClientError events have version in unexpected format - https://phabricator.wikimedia.org/T383275#10711009 (10Jdlrobson-WMF) This is more a Jon thing than a web team thing. [23:19:10] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop afl_patrolled_by from abuse_filter_log in production - https://phabricator.wikimedia.org/T391056 (10Ladsgroup) 03NEW [23:19:40] 06Data-Engineering, 10AbuseFilter, 10MW-1.44-notes (1.44.0-wmf.24; 2025-04-08), 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10711133 (10Ladsgroup) 05Open→03Resolved >>! In T391027#10710633, @Dreamy_Jazz wrote: > Next is to create the #schema-chang... [23:19:48] 06Data-Engineering, 10AbuseFilter, 06DBA, 10MW-1.44-notes (1.44.0-wmf.24; 2025-04-08), 07Schema-change: AbuseFilter: Drop afl_patrolled_by - https://phabricator.wikimedia.org/T391027#10711137 (10Ladsgroup) [23:22:34] 10Data-Engineering (Q4 2025 April 1st - June 30th): Update webrequest and unique devices pipelines to derive access_method without m-dot domain - https://phabricator.wikimedia.org/T389696#10711154 (10Jdlrobson-WMF) It seems there have been misunderstandings about what the **existing** access_method does and how...