[00:37:27] 06Data-Engineering, 10Data Pipelines: Add support for repository artifacts in Airflow - https://phabricator.wikimedia.org/T322690#10227393 (10Ottomata) > ArtifactSource is defined/constructed through class name and base_uri which is optional, but in practice base_uri is not optional and points to either a dire... [10:58:12] 06Data-Engineering, 10CheckUser, 06Data Products, 06DBA, 07Schema-change-in-production: Remove cuc_actiontext, cuc_only_for_read_old, and cuc_private from cu_changes on WMF wikis - https://phabricator.wikimedia.org/T370903#10228495 (10Ladsgroup) [11:03:46] 06Data-Engineering, 06Data-Platform, 06DBA, 07Schema-change-in-production: Change page.page_links_updated to fixed-length timestamp in wmf wikis - https://phabricator.wikimedia.org/T371742#10228521 (10Ladsgroup) [11:04:33] 06Data-Engineering, 06Data-Platform, 06DBA, 07Schema-change-in-production: Change page.page_links_updated to fixed-length timestamp in wmf wikis - https://phabricator.wikimedia.org/T371742#10228522 (10Ladsgroup) [11:16:29] 06Data-Engineering, 10Event-Platform: Update eventutilities_python wrappers to support Flink 1.19 - https://phabricator.wikimedia.org/T374359#10228543 (10gmodena) [11:45:23] 06Data-Engineering, 10Dumps 2.0, 10Event-Platform, 13Patch-For-Review: [SPIKE] how can we support Spark producer/consumers in Event Platform - https://phabricator.wikimedia.org/T374341#10228638 (10pfischer) @Ottomata, any final thoughts on the naming of the output format? That appears to be the only open q... [12:11:53] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform, 06Traffic, 10Data Products (Data Products Sprint 20 🎯), 13Patch-For-Review: NEW BUG REPORT - Issues in calculation logic for unique devices tables - https://phabricator.wikimedia.org/T375527#10228684 (10Milimetric) [12:19:06] 06Data-Engineering, 06Data-Platform, 06DBA, 07Schema-change-in-production: Change page.page_links_updated to fixed-length timestamp in wmf wikis - https://phabricator.wikimedia.org/T371742#10228699 (10Ladsgroup) [12:58:44] 06Data-Engineering, 10Dumps 2.0, 03Discovery-Search (Current work), 10Event-Platform, 13Patch-For-Review: [SPIKE] how can we support Spark producer/consumers in Event Platform - https://phabricator.wikimedia.org/T374341#10228886 (10Gehel) [14:50:27] (03PS1) 10Aqu: Event deduplication via windowing [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1080306 (https://phabricator.wikimedia.org/T369845) [15:43:20] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Movement-Insights, 10Data-Platform-SRE (2024.09.28 - 2024.10.18): 2024-10-10 Data Loss Incident - webrequest Hive table - https://phabricator.wikimedia.org/T376882#10229874 (10Gehel) [15:45:40] 06Data-Engineering, 06Discovery-Search, 10Data-Platform-SRE (2024.09.28 - 2024.10.18): Upload an image with flink-k8s-operator version that supports flink 1.19 - https://phabricator.wikimedia.org/T377137#10229895 (10Gehel) [15:46:09] 06Data-Engineering, 06Discovery-Search, 10Data-Platform-SRE (2024.09.28 - 2024.10.18): Create and distribute a flink base image with flink 1.19.1 - https://phabricator.wikimedia.org/T377134#10229893 (10Gehel) [15:47:45] 06Data-Engineering, 10Data-Platform-SRE (2024.09.28 - 2024.10.18), 03Discovery-Search (Current work): Unable to find ingested tables in datahub - https://phabricator.wikimedia.org/T376657#10229912 (10Gehel) [15:51:32] (03CR) 10Joal: [C:03+1] "Yay!" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1080306 (https://phabricator.wikimedia.org/T369845) (owner: 10Aqu) [15:54:12] 06Data-Engineering, 10Data-Platform-SRE (2024.09.28 - 2024.10.18), 03Discovery-Search (Current work): Unable to find ingested tables in datahub - https://phabricator.wikimedia.org/T376657#10229941 (10BTullis) p:05Triage→03Medium [15:54:58] 06Data-Engineering, 10Data-Platform-SRE (2024.09.28 - 2024.10.18), 03Discovery-Search (Current work): Unable to find ingested tables in datahub - https://phabricator.wikimedia.org/T376657#10229938 (10BTullis) a:05EBernhardson→03BTullis [16:01:04] 06Data-Engineering, 10Data Pipelines: Add support for repository artifacts in Airflow - https://phabricator.wikimedia.org/T322690#10229982 (10amastilovic) > But for this MR, what do you think of limiting the change to just restricting to fsspec, perhaps, perhaps by just renaming FsArtifactSource and FsArtifact... [16:19:22] 06Data-Engineering, 10MediaWiki-extensions-WikimediaEvents, 10Observability-Metrics, 10Event-Platform, and 3 others: Add Prometheus support to statsd.js via mw.track() - https://phabricator.wikimedia.org/T355837#10230114 (10Ottomata) Hello! We just had a [[ https://docs.google.com/document/d/12omoVrYDfHMA... [16:21:44] (03CR) 10Ottomata: Event deduplication via windowing (031 comment) [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1080306 (https://phabricator.wikimedia.org/T369845) (owner: 10Aqu) [16:22:02] 06Data-Engineering, 10Event-Platform, 10Web Team Essential Work 2024 (Migrate to new Event Platform), 10Web-Team-Backlog (FY2024-25 Q2 Sprint 2): Deprecate use of desktop- and mobilewebuiactions in Event Platform - https://phabricator.wikimedia.org/T368678#10230134 (10ovasileva) [16:23:20] 06Data-Engineering, 10Event-Platform, 10MW-1.43-notes (1.43.0-wmf.27; 2024-10-15), 13Patch-For-Review, and 2 others: Delete redundant mobile- and desktopwebuiactions event in WikimediaEvents - https://phabricator.wikimedia.org/T376065#10230144 (10ovasileva) [16:23:41] 06Data-Engineering, 10Event-Platform, 10MW-1.43-notes (1.43.0-wmf.27; 2024-10-15), 13Patch-For-Review, and 2 others: Delete redundant mobile- and desktopwebuiactions event in WikimediaEvents - https://phabricator.wikimedia.org/T376065#10230147 (10ovasileva) a:05Jdlrobson→03KSarabia-WMF [16:30:14] 06Data-Engineering, 10MediaWiki-extensions-WikimediaEvents, 10Observability-Metrics, 10Event-Platform, and 3 others: Add Prometheus support to statsd.js via mw.track() - https://phabricator.wikimedia.org/T355837#10230184 (10gmodena) +1 for consolidation. Will this add significant traffic volumes to `even... [17:17:55] 06Data-Engineering, 10Dumps 2.0, 03Discovery-Search (Current work), 10Event-Platform, 13Patch-For-Review: [SPIKE] how can we support Spark producer/consumers in Event Platform - https://phabricator.wikimedia.org/T374341#10230468 (10Ottomata) > any final thoughts on the naming of the output format? Either... [17:26:10] 06Data-Engineering, 10Dumps 2.0, 03Discovery-Search (Current work), 10Event-Platform, 13Patch-For-Review: [SPIKE] how can we support Spark producer/consumers in Event Platform - https://phabricator.wikimedia.org/T374341#10230502 (10Ottomata) @pfischer just curious about your thoughts about the last part... [17:32:46] 06Data-Engineering, 10Dumps 2.0, 03Discovery-Search (Current work), 10Event-Platform, 13Patch-For-Review: [SPIKE] how can we support Spark producer/consumers in Event Platform - https://phabricator.wikimedia.org/T374341#10230527 (10Ottomata) Oh, I see the other usages of 'event-stream', e.g. 'event-strea... [17:39:51] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10Dumps 2.0 (Kanban Board), 13Patch-For-Review: Flink job to enrich reconciliation events - https://phabricator.wikimedia.org/T368787#10230562 (10xcollazo) [17:40:33] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10Dumps 2.0 (Kanban Board), 13Patch-For-Review: MediaWiki Reconciliation API - https://phabricator.wikimedia.org/T368782#10230542 (10xcollazo) 05Open→03Declined I am closing this ticket, as we have abandoned the idea of building this reconcili... [17:58:37] 06Data-Engineering, 10Dumps 2.0 (Kanban Board): [Iceberg Migration] Extend Iceberg table maintenance mechanism to support data rewrite - https://phabricator.wikimedia.org/T373694#10230660 (10xcollazo) 05In progress→03Resolved [19:00:24] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform, 06Traffic, 10Data Products (Data Products Sprint 20 🎯), 13Patch-For-Review: NEW BUG REPORT - Issues in calculation logic for unique devices tables - https://phabricator.wikimedia.org/T375527#10230899 (10Mayakp.wiki) [19:25:36] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10231052 (10nshahquinn-wmf) >>! In T356701#10175441, @fkaelin wrote: > What about also adding a `... [19:43:18] 06Data-Engineering, 06Data-Platform-SRE: Upgrade Spark to a version with long term Iceberg support, and with fixes to support Dumps 2.0 - https://phabricator.wikimedia.org/T338057#10231095 (10nshahquinn-wmf) [20:22:32] 06Data-Engineering, 10Event-Platform, 10MW-1.43-notes (1.43.0-wmf.27; 2024-10-15), 13Patch-For-Review, and 2 others: Delete redundant mobile- and desktopwebuiactions event in WikimediaEvents - https://phabricator.wikimedia.org/T376065#10231261 (10Jdlrobson) a:05KSarabia-WMF→03Edtadros [20:22:58] 06Data-Engineering, 10Event-Platform, 10MW-1.43-notes (1.43.0-wmf.27; 2024-10-15), 13Patch-For-Review, and 2 others: Delete redundant mobile- and desktopwebuiactions event in WikimediaEvents - https://phabricator.wikimedia.org/T376065#10231266 (10Jdlrobson) p:05Medium→03High [21:06:45] 06Data-Engineering, 10Data Pipelines: Add support for repository artifacts in Airflow - https://phabricator.wikimedia.org/T322690#10231492 (10Ottomata) Awesome! Added some comments. > FsVersionedArtifactCache refactored to accept a callable argument that provides the final component of the cache output path,... [21:16:13] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10231538 (10Ottomata) Hm, interesting! TIL about MW's user_is_permanent status. That's new then?... [21:22:17] 06Data-Engineering, 10CheckUser, 06Data Products, 06DBA, 07Schema-change-in-production: Remove cuc_actiontext, cuc_only_for_read_old, and cuc_private from cu_changes on WMF wikis - https://phabricator.wikimedia.org/T370903#10231581 (10Ladsgroup) [22:08:47] 10Analytics-Canonical-Data, 06Movement-Insights: Periodically update the canonical wiki dataset while Neil is on sabbatical - https://phabricator.wikimedia.org/T372018#10231734 (10nshahquinn-wmf) 05Open→03Resolved Thank you for taking care of this, @Hghani! You're released from duty 😁 [22:25:38] 06Data-Engineering, 10CheckUser, 06Data Products, 06DBA, 07Schema-change-in-production: Remove cuc_actiontext, cuc_only_for_read_old, and cuc_private from cu_changes on WMF wikis - https://phabricator.wikimedia.org/T370903#10231792 (10Ladsgroup) [22:30:46] 06Data-Engineering, 10CheckUser, 06Data Products, 06DBA, 07Schema-change-in-production: Remove cuc_actiontext, cuc_only_for_read_old, and cuc_private from cu_changes on WMF wikis - https://phabricator.wikimedia.org/T370903#10231808 (10Ladsgroup) [22:49:02] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10231873 (10nshahquinn-wmf) >>! In T356701#10231536, @Ottomata wrote: > Hm, interesting! TIL abou... [23:12:43] 06Data-Engineering, 10Data Pipelines: Add support for repository artifacts in Airflow - https://phabricator.wikimedia.org/T322690#10231923 (10amastilovic) > Cool! How will this be used via artifact.yaml config? It won't :-) Joking aside, HDFS sync aka Blunderbuss has full control of Artifact library so it wi... [23:59:26] 06Data-Engineering, 10Event-Platform, 10Web Team Essential Work 2024 (Migrate to new Event Platform), 10Web-Team-Backlog (FY2024-25 Q2 Sprint 2): Deprecate use of desktop- and mobilewebuiactions in Event Platform - https://phabricator.wikimedia.org/T368678#10232018 (10KSarabia-WMF) @Edtadros Sorry for the...