[01:41:16] 10Quarry: Quarry shows error: This web service cannot be reached - https://phabricator.wikimedia.org/T375988#10236338 (10GTrang) [01:41:17] 10Quarry: worker nodes issue with garbage collection - https://phabricator.wikimedia.org/T375997#10236339 (10GTrang) [01:43:16] 10Quarry: Quarry shows error: This web service cannot be reached - https://phabricator.wikimedia.org/T375988#10236341 (10GTrang) 05Resolved→03Open >>! In T375988#10186586, @rook wrote: > Quarry is working again. Though I didn't have time to investigate what is happening so this may happen again. Opening T375... [02:50:06] 10Analytics-Canonical-Data, 06Movement-Insights: Document fields of canonical wiki dataset - https://phabricator.wikimedia.org/T371766#10236402 (10nshahquinn-wmf) [06:03:12] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10236532 (10ABran-WMF) [10:53:11] 06Data-Engineering, 10[DEPRECATED] wdwb-tech, 10Citoid, 06Content-Transform-Team, and 9 others: Migrate node-based services in production to node18 - https://phabricator.wikimedia.org/T349118#10237211 (10Mvolz) >>! In T349118#10214863, @Ottomata wrote: >> What service are you using this for? > > https://w... [10:57:26] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10CheckUser, 06Privacy Engineering: Add cu_log_event and cu_private_event CheckUser tables to data lake - https://phabricator.wikimedia.org/T376752#10237224 (10Tgr) Yeah, this used to be a single table that got split up into multiple ones for engineer... [11:13:29] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10CheckUser, 06Privacy Engineering: Add cu_log_event and cu_private_event CheckUser tables to data lake - https://phabricator.wikimedia.org/T376752#10237274 (10Dreamy_Jazz) I have had some researchers working with the #tsp team have asked why these ta... [12:21:27] 06Data-Engineering, 06Data Products, 06DBA, 10GlobalBlocking, 07Schema-change-in-production: Change default for gb_autoblock_parent_id from 'NULL' to '0' - https://phabricator.wikimedia.org/T377444 (10Dreamy_Jazz) 03NEW [12:22:30] 06Data-Engineering, 06Data Products, 06DBA, 10GlobalBlocking, 07Schema-change-in-production: Change default for gb_autoblock_parent_id from 'NULL' to '0' - https://phabricator.wikimedia.org/T377444#10237490 (10Dreamy_Jazz) [12:35:10] 06Data-Engineering, 03Discovery-Search (Current work), 10Dumps 2.0 (Kanban Board), 10Event-Platform, 13Patch-For-Review: Bump eventutilities to support flink 1.19 - https://phabricator.wikimedia.org/T377130#10237520 (10gmodena) a:03gmodena [12:35:21] 06Data-Engineering, 03Discovery-Search (Current work), 10Dumps 2.0 (Kanban Board), 10Event-Platform, 13Patch-For-Review: Bump eventutilities to support flink 1.19 - https://phabricator.wikimedia.org/T377130#10237523 (10gmodena) [12:36:15] 06Data-Engineering, 10Dumps 2.0 (Kanban Board), 10Event-Platform: Update eventutilities_python wrappers to support Flink 1.19 - https://phabricator.wikimedia.org/T374359#10237525 (10gmodena) [12:36:37] 06Data-Engineering, 10Dumps 2.0 (Kanban Board), 10Event-Platform: Update eventutilities_python wrappers to support Flink 1.19 - https://phabricator.wikimedia.org/T374359#10237527 (10gmodena) [12:37:24] 06Data-Engineering, 03Discovery-Search (Current work), 07Epic, 10Event-Platform, 13Patch-For-Review: EPIC: Update flink jobs to support Flink 1.19 - https://phabricator.wikimedia.org/T376812#10237528 (10gmodena) [12:57:18] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10237615 (10ABran-WMF) [13:14:23] 06Data-Engineering, 10MediaWiki-extensions-WikimediaEvents, 10Observability-Metrics, 10Event-Platform, and 3 others: Add Prometheus support to statsd.js via mw.track() - https://phabricator.wikimedia.org/T355837#10237689 (10phuedx) >>! In T355837#10230184, @gmodena wrote: > Will this add significant traffi... [13:25:21] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10237757 (10ABran-WMF) [13:26:35] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10237761 (10Milimetric) +1 on `user_is_permanent`. I'll make it so in all the temp accounts work... [13:26:38] 06Data-Engineering, 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10237762 (10ABran-WMF) ["db2155", "db2172", "db2219"] were missing on s4, doing them before doing dc masters [13:32:22] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10Dumps 2.0 (Kanban Board): Enable HA for the mw-dump-rev-content-reconcile-enrich flink application - https://phabricator.wikimedia.org/T375176#10237805 (10gmodena) a:03tchin [13:42:08] (03PS1) 10Aqu: Event deduplication via windowing - backport on 0.2.49 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1081154 (https://phabricator.wikimedia.org/T369845) [13:42:15] (03CR) 10CI reject: [V:04-1] Event deduplication via windowing - backport on 0.2.49 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1081154 (https://phabricator.wikimedia.org/T369845) (owner: 10Aqu) [13:52:42] (03Abandoned) 10Aqu: Event deduplication via windowing - backport on 0.2.49 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1081154 (https://phabricator.wikimedia.org/T369845) (owner: 10Aqu) [13:53:13] (03PS1) 10Aqu: Event deduplication via windowing - backport on 0.2.49 [analytics/refinery/source] (0.2.49) - 10https://gerrit.wikimedia.org/r/1081164 (https://phabricator.wikimedia.org/T369845) [13:55:50] 06Data-Engineering, 10[DEPRECATED] wdwb-tech, 10Citoid, 06Content-Transform-Team, and 9 others: Migrate node-based services in production to node18 - https://phabricator.wikimedia.org/T349118#10237935 (10akosiaris) >>! In T349118#10237211, @Mvolz wrote: > Well, I'm not sure we need them but we have the de... [14:04:18] 10Data-Engineering (Q2 2024 October 1st - December 31th), 13Patch-For-Review: Some Gobblin folders don't have `_IMPORTED` flags - https://phabricator.wikimedia.org/T376144#10237971 (10JAllemandou) After a talk with @Antoine_Quhen and @Ottomata we've decided to simplify the Gobblin algorithm: we'll use what wou... [14:11:22] 10Data-Engineering (Q2 2024 October 1st - December 31th): Handle Late-Arrived Events from Gobblin into Airflow triggered Refine - https://phabricator.wikimedia.org/T370665#10238015 (10JAllemandou) While talking with @Ottomata , I realized that we can relatively easily monitor late-arrived events in Gobblin when... [15:45:08] 06Data-Engineering, 10[DEPRECATED] wdwb-tech, 10Citoid, 06Content-Transform-Team, and 9 others: Migrate node-based services in production to node18 - https://phabricator.wikimedia.org/T349118#10238469 (10Ottomata) > First thing would be to set num_workers to 0 in the Chart. This shortcircuits service-runne... [16:28:00] 06Data-Engineering, 10Event-Platform, 10Web Team Essential Work 2024 (Migrate to new Event Platform), 10Web-Team-Backlog (FY2024-25 Q2 Sprint 2): Deprecate use of desktop- and mobilewebuiactions in Event Platform - https://phabricator.wikimedia.org/T368678#10238691 (10KSarabia-WMF) a:05Edtadros→03SToyof... [16:28:46] 06Data-Engineering, 10Event-Platform, 10Web Team Essential Work 2024 (Migrate to new Event Platform), 10Web-Team-Backlog (FY2024-25 Q2 Sprint 2): Deprecate use of desktop- and mobilewebuiactions in Event Platform - https://phabricator.wikimedia.org/T368678#10238690 (10KSarabia-WMF) @SToyofuku-WMF For QA,... [17:08:51] 14Analytics, 06Data-Engineering, 10AQS2.0, 06Data Products, 10Pageviews-API: Track page views by page ID rather than title (handles moved pages) - https://phabricator.wikimedia.org/T159046#10238890 (10Ottomata) [17:08:53] 06Data-Engineering, 10Temporary accounts, 10Event-Platform: Update Data Engineering-owned products that may be affected by IP Masking - https://phabricator.wikimedia.org/T326875#10238907 (10Ottomata) [17:08:54] 06Data-Engineering, 06Data Products, 10MediaWiki-extensions-EventLogging, 10Temporary accounts: Prepare EventLogging for temp accounts - https://phabricator.wikimedia.org/T374812#10238908 (10Ottomata) [17:08:56] 06Data-Engineering, 10Temporary accounts, 10Event-Platform: Prepare EventBus for temp accounts - https://phabricator.wikimedia.org/T374811#10238909 (10Ottomata) [17:09:08] 06Data-Engineering, 10DPE Temporary Accounts, 10Temporary accounts, 10Event-Platform: Update Data Engineering-owned products that may be affected by IP Masking - https://phabricator.wikimedia.org/T326875#10238910 (10Ahoelzl) [17:13:00] 06Data-Engineering, 10DPE Temporary Accounts, 10Temporary accounts, 10Event-Platform: Update Data Engineering-owned products that may be affected by IP Masking - https://phabricator.wikimedia.org/T326875#10238912 (10Ottomata) [17:13:00] 06Data-Engineering, 10DPE Temporary Accounts, 10Temporary accounts, 10Event-Platform: Update Data Engineering-owned products that may be affected by IP Masking - https://phabricator.wikimedia.org/T326875#10238914 (10Ahoelzl) [17:14:08] 10Data-Engineering (Q2 2024 October 1st - December 31th), 13Patch-For-Review: Timeout hive-metastore locks - https://phabricator.wikimedia.org/T365563#10238932 (10Ottomata) a:03Antoine_Quhen [17:15:14] 14Analytics, 06Data-Engineering, 10AQS2.0, 06Data Products, 10Pageviews-API: Track page views by page ID rather than title (handles moved pages) - https://phabricator.wikimedia.org/T159046#10238921 (10Ottomata) [17:15:15] 10Data-Engineering (Q2 2024 October 1st - December 31th), 13Patch-For-Review: Timeout hive-metastore locks - https://phabricator.wikimedia.org/T365563#10238931 (10Ottomata) [17:15:53] 06Data-Engineering, 10probenet: Include geocoded subdivision ISO code in webrequest table - https://phabricator.wikimedia.org/T365594#10238935 (10Ottomata) 05Open→03Resolved a:03Ottomata @JAllemandou This looks like it is done, please reopen if I am incorrect. [17:16:10] 14Analytics, 06Data-Engineering, 10AQS2.0, 06Data Products, 10Pageviews-API: Track page views by page ID rather than title (handles moved pages) - https://phabricator.wikimedia.org/T159046#10238924 (10Ottomata) [17:20:45] 06Data-Engineering, 06Data-Platform-SRE: MaxMind seems to be mapping the same IP to different countries - https://phabricator.wikimedia.org/T366369#10238956 (10Ottomata) [17:21:14] 06Data-Engineering, 06Data-Platform-SRE: MaxMind seems to be mapping the same IP to different countries - https://phabricator.wikimedia.org/T366369#10238958 (10Ottomata) @BTullis I added DPE-SRE. Can you look into this and see if the version of the maxmind dbs is the same on all hadoop workers? [17:22:28] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10238962 (10nshahquinn-wmf) >>! In T356701#10237761, @Milimetric wrote: > +1 on `user_is_permanen... [17:27:05] 10Data-Engineering (Q2 2024 October 1st - December 31th), 07Epic: [Epic] Migrate Data Engineering maintained NodeJS repositories to GitLab - https://phabricator.wikimedia.org/T366614#10238974 (10Ahoelzl) [17:27:06] 10Data-Engineering (Q2 2024 October 1st - December 31th), 07Epic: [Epic] Migrate Data Engineering maintained NodeJS repositories to GitLab - https://phabricator.wikimedia.org/T366614#10238976 (10Ahoelzl) [17:27:52] 06Data-Engineering, 10AQS2.0, 10Cassandra, 06Data Products: DELETE mechanism for Cassandra Analytics datasets - https://phabricator.wikimedia.org/T366631#10238979 (10Ahoelzl) [17:31:28] 06Data-Engineering, 10Event-Platform: Migrate Event Platform Schema Respositories to Gitlab - https://phabricator.wikimedia.org/T366836#10238989 (10Ahoelzl) [17:32:23] 10Data-Engineering (Q2 2024 October 1st - December 31th): Migrate Event Platform Schema Respositories to Gitlab - https://phabricator.wikimedia.org/T366836#10238992 (10Ottomata) [17:34:05] 10Data-Engineering (Q2 2024 October 1st - December 31th): Migrate Event Platform Schema Respositories to Gitlab - https://phabricator.wikimedia.org/T366836#10239012 (10Ahoelzl) [17:34:57] 10Data-Engineering (Q2 2024 October 1st - December 31th): Migrate Event Platform Schema Respositories to Gitlab - https://phabricator.wikimedia.org/T366836#10238999 (10Ahoelzl) [17:39:37] 06Data-Engineering, 06Java-Scala-Standardization: Update hdfs-tools to new parent pom - https://phabricator.wikimedia.org/T377492 (10Ottomata) 03NEW [17:39:58] 06Data-Engineering, 06Data-Platform-SRE, 06Java-Scala-Standardization, 13Patch-For-Review: Migrate WMF Maven projects to new parent pom - https://phabricator.wikimedia.org/T360219#10239025 (10Ottomata) [17:41:36] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform, 10Temporary accounts: Add MW table 'cu_log' to data lake - https://phabricator.wikimedia.org/T364398#10239049 (10Ahoelzl) [17:42:33] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10239056 (10Ottomata) @Milimetric do we sqoop these? [17:42:49] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10239060 (10Ahoelzl) [17:43:08] 06Data-Engineering, 06Data-Platform-SRE, 06Java-Scala-Standardization, 13Patch-For-Review: Migrate WMF Maven projects to new parent pom - https://phabricator.wikimedia.org/T360219#10239040 (10Ottomata) [17:44:17] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data Products, 06DBA, 07Schema-change-in-production: Cleanup revision table schema - https://phabricator.wikimedia.org/T367856#10239063 (10Ahoelzl) [17:50:01] 06Data-Engineering, 10CirrusSearch, 06Data Products, 10MediaWiki-extensions-EventLogging, and 3 others: Error: Call to a member function getPageAsLinkTarget() on null - https://phabricator.wikimedia.org/T368543#10239084 (10Ottomata) [17:52:42] 10Data-Engineering (Q2 2024 October 1st - December 31th): [Placeholder] Clean Up Corresponding Hive Tables After Deprecating Older Stream Configs - https://phabricator.wikimedia.org/T368800#10239119 (10Ahoelzl) [17:54:37] 10Data-Engineering (Q2 2024 October 1st - December 31th): [SPIKE] Define process to build out lineage in DataHub - https://phabricator.wikimedia.org/T369758#10239135 (10Ahoelzl) [17:55:00] 07Analytics-Data-Problem, 06Data-Engineering, 06Data Products, 10Pageviews-API: Missed pageview data over API - https://phabricator.wikimedia.org/T370108#10239136 (10Ottomata) [17:55:32] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform-SRE, 06SRE: Streamline Data Platform access approvals for WMF staff - https://phabricator.wikimedia.org/T370424#10239141 (10Ottomata) a:03Ottomata [17:55:35] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform-SRE, 06SRE: Streamline Data Platform access approvals for WMF staff - https://phabricator.wikimedia.org/T370424#10239142 (10Ahoelzl) [18:01:15] 06Data-Engineering, 10Data-Engineering-Wikistats, 06Data Products, 07dark-mode: Dark mode support for stats.wikimedia.org - https://phabricator.wikimedia.org/T370758#10239157 (10Ottomata) [18:01:43] 06Data-Engineering, 06Data Products, 10Wmfdata-Python: Specify Conda-Pack as a dependency - https://phabricator.wikimedia.org/T370718#10239155 (10Ottomata) [18:01:53] 10Data-Engineering (Q2 2024 October 1st - December 31th), 10CheckUser, 06Data Products, 06DBA, 07Schema-change-in-production: Remove cuc_actiontext, cuc_only_for_read_old, and cuc_private from cu_changes on WMF wikis - https://phabricator.wikimedia.org/T370903#10239166 (10Ahoelzl) [18:08:16] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Data-Platform, 10Temporary accounts: Add MW table 'cu_log' to data lake - https://phabricator.wikimedia.org/T364398#10239197 (10Ahoelzl) a:03Snwachukwu [19:08:20] 06Data-Engineering, 06Data Products, 06DBA, 10GlobalBlocking, 07Schema-change-in-production: Change default for gb_autoblock_parent_id from 'NULL' to '0' - https://phabricator.wikimedia.org/T377444#10239424 (10Ladsgroup) 05Open→03Resolved a:03Ladsgroup [19:24:43] 06Data-Engineering, 06Data Products, 06Data-Platform, 06Movement-Insights, and 2 others: Temporary Accounts Initiative (IP Masking) - Add user_is_temp to data tables - https://phabricator.wikimedia.org/T356701#10239444 (10Milimetric) @nshahquinn-wmf I agree on temporary vs temp but the field in the mariadb... [19:49:54] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06DBA, 07Schema-change-in-production: Drop deprecated abuse filter fields on wmf wikis - https://phabricator.wikimedia.org/T367781#10239507 (10Milimetric) >>! In T367781#10239056, @Ottomata wrote: > @Milimetric do we sqoop these? not in the regular... [20:00:55] 06Data-Engineering, 10CampaignEvents, 10EntitySchema, 10JsonConfig, and 15 others: Add namespace descriptions for Special:NamespaceInfo in WMF-deployed extensions - https://phabricator.wikimedia.org/T373070#10239539 (10SBisson) [20:01:17] 10Data-Engineering (Q2 2024 October 1st - December 31th), 06Movement-Insights, 10Data-Platform-SRE (2024.09.28 - 2024.10.18): 2024-10-10 Data Loss Incident - webrequest Hive table - https://phabricator.wikimedia.org/T376882#10239542 (10Ottomata) FYI, just updated Ops week page with docs on using Airflow cli... [20:56:46] 06Data-Engineering, 10Event-Platform, 10Web Team Essential Work 2024 (Migrate to new Event Platform), 10Web-Team-Backlog (FY2024-25 Q2 Sprint 2): Deprecate use of desktop- and mobilewebuiactions in Event Platform - https://phabricator.wikimedia.org/T368678#10239929 (10SToyofuku-WMF) a:05SToyofuku-WMF→03... [23:06:18] 06Data-Engineering, 10Data Pipelines: Add support for repository artifacts in Airflow - https://phabricator.wikimedia.org/T322690#10240378 (10amastilovic) Update: We've refactored the library to support `cache_key_fn` config parameter, which enabled us to get rid of `FsVersionedArtifactCache` in favor of simpl...