[00:31:17] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform: [NEEDS GROOMING] We should improve the code health of gobblin-wmf - https://phabricator.wikimedia.org/T370368#9993256 (Ottomata) > Automate the process of adding gobblin-wmf jars to refine, or at least document the git lfs manual proc...
[02:23:57] Data-Engineering, DC-Ops, ops-eqiad, SRE: Q4:rack/setup/install an-conf100[4-6] - https://phabricator.wikimedia.org/T364429#9993327 (Papaul) @Jclark-ctr check first with the service owner if those servers are ready for puppet 7 if they need to added to "hieradata/hosts" with ` profile::puppet::a...
[08:44:05] Data-Engineering, [DEPRECATED] wdwb-tech, Citoid, Content-Transform-Team, and 9 others: Migrate node-based services in production to node18 - https://phabricator.wikimedia.org/T349118#9993694 (Pginer-WMF)
[09:03:28] Data-Engineering, Data Products, DBA, Schema-change-in-production: Drop gb_by from globalblocks table - https://phabricator.wikimedia.org/T370394 (Zabe) NEW
[09:04:31] Data-Engineering, Data Products, DBA, Schema-change-in-production: Drop gb_by from globalblocks table - https://phabricator.wikimedia.org/T370394#9993854 (Zabe)
[09:37:56] Data-Engineering (Q1 2024 July 1st - September 30th), Dumps 2.0 (Kanban Board), Event-Platform, MW-1.43-notes (1.43.0-wmf.14; 2024-07-16), Patch-For-Review: [Event Platform] Instrument EventBus with prometheus MW Statslib - https://phabricator.wikimedia.org/T363587#9993948 (gmodena)
[09:38:03] Data-Engineering (Q1 2024 July 1st - September 30th), Dumps 2.0 (Kanban Board), Event-Platform, MW-1.43-notes (1.43.0-wmf.14; 2024-07-16), Patch-For-Review: [Event Platform] Instrument EventBus with prometheus MW Statslib - https://phabricator.wikimedia.org/T363587#9993949 (gmodena) A dashboa...
[09:41:25] Data-Engineering, Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization: Validate CI integration so that Ci can release Maven artifacts on user's demand - https://phabricator.wikimedia.org/T367403#9993971 (Gehel) Currently, only the [[ https://gerrit.wikimedia.org/r/c/integration/config...
[09:42:21] Data-Engineering, Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Validate CI integration so that Ci can release Maven artifacts on user's demand - https://phabricator.wikimedia.org/T367403#9993973 (Gehel)
[09:49:36] Data-Engineering, Discovery-Search, Java-Scala-Standardization: Java projects hosted on Gerrit should publish artifacts to Gitlab - https://phabricator.wikimedia.org/T370400 (Gehel) NEW
[09:51:14] Data-Engineering, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Java projects hosted on Gerrit should publish artifacts to Gitlab - https://phabricator.wikimedia.org/T370400#9994019 (Gehel) @brennen : Do you have an opinion on where to host artifacts coming from projects...
[09:51:17] Data-Engineering, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Java projects hosted on Gerrit should publish artifacts to Gitlab - https://phabricator.wikimedia.org/T370400#9994024 (Gehel)
[10:18:49] Data-Engineering, Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Validate CI integration so that Ci can release Maven artifacts on user's demand - https://phabricator.wikimedia.org/T367403#9994109 (hashar) The images do not have a SSH client by defa...
[10:57:26] Data-Engineering, Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization, and 2 others: Validate CI integration so that Ci can release Maven artifacts on user's demand - https://phabricator.wikimedia.org/T367403#9994191 (hashar) The ssh client is now available in the image `docker-regist...
[11:14:25] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform: [NEEDS GROOMING] We should improve the code health of gobblin-wmf - https://phabricator.wikimedia.org/T370368#9994221 (gmodena)
[11:14:38] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform: [NEEDS GROOMING] We should improve the code health of gobblin-wmf - https://phabricator.wikimedia.org/T370368#9994224 (gmodena) > When gobblin moves to airflow, we can use Artifact sync to deploy the jar, instead of relying on analytic...
[13:07:53] Data-Engineering (Q1 2024 July 1st - September 30th): Develop Airflow ExternalTaskSensor to orchestrate DAG dependencies - https://phabricator.wikimedia.org/T369900#9994589 (lbowmaker)
[13:08:17] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform, Patch-For-Review: Migrate Data Engineering NodeJS library repos to GitLab - https://phabricator.wikimedia.org/T366611#9994593 (Snwachukwu)
[13:09:26] Data-Engineering (Q1 2024 July 1st - September 30th): [Refine Refactoring] Switch new Refine system outputs to production location and monitor - https://phabricator.wikimedia.org/T369845#9994598 (lbowmaker)
[13:10:40] Data-Engineering (Q1 2024 July 1st - September 30th), Data Pipelines: Improve Airflow DAG testing process - https://phabricator.wikimedia.org/T368944#9994603 (lbowmaker)
[13:11:25] Data-Engineering (Q1 2024 July 1st - September 30th), Data-Platform: 7 new wikis missing from mediawiki_history - https://phabricator.wikimedia.org/T368788#9994605 (lbowmaker)
[13:12:11] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform: [Event Platform] - Add schema CI test that array ensures properties with object types also enumerate object properties - https://phabricator.wikimedia.org/T366562#9994609 (lbowmaker)
[13:12:46] Data-Engineering (Q1 2024 July 1st - September 30th): Implement automatic sync of refinery HQL files to HDFS - https://phabricator.wikimedia.org/T365659#9994614 (lbowmaker)
[13:13:29] Data-Engineering (Q1 2024 July 1st - September 30th): Migrate refinery HQL files to CI/CD supported GitLab repository - https://phabricator.wikimedia.org/T362832#9994618 (lbowmaker)
[13:14:11] Data-Engineering (Q1 2024 July 1st - September 30th), Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization: Migrate existing Java packages to deploying to Gitlab, including new version of parent pom, validation that all dependencies are a... - https://phabricator.wikimedia.org/T367405#9994620
[13:16:34] Data-Engineering (Q1 2024 July 1st - September 30th), Movement-Insights, Epic: [Data Quality] Implement basic data quality metrics for Unique Devices datasets - https://phabricator.wikimedia.org/T357833#9994627 (lbowmaker)
[13:17:40] Data-Engineering (Q1 2024 July 1st - September 30th), Event-Platform: Event Platform schemas should not support type changes to structs as array element or map value types - https://phabricator.wikimedia.org/T366487#9994635 (lbowmaker)
[13:19:42] Data-Engineering (Q1 2024 July 1st - September 30th): [SPIKE] Define process to build out lineage in DataHub - https://phabricator.wikimedia.org/T369758#9994643 (lbowmaker) Moving to backlog for now. We will try this approach first: https://phabricator.wikimedia.org/T306896
[13:20:20] Data-Engineering: [SPIKE] Define process to build out lineage in DataHub - https://phabricator.wikimedia.org/T369758#9994646 (lbowmaker)
[13:27:52] Data-Engineering, Event-Platform: Implement stream of HTML content on mw.page_change event - https://phabricator.wikimedia.org/T360794#9994672 (lbowmaker) Hi @Isaac - we were hoping to get to this last quarter but didn’t manage to. This quarter, we are now working on the Dumps 2.0 implementation, annual...
[13:29:49] Data-Engineering, DC-Ops, ops-eqiad, SRE: Q4:rack/setup/install an-conf100[4-6] - https://phabricator.wikimedia.org/T364429#9994678 (ops-monitoring-bot) Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host an-conf1004.eqiad.wmnet with OS bookworm
[13:50:52] (PS1) Peter Fischer: Introducing cirrussearch/weighted_tags [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253)
[13:51:24] (CR) CI reject: [V:-1] Introducing cirrussearch/weighted_tags [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253) (owner: Peter Fischer)
[13:52:58] (PS2) Peter Fischer: Introducing cirrussearch/weighted_tags [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253)
[14:21:18] (CR) DCausse: Introducing cirrussearch/weighted_tags (3 comments) [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253) (owner: Peter Fischer)
[14:21:54] (CR) DCausse: "adding Erik to the discussion" [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253) (owner: Peter Fischer)
[14:40:25] Data-Engineering, DC-Ops, ops-eqiad, SRE: Q4:rack/setup/install an-conf100[4-6] - https://phabricator.wikimedia.org/T364429#9994990 (ops-monitoring-bot) Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host an-conf1004.eqiad.wmnet with OS bookworm executed with errors: - an-co...
[14:50:05] (PS22) Aqu: Refactor Refine to be triggerd by Airflow [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1016808 (https://phabricator.wikimedia.org/T356762)
[14:58:15] Data-Engineering, Data-Platform-SRE, SRE, SRE-Access-Requests: Streamline Data Platform access approvals for WMF staff - https://phabricator.wikimedia.org/T370424 (Ottomata) NEW
[15:03:09] Data-Engineering, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Java projects hosted on Gerrit should publish artifacts to Gitlab - https://phabricator.wikimedia.org/T370400#9995143 (Ottomata) @gehel, would it be easier if projects are hosted in GitLab? We are planning...
[15:16:39] Data-Engineering, Data-Platform, Event-Platform, Wikimedia-production-error: PHP Warning: Invalid argument supplied for foreach() - https://phabricator.wikimedia.org/T370428 (thcipriani) NEW
[15:23:22] Data-Engineering, Data-Platform, Event-Platform, Wikimedia-production-error: PHP Warning: Invalid argument supplied for foreach() in EventBus.php - https://phabricator.wikimedia.org/T370428#9995295 (Ottomata) [[ https://gerrit.wikimedia.org/r/plugins/gitiles/mediawiki/extensions/EventBus/+/d3d136...
[15:23:26] Data-Engineering, Data-Platform, Event-Platform, Wikimedia-production-error: PHP Warning: Invalid argument supplied for foreach() in EventBus.php - https://phabricator.wikimedia.org/T370428#9995298 (Ottomata)
[15:52:42] Data-Engineering, Data-Platform-SRE, SRE, SRE-Access-Requests: Streamline Data Platform access approvals for WMF staff - https://phabricator.wikimedia.org/T370424#9995479 (odimitrijevic) Thanks @Ottomata. Ftr, I approve the proposal.
[16:15:47] Data-Engineering (Q1 2024 July 1st - September 30th): Airflow mapped tasks UI & metrics - https://phabricator.wikimedia.org/T357430#9995567 (Ottomata)
[16:15:52] Data-Engineering, Data Pipelines, Data-Platform-SRE (2024.07.08 - 2024.07.28), Patch-For-Review, Release-Engineering-Team (Seen): Upgrade Airflow to 2.9.3 - https://phabricator.wikimedia.org/T365449#9995566 (Ottomata)
[16:26:43] Data-Engineering, Data-Platform-SRE: an-launcher1002 /srv filling up mostly because of logs from dynamic mapped Airflow tasks - https://phabricator.wikimedia.org/T370437 (Ottomata) NEW
[16:28:20] Data-Engineering (Q1 2024 July 1st - September 30th), Data-Platform-SRE, Discovery-Search, Java-Scala-Standardization, Release-Engineering-Team: Validate CI integration so that Ci can release Maven artifacts on user's demand - https://phabricator.wikimedia.org/T367403#9995650 (lbowmaker)
[16:34:53] Data-Engineering (Q1 2024 July 1st - September 30th): Migrate and re-deploy eventstreams using new service runner - https://phabricator.wikimedia.org/T361769#9995660 (Ottomata) a:tchin
[16:39:57] Data-Engineering (Q1 2024 July 1st - September 30th): Migrate and re-deploy eventstreams using service-utils - https://phabricator.wikimedia.org/T361769#9995694 (tchin)
[16:57:41] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine Refactoring] Changes to EventStreamConfig needed for scheduling Refine via airflow - https://phabricator.wikimedia.org/T367134#9995754 (Ottomata)
[17:02:18] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine refactoring] Refine jobs should be scheduled by Airflow: implementation - https://phabricator.wikimedia.org/T356762#9995779 (Ottomata) As discussed in standup, I've rewritten this task to encompass the work attached to it.
[17:02:26] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine refactoring] Refine jobs should be scheduled by Airflow: implementation - https://phabricator.wikimedia.org/T356762#9995780 (Ottomata)
[17:02:26] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine Refactoring] Changes to EventStreamConfig needed for scheduling Refine via airflow - https://phabricator.wikimedia.org/T367134#9995781 (Ottomata)
[17:02:35] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine refactoring] Refine jobs should be scheduled by Airflow: implementation - https://phabricator.wikimedia.org/T356762#9995776 (Ottomata)
[17:05:39] Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review: [Refine refactoring] Refine jobs should be scheduled by Airflow: implementation - https://phabricator.wikimedia.org/T356762#9995784 (Ottomata)
[18:54:35] (CR) Mforns: "LGTM! +1 (because there's some testing lines that need to be restored). But after that I think this is ready for prod!" [analytics/refinery] - https://gerrit.wikimedia.org/r/1049281 (https://phabricator.wikimedia.org/T342267) (owner: Milimetric)
[18:56:38] Analytics-Data-Problem, Data-Engineering, Data-Engineering-Dashiki, Data Products (Data Products Sprint 16), and 2 others: Investigate surprising "10% Other" portion of Analytics Browsers report - https://phabricator.wikimedia.org/T342267#9996364 (mforns) Heya @Milimetric, sorry for taking so lon...
[19:23:15] (CR) Ebernhardson: Introducing cirrussearch/weighted_tags (1 comment) [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253) (owner: Peter Fischer)
[19:42:36] (CR) Ebernhardson: Introducing cirrussearch/weighted_tags (2 comments) [schemas/event/primary] - https://gerrit.wikimedia.org/r/1055226 (https://phabricator.wikimedia.org/T366253) (owner: Peter Fischer)
[21:41:37] Analytics-Radar, Data-Engineering, Data-Platform-SRE, serviceops-radar, and 2 others: Configuration Management for Kafka settings - https://phabricator.wikimedia.org/T276088#9996973 (bking) >>! In T276088#8443651, @Ottomata wrote: > To do the ACLs right we also need some authentication for Kafka....