[09:02:33] (PS10) Phuedx: Update the WikiLambda instrumentation to use core interaction events [schemas/event/secondary] - https://gerrit.wikimedia.org/r/992224 (https://phabricator.wikimedia.org/T350497) (owner: Santiago Faci)
[09:05:11] (CR) Phuedx: "PS10 fixes a trivial whitespace issue." [schemas/event/secondary] - https://gerrit.wikimedia.org/r/992224 (https://phabricator.wikimedia.org/T350497) (owner: Santiago Faci)
[09:06:06] (CR) Phuedx: [C:+2] "Thanks @DMartin for confirming that the properties in the analytics/product_metrics/wikilambda/common fragment aren't required." [schemas/event/secondary] - https://gerrit.wikimedia.org/r/992224 (https://phabricator.wikimedia.org/T350497) (owner: Santiago Faci)
[09:06:39] (Merged) jenkins-bot: Update the WikiLambda instrumentation to use core interaction events [schemas/event/secondary] - https://gerrit.wikimedia.org/r/992224 (https://phabricator.wikimedia.org/T350497) (owner: Santiago Faci)
[10:05:50] Data-Engineering, Observability-Logging, Traffic, Event-Platform, Patch-For-Review: Remove extra fields currently sent to Kafka - https://phabricator.wikimedia.org/T360642#9656880 (Fabfur) >>! In T360642#9655231, @Ottomata wrote: >> meta.id and meta.request_id > > `meta.id` is used to unique...
[11:34:30] Data-Engineering, Observability-Logging, Traffic, Patch-For-Review: Install new Benthos instance on cp hosts - https://phabricator.wikimedia.org/T358109#9657223 (Fabfur)
[12:03:10] Data-Engineering, Data Products (Epics Timeline): Bot Detection - https://phabricator.wikimedia.org/T321707#9657324 (VirginiaPoundstone)
[12:03:38] Data-Engineering, Data Products (Epics Timeline): Bot Detection - https://phabricator.wikimedia.org/T321707#9657325 (VirginiaPoundstone)
[12:03:39] Data-Engineering, Data Products, Pageviews-API, RESTBase-API, and 2 others: There are anomalies in some of the mostread data on zhwiki for March 2024 - https://phabricator.wikimedia.org/T360499#9657326 (VirginiaPoundstone)
[12:15:01] Data-Engineering, Data Products: NEW BUG REPORT - Pageviews Missing Hourly Partition - https://phabricator.wikimedia.org/T358142#9657388 (lbowmaker) Looks like the file is there now (ran a day or so later at: 21-Feb-2024 21:39) https://dumps.wikimedia.org/other/pageviews/2024/2024-02/pageviews-20240220-...
[12:46:43] Data-Engineering, Data-Platform-SRE (2024.03.25 - 2024.04.14), Patch-For-Review: Update the From: addresses of all email from DPE pipelines so that they use routable addresses - https://phabricator.wikimedia.org/T358675#9657473 (BTullis) Although the default email address for systemd timers has chang...
[12:57:11] Data-Engineering, Data-Platform-SRE (2024.03.25 - 2024.04.14), Patch-For-Review: Cleanup superset related resources from puppet - https://phabricator.wikimedia.org/T358570#9657489 (brouberol) Open→Resolved
[13:07:02] (PS8) Gmodena: development: add webrequest schema [schemas/event/primary] - https://gerrit.wikimedia.org/r/983898 (https://phabricator.wikimedia.org/T314956) (owner: Ottomata)
[13:07:29] (CR) CI reject: [V:-1] development: add webrequest schema [schemas/event/primary] - https://gerrit.wikimedia.org/r/983898 (https://phabricator.wikimedia.org/T314956) (owner: Ottomata)
[13:18:38] Data-Engineering, Data-Platform-SRE (2024.03.25 - 2024.04.14), Patch-For-Review: Update the From: addresses of all email from DPE pipelines so that they use routable addresses - https://phabricator.wikimedia.org/T358675#9657578 (BTullis) Oh right, it seems that RefinerySource has its own built-in ema...
[13:42:49] (PS1) Btullis: Update the from address of refine reports to be routable [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1014004 (https://phabricator.wikimedia.org/T358675)
[13:44:53] (PS2) Btullis: Update the from address of refine reports to be routable [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1014004 (https://phabricator.wikimedia.org/T358675)
[14:34:40] Data-Engineering, Spike: [SPIKE] [Dataset Config Store] - Design how config store feeds DataHub - https://phabricator.wikimedia.org/T360896 (lbowmaker) NEW
[15:02:56] !log updating the ssl_provider for eventstreams schema servers to cfssl for T360412
[15:02:59] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[15:02:59] T360412: Phase out cergen for Data Platform services - https://phabricator.wikimedia.org/T360412
[15:18:16] (Abandoned) Milimetric: [DNM] sqoop the data from the machinevision tables before they're dropped [analytics/refinery] - https://gerrit.wikimedia.org/r/1013531 (https://phabricator.wikimedia.org/T352884) (owner: Cparle)
[15:43:08] Data-Engineering (Sprint 9), Machine-Learning-Team, Wikimedia Enterprise, Epic, Event-Platform: [Event Platform] Implement PoC Event-Driven Data Pipeline for Revert Risk Model Scores using Event Platform Capabilities - https://phabricator.wikimedia.org/T338792#9658089 (lbowmaker)
[16:49:59] Data-Engineering, Structured-Data-Backlog: Bump memory to enable large artifacts sync on HDFS - https://phabricator.wikimedia.org/T348958#9658469 (xcollazo) Ah, good find! >>! In T348958#9651862, @Ottomata wrote: > > ` > $ curl -I https://gitlab.wikimedia.org/repos/structured-data/seal/-/package_files/...
[17:06:32] Data-Engineering (Q4 2024 April 1st - June 30th): [Dataset Config Store] Setup initial CI checks - https://phabricator.wikimedia.org/T357468#9658563 (lbowmaker)
[17:11:16] Data-Engineering, Spike: [Status Store] [SPIKE] Document Approach for Iceberg Sensors - https://phabricator.wikimedia.org/T360922 (lbowmaker) NEW
[17:14:19] Data-Engineering, Structured-Data-Backlog: Make HTML Dumps available in hadoop - https://phabricator.wikimedia.org/T305688#9658594 (lbowmaker)
[17:14:41] Data-Engineering (Q4 2024 April 1st - June 30th), Structured-Data-Backlog: Make HTML Dumps available in hadoop - https://phabricator.wikimedia.org/T305688#9658598 (lbowmaker)
[17:18:34] Data-Engineering, Event-Platform, GitLab (Pipeline Services Migration): Migrate Data Engineering Pipelinelib repos to GitLab - https://phabricator.wikimedia.org/T344730#9658609 (lbowmaker)
[17:31:15] Data-Engineering (Sprint 9): Improve service runner to better support metrics and debugging - https://phabricator.wikimedia.org/T360924 (Ahoelzl) NEW
[18:22:30] Quarry, ChangeProp, collaboration-services, GitLab, and 9 others: Figure out a plan to move forward with regarding Redis License changes - https://phabricator.wikimedia.org/T360596#9658856 (bd808)
[20:04:42] Data-Engineering (Sprint 9): We should provide DQ integration with Python - https://phabricator.wikimedia.org/T353940#9659288 (gmodena) > I need to add a wrapper to the Alert generation SerDe Done. There's an example at https://gitlab.wikimedia.org/gmodena/refinery-python/-/blob/main/examples/alerts.py?ref_...
[20:13:13] Data-Engineering (Sprint 9): We should provide DQ integration with Python - https://phabricator.wikimedia.org/T353940#9659326 (gmodena)
[20:33:48] Quarry, ChangeProp, collaboration-services, GitLab, and 9 others: Figure out a plan to move forward with regarding Redis License changes - https://phabricator.wikimedia.org/T360596#9659379 (Tgr) >>! In T360596#9652082, @Krinkle wrote: > In MediaWiki (as deployed at WMF), there exists 1 use of Red...
[20:51:53] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage
[22:09:03] (GobblinKafkaRecordsExtractedNotEqualRecordsExpected) firing: Gobblin job event_default ingested an unexpected number of records for a Kafka topic partition. ...
[22:09:04] - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Gobblin - https://grafana.wikimedia.org/d/pAQaJwEnk/gobblin?orgId=1&var-gobblin_job_name=event_default&var-kafka_topic=codfw.mediawiki.cirrussearch.page_rerender.v1&viewPanel=4 - https://alerts.wikimedia.org/?q=alertname%3DGobblinKafkaRecordsExtractedNotEqualRecordsExpected
[22:27:59] Data-Engineering, Data Pipelines, Patch-For-Review: [Refine refactoring] Refactor and migrate navigationtiming to Airflow - https://phabricator.wikimedia.org/T356192#9659728 (CodeReviewBot) aqu merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/617 Migrate and r...
[22:47:01] Data-Engineering, MediaWiki-extensions-WikimediaEvents, Web-Team-Backlog: Update mediawiki.web_ui_actions Stream Config - https://phabricator.wikimedia.org/T360955 (KSarabia-WMF) NEW
[22:49:34] Data-Engineering, MediaWiki-extensions-WikimediaEvents, Web-Team-Backlog: Update mediawiki.web_ui_actions Stream Config - https://phabricator.wikimedia.org/T360955#9659815 (KSarabia-WMF)
[22:51:53] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage
[23:09:04] (GobblinKafkaRecordsExtractedNotEqualRecordsExpected) resolved: Gobblin job event_default ingested an unexpected number of records for a Kafka topic partition. ...
[23:09:04] - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Gobblin - https://grafana.wikimedia.org/d/pAQaJwEnk/gobblin?orgId=1&var-gobblin_job_name=event_default&var-kafka_topic=codfw.mediawiki.cirrussearch.page_rerender.v1&viewPanel=4 - https://alerts.wikimedia.org/?q=alertname%3DGobblinKafkaRecordsExtractedNotEqualRecordsExpected