[09:40:08] 06Data-Engineering, 06Data Products, 10Observability-Logging, 06Traffic, 13Patch-For-Review: Move analytics log from Varnish to HAProxy - https://phabricator.wikimedia.org/T351117#9752321 (10gmodena) >> - there's couple of CRs pending (linked to this phab) and I'd like to have a second run on the event s... [10:57:44] 06Data-Engineering, 06cloud-services-team, 10Data-Services, 10Temporary accounts: Surface Temporary user information to Cloud Wiki Replicas - https://phabricator.wikimedia.org/T346679#9752609 (10kostajh) [11:12:53] (HdfsDataNodeHeapUsage) firing: Datanode heap usage on an-worker1167:51010 is above 90%. - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hadoop/Alerts#HDFS_Datanode_JVM_Heap_Usage - https://grafana.wikimedia.org/d/000000585/hadoop?var-hadoop_cluster=analytics-hadoop&panelId=1&fullscreen&orgId=1 - https://alerts.wikimedia.org/?q=alertname%3DHdfsDataNodeHeapUsage [11:17:53] (HdfsDataNodeHeapUsage) resolved: Datanode heap usage on an-worker1167:51010 is above 90%. - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hadoop/Alerts#HDFS_Datanode_JVM_Heap_Usage - https://grafana.wikimedia.org/d/000000585/hadoop?var-hadoop_cluster=analytics-hadoop&panelId=1&fullscreen&orgId=1 - https://alerts.wikimedia.org/?q=alertname%3DHdfsDataNodeHeapUsage [11:22:17] (03CR) 10Urbanecm: [C:03+2] Add analytics for Impressions, Success and Abandonment of account creation [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/962569 (https://phabricator.wikimedia.org/T300273) (owner: 10Cyndywikime) [11:22:57] (03Merged) 10jenkins-bot: Add analytics for Impressions, Success and Abandonment of account creation [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/962569 (https://phabricator.wikimedia.org/T300273) (owner: 10Cyndywikime) [14:13:12] (03CR) 10Xcollazo: [C:03+2] "Now that we are getting to a stable version of this code, it would be great to add some unit tests. Not sure whether this graph code is ea" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1023492 (https://phabricator.wikimedia.org/T358699) (owner: 10Mforns) [14:31:33] (03Merged) 10jenkins-bot: Correctly apply distanceToPrimary in CommonsCategoryGraphBuilder [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1023492 (https://phabricator.wikimedia.org/T358699) (owner: 10Mforns) [14:55:10] (03PS4) 10Mforns: Modify Commons Impact Metrics queries to ignore ancestor categories [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1023491 (https://phabricator.wikimedia.org/T358699) [14:55:16] (03CR) 10Mforns: [V:03+2] Modify Commons Impact Metrics queries to ignore ancestor categories [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1023491 (https://phabricator.wikimedia.org/T358699) (owner: 10Mforns) [16:10:30] 06Data-Engineering, 06Data Products, 10Observability-Logging, 06Traffic, 13Patch-For-Review: Move analytics log from Varnish to HAProxy - https://phabricator.wikimedia.org/T351117#9754000 (10gmodena) >> I like the overall idea, but I'd prefer to proceed DC-by-DC, in switching topics and shutting down Var... [16:24:03] (03PS1) 10Mforns: Update changelog for 0.2.37 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1025407 [16:24:38] (03CR) 10Mforns: [V:03+2 C:03+2] Update changelog for 0.2.37 [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1025407 (owner: 10Mforns) [16:29:55] Starting build #3 for job analytics-refinery-maven-release [16:55:19] Project analytics-refinery-maven-release build #3: 09SUCCESS in 25 min: https://integration.wikimedia.org/ci/job/analytics-refinery-maven-release/3/ [17:15:53] 06Data-Engineering, 06Data Products, 10FY2023-24-WE 2.1 Typography and palette customizations, 13Patch-For-Review, 10Web-Team-Backlog (FY2023-24 Q4 Sprint 2): Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962#9754428 (10KSarabia-WMF) a:05KSarabia-WMF→03phuedx [17:17:37] 06Data-Engineering, 06Data Products, 10FY2023-24-WE 2.1 Typography and palette customizations, 13Patch-For-Review, 10Web-Team-Backlog (FY2023-24 Q4 Sprint 2): Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962#9754427 (10Jdlrobson) @phuedx will review this. [17:31:05] (KafkaReplicationFactorTooLow) firing: ... [17:31:05] Kafka topic codfw.cirrussearch.update_pipeline.fetch_error replication factor is too low on jumbo-eqiad - https://wikitech.wikimedia.org/wiki/Kafka/Administration#Increase_a_topic's_replication_factor - https://grafana.wikimedia.org/d/000000234/kafka-by-topic?var-kafka_cluster=jumbo-eqiad&var-kafka_broker=All&var-topic=codfw.cirrussearch.update_pipeline.fetch_error&viewPanel=40 - ... [17:31:05] https://alerts.wikimedia.org/?q=alertname%3DKafkaReplicationFactorTooLow [17:36:05] (KafkaReplicationFactorTooLow) resolved: ... [17:36:05] Kafka topic codfw.cirrussearch.update_pipeline.fetch_error replication factor is too low on jumbo-eqiad - https://wikitech.wikimedia.org/wiki/Kafka/Administration#Increase_a_topic's_replication_factor - https://grafana.wikimedia.org/d/000000234/kafka-by-topic?var-kafka_cluster=jumbo-eqiad&var-kafka_broker=All&var-topic=codfw.cirrussearch.update_pipeline.fetch_error&viewPanel=40 - ... [17:36:05] https://alerts.wikimedia.org/?q=alertname%3DKafkaReplicationFactorTooLow [17:51:51] 06Data-Engineering, 06Data-Platform: NEW BUG REPORT Cannot add terms in DataHub glossary - https://phabricator.wikimedia.org/T363612#9754618 (10lbowmaker) [18:21:54] Starting build #3 for job analytics-refinery-update-jars [18:23:53] (03PS1) 10Maven-release-user: Add refinery-source jars for v0.2.37 to artifacts [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1024759 [18:23:54] Project analytics-refinery-update-jars build #3: 09SUCCESS in 2 min 0 sec: https://integration.wikimedia.org/ci/job/analytics-refinery-update-jars/3/ [18:34:44] 10Data-Engineering (Q4 2024 April 1st - June 30th), 10Event-Platform, 07Spike: [SPIKE] Can we express Event Platform configs in Datasets Config? - https://phabricator.wikimedia.org/T361017#9754923 (10gmodena) === Summary Tl;dr: We can easily express a stream config as jsonschema, and expose via [datasets... [18:36:21] 10Data-Engineering (Q4 2024 April 1st - June 30th), 10Event-Platform, 07Spike: [SPIKE] Can we express Event Platform configs in Datasets Config? - https://phabricator.wikimedia.org/T361017#9754944 (10gmodena) a:03gmodena [18:36:47] (03CR) 10Mforns: [V:03+2 C:03+2] Add refinery-source jars for v0.2.37 to artifacts [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1024759 (owner: 10Maven-release-user) [18:38:20] !log deployed refinery-source v0.2.37 [18:38:22] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [19:11:35] !log started refinery deployment (with v0.2.37 jars) [19:11:36] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [19:54:50] 10Quarry: [bug] Internal server error & backed up queue - https://phabricator.wikimedia.org/T363644#9755193 (10rook) →14Duplicate dup:03T362213 [19:54:58] 10Quarry: Error 500 when clicking "stop query" - https://phabricator.wikimedia.org/T362213#9755195 (10rook) [20:22:01] !log finished refinery deployment (with v0.2.37 jars) [20:22:21] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [20:59:49] !log deployed airflow-dags/analytics [20:59:51] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [22:07:23] (03CR) 10Aleksandar Mastilovic: [C:03+2] "LGTM!" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1024530 (owner: 10Cicalese)