[00:03:53] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [02:50:53] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [06:51:08] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [06:55:53] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [08:23:36] FYI since an-coord1002 was decommissioned last week the cumin alias check reports: Alias hadoop-coordinator-secondary matched 0 hosts. I guess it's just a matter of puppet code cleanup as I still see the role assigned to an-coord1002 in site.pp for example. [11:09:44] I see that mori.tz has sent https://gerrit.wikimedia.org/r/c/operations/puppet/+/1016308 for the above [11:10:53] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [12:51:51] 10Quarry, 10ChangeProp, 06collaboration-services, 10GitLab, and 9 others: Figure out a plan to move forward with regarding Redis License changes - https://phabricator.wikimedia.org/T360596#9679718 (10jijiki) >>! In T360596#9676049, @akosiaris wrote: > > My 2, operationally minded, cents says to wait for th... [13:10:53] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [13:29:04] 10Data-Engineering (Q4 2024 April 1st - June 30th), 13Patch-For-Review: [Data Quality] Implement basic data quality metrics for MW history - https://phabricator.wikimedia.org/T354692#9680111 (10CodeReviewBot) ebysans merged https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/629... [14:26:23] 10Quarry: 14Move quarry to magnum - 14https://phabricator.wikimedia.org/T349029#9680448 (10rook) 05Open→03Resolved [17:58:19] 06Data-Engineering, 10[DEPRECATED] wdwb-tech, 10Citoid, 06Content-Transform-Team-WIP, and 11 others: Migrate node-based services in production to node18 - https://phabricator.wikimedia.org/T349118#9681758 (10Mvolz) [19:24:27] 10Data-Engineering (Q4 2024 April 1st - June 30th): [Data Quality] Migrate MWHistoryChecker to DeeQu checks - https://phabricator.wikimedia.org/T361016#9682045 (10Snwachukwu) a:03Snwachukwu [21:03:26] (03PS1) 10Santiago Faci: Updating changelog to prepare next deployment [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1016440