[21:07:35] Data-Engineering (Sprint 6), Data-Platform-SRE (23/24 Q3 Milestone 1), Observability-Metrics, Patch-For-Review: Configure Airflow to send metrics to Prometheus - https://phabricator.wikimedia.org/T343232 (Gehel)
[21:08:05] Data-Platform-SRE (23/24 Q3 Milestone 1), Discovery-Search (Current work): Cirrus-streaming-updater test: validate relforge indices are correctly updated - https://phabricator.wikimedia.org/T350186 (Gehel)
[21:09:32] Data-Engineering (Sprint 6), Data-Platform-SRE (23/24 Q3 Milestone 1), Patch-For-Review: Monitor the availability of the spark history server deployments - https://phabricator.wikimedia.org/T353717 (Gehel)
[21:09:35] Data-Platform-SRE (23/24 Q3 Milestone 1), Patch-For-Review: Create a helm chart for Superset - https://phabricator.wikimedia.org/T352166 (Gehel)
[21:09:37] Data-Platform-SRE (23/24 Q3 Milestone 1): Refactor sre.wdqs.data-transfer to use new spicerack class api - https://phabricator.wikimedia.org/T347624 (Gehel)
[21:09:39] Data-Platform-SRE (23/24 Q3 Milestone 1): Check log rotation settings on airflow instances - https://phabricator.wikimedia.org/T339015 (Gehel)
[21:11:41] Data-Engineering (Sprint 6), Data-Platform-SRE (23/24 Q3 Milestone 1): Collect metrics from the spark-history server - https://phabricator.wikimedia.org/T353694 (Gehel)
[21:11:45] Data-Platform-SRE (23/24 Q3 Milestone 1): Root cause Archiva outage from 2023-09-24 - https://phabricator.wikimedia.org/T347343 (Gehel)
[21:16:27] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1001:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1001:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage
[22:06:29] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1001:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1001:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage