[09:58:43] FIRING: BenthosKafkaConsumerLag: Too many messages in jumbo-eqiad for group benthos-webrequest-sampled-live-franz - TODO - https://grafana.wikimedia.org/d/000000484/kafka-consumer-lag?var-cluster=jumbo-eqiad&var-datasource=eqiad%20prometheus/ops&var-consumer_group=benthos-webrequest-sampled-live-franz - https://alerts.wikimedia.org/?q=alertname%3DBenthosKafkaConsumerLag [10:03:43] RESOLVED: BenthosKafkaConsumerLag: Too many messages in jumbo-eqiad for group benthos-webrequest-sampled-live-franz - TODO - https://grafana.wikimedia.org/d/000000484/kafka-consumer-lag?var-cluster=jumbo-eqiad&var-datasource=eqiad%20prometheus/ops&var-consumer_group=benthos-webrequest-sampled-live-franz - https://alerts.wikimedia.org/?q=alertname%3DBenthosKafkaConsumerLag [16:01:22] o/ is there documentation on adding new things to modules/profile/manifests/prometheus/ops.pp ? I'm trying to figure my way through adding monitoring for the ceph endpoint in the apus clusters; I've found the comments on the prometheus::class_config but wikitech basically just says "add things to that file" [16:01:43] I can probably feel my way through it with only mild incompetence, but if there were docs I'm missing they might help :) [17:03:52] Further to the above, I'd be grateful for a review (and +1 if appropriate) of https://gerrit.wikimedia.org/r/c/operations/puppet/+/1084174 please? It's my first prometheus-related CR... [17:17:33] Emperor: didn't check the whole change but you'd need to add the new job config to line 2590 [17:18:10] (logging off but I can help review tomorrow if needed) [19:02:19] ta, have pushed an update with that