[16:01:18] <denisse>	 Hello team, someone asked on Slack if the WMF site was down (it's not, don't worry).
[16:01:19] <denisse>	 From what I understand the site is managed by Automatic therefore a downtime would not be visible in our Grafana instance.
[16:01:19] <denisse>	 Do we have a place where we can see if https://wikimediafoundation.org is down?
[16:07:16] <godog>	 denisse: yes, we monitor wikimediafoundation.org from within the infra and reaching out to the internet, via "watchrat"
[16:07:38] <godog>	 which is an internal name for the "thing" that replaced ... wait for it ... watchmouse
[16:07:44] <godog>	 https://grafana.wikimedia.org/d/GYciEga7z/watchrat is the dashboard
[16:13:49] <denisse>	 godog: Thanks a lot Filippo, I'll pass the word on the Slack thread.
[16:14:04] <volans>	 that dashboard show no data to me :)
[16:14:18] <denisse>	 Btw, watchmouse and watchrat are funny names. :)
[16:14:28] <godog>	 denisse: sure np!
[16:14:34] <denisse>	 volans: I think it only shows the non successful requests.
[16:14:41] <godog>	 yeah I think so too
[16:15:36] <volans>	 sure, but how does one know that it's all good vs we're not monitoring it anymore? :)
[16:15:44] <denisse>	 But I do think that can be a little bit confusing. It may be a good idea to also show the successful requests as that would not only make it less confusing but it'd also help us to know if the daemon is working correctly.
[16:16:09] <herron>	 volans: currently it's split with http response >400 on the left and blackbox probe errors on the right
[16:16:59] <herron>	 we could show all the healthy probes yeah
[16:17:35] <herron>	 annnnd nerd snipe successful
[16:42:30] <cdanis>	 denisse: I've considered also asking about Automattic adding our same NEL response headers to get data that way, but it hasn't been high-priority
[16:42:38] <cdanis>	 https://wikitech.wikimedia.org/wiki/Network_Error_Logging
[17:02:52] <herron>	 hopefully the watchrat dash is a bit clearer about what's being checked now