[09:01:35] any tip on how to handle the new service monitoring after reimage from the recipe, for databases? [09:02:06] jynus: not sure I get your question [09:02:46] "Attempt to run 'spicerack.icinga.IcingaHosts.wait_for_optimal..check' raised: Not all services are recovered: db2097" [09:03:03] jynus: ah, it is ok, you can let it expire [09:03:07] it will be just a warning [09:03:09] the reimage process waits for all icinga checks to be green [09:03:17] but they won't until I start mariadb [09:03:18] yeah, but it will timeout at somepoint [09:03:33] they won't be green either after starting mysql, as there will be lag [09:03:40] yeah that too [09:03:50] But you can let it timeout, the reimage process will end up just fine [09:03:57] maybe there was a parameter to ignore that [09:03:59] and will just issue a warning about that, but it will have exit 0 code [09:04:06] or you did some scripting after it, or something [09:04:22] No, I just ignore it and let them timeout [09:04:38] thanks, that helps- maybe we can give some feature requests to riccardo to accomodate our needs :-) [09:11:05] jynus: ahhhhhhh [09:11:06] > Oh, I see the source of confusion- as I am upgrading the hosts at the same time as restarting it, they will appear on both the bullseye and buster lists- hopefully that will be solved in the end. [09:11:08] 💡 [09:11:23] that explains something that confused me yesterday. thanks for figuring that out! [09:11:26] yeah, no problem [09:11:33] as long as people are aware of it [09:12:00] you just had hardcoded os_version variable on the code :-P [09:18:15] heads up now that I have your attention, that I will be reimaging the dbprov hosts- that means 1h in which some db restores will not be possible and you will have to use the ones on the other dc [09:19:05] ack [09:20:01] I didn't say the time- towards the end of the week [09:20:22] Friday afternoon! :-D [09:21:38] @kormat lol at your last update catching the debian installer [09:22:03] aw, crap [09:22:13] where's the 'revert' button on phab :( [09:29:18] you can poke around in the database... [09:29:32] * kormat hisses [09:34:46] argh. forgot to put in a downtime for s5/codfw, so expect a bunch of alert resolve messages in a bit, sorry. [11:33:15] https://twitter.com/_sysengineer/status/1511844001556619269 [12:15:54] 👍 👍 [12:25:21] Reedy: https://en.wikipedia.org/wiki/Pretty_Good_Privacy#Criminal_investigation [12:30:06] marostegui: are you currently running a schema change against the s2 codfw primary? [12:53:50] nope [12:54:02] ok cool. i'll reboot it shortly. [13:23:06] I (used to) have a printout of my GPG revocation certificate [13:45:40] kormat: you doing db2135? [13:46:20] marostegui: already did a while ago [13:46:29] something wrong? [13:46:29] ah ok, the proxy complained, I will reload it [13:46:45] ahh, i see. i did all the misc codfw primaries, fwiw. [13:46:45] fixed [13:46:56] then you probably need to reload all codfw proxies [13:47:01] I did haproxy2004 [13:47:14] yeah, 2001, 2002 and 2003 need reload [13:47:19] Just checked icinga [13:47:37] `systemctl restart haproxy`? [13:47:59] systemctl reload haproxy [13:48:02] that should be enough [13:48:33] ok cool, done. [13:49:06] recoveries seem to be arriving [13:52:59] just added dbproxy.* to the regex of hosts i watch on icinga.