[19:16:26] taavi: quarry nfs failed again [19:22:31] again? [19:23:16] mhm [19:23:29] T302154 [19:23:30] T302154: quarry-nfs-1 went down; quarry is offline - https://phabricator.wikimedia.org/T302154 [19:23:35] !log quarry hard rebooted quarry-nfs-1 again T302154 [19:23:40] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Quarry/SAL [19:24:53] andrewbogott: I think at least one of your shiny new nfs servers has a kernel bug :( [19:25:13] quarry broke again, huh? [19:25:17] yeah [19:25:25] Let me try rebuilding it with Buster and we'll see if that goes better [19:30:08] In theory there's a ready-made process for migrating to a new server. Let's see if it works! [19:31:51] I love the phrase "in theory" :') [19:32:08] I tested it quite a bit but not with a running service :) [19:32:25] taavi: In case you ever wind up wanting to mess with this yourself, the runbook I'm following is https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Runbooks/Create_an_NFS_server#Create_a_replacement_server_for_an_existing_service [19:36:25] woo, proper docs! [19:38:27] yeah! I started to write those a few days ago and discovered I had already written them [19:44:15] taavi: anything we should do before I try the failover? [19:49:50] !log quarry moving nfs service from quarry-nfs-1 (bullseye) to quarry-nfs-2 (buster), testing to see if T302154 is a kernal or nfs-version issue [19:49:53] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Quarry/SAL [19:49:54] T302154: quarry-nfs-1 went down; quarry is offline - https://phabricator.wikimedia.org/T302154 [20:04:53] ok, that was shaky but it looks like things are working now [23:03:58] !log tools.bridgebot Bridge #wikimedia-ve to Telegram (T299326) [23:04:03] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.bridgebot/SAL [23:53:03] !log tools.toolinfo-scraper Update to 922ce7c (T294142) [23:53:07] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.toolinfo-scraper/SAL [23:53:26] andrewbogott taavi: can you give me a ping if this quarry issue happens again? [23:54:22] I would also like to be pinged :)