[10:13:48] !help users reporting ssh and other stuff on beta slow (also not sure if related but tools-checker went off) [10:13:48] If you don't get a response in 15-30 minutes, please create a phabricator task -- https://phabricator.wikimedia.org/maniphest/task/edit/form/1/?projects=wmcs-kanban [10:34:08] RhinosF1: yes sorry, we are doing a ceph intervention that went the non-expected way [10:34:53] dcaro: np, we managed to get beta back working [10:35:59] things should be more or less stable right now, though might slow down a bit in a few min (we are adding a node to the cluster, but with a new network setup that showed trickier, and triggered a whole re-shuffling of data around the cluster) [10:36:41] dcaro: np, should we keep an eye out incase of any more crashes? [10:36:43] that extra load made VMs io too slow [10:37:08] we are looking also, but an extra pair of eyes is always welcome :) [10:38:06] np np [10:38:28] btw. where did the users report issues? (making sure I'm in the right channels) [10:39:02] dcaro: -releng [10:39:15] I'm there xd /me looks [10:39:15] TheresNoTime was first [10:39:51] the beta DB went offline for a bit so it did the expected and locked itself just in case [11:18:28] dcaro: i assume that's ceph again [11:26:36] yep, we are still doing things, what are you seeing? [11:26:36] dcaro: CI started being very upset. beta is also loosing edits for some reason at the moment. [12:06:56] RhinosF1: jbond things should be a bit better now, can you verify? [12:42:55] dcaro: yes believe so [13:05:16] dcaro: sorry was in an interview checking now [13:07:08] dcaro: lgtm [13:57:30] !log admin decommissioning cloudcontrol1003 + cloudcontrl1004. I backed up $home in case anyone needs their files. [13:57:33] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Admin/SAL [14:25:47] **fyi** s2 replication to cloud world is broken [14:40:23] ^ has been fixed [16:55:01] !log tools restart puppetdb on tools-puppetdb-1, crashed during the ceph issues [16:55:03] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL [17:16:19] !log tools.stashbot Added Cwhite as co-maintainer; removed 20after4 as co-maintainer [17:16:21] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.stashbot/SAL [18:27:14] If your tools use the externallinks tables from the wiki replicas, check out T312666 which is proposing some schema changes to reduce the size of this table across the wikis. Constructive feedback is welcome! [18:27:14] T312666: Remove duplication in externallinks table - https://phabricator.wikimedia.org/T312666