[13:32:32] Amir1: can I get a few minutes of your time to look at labtestwikitech? I migrated it to a new db server (as per your request on T310795) but am still getting the same 'has gone away' failure that I started with :) [13:32:32] T310795: Revive Labtestwikitech (formerly: Abolish labtestwikitech) - https://phabricator.wikimedia.org/T310795 [13:33:40] app server is cloudweb2002-dev.wikimedia.org, db server is clouddb2002-dev.codfw.wmnet [13:33:47] andrewbogott: does it have the grants with the new password? [13:34:04] yes, we double- and triple-checked connectivity and permissions. [13:34:20] Also the logs aren't showing a grant failure, it's something more like an unexpected disconnect. [13:34:46] "Aborted connection 2666 to db: 'unconnected' user: 'unauthenticated' host: 'cloudweb2002-dev.wikimedia.org' (This connection closed normally without authentication)" [13:34:58] so seems like mediawiki misbehaving [13:35:50] hmm, trying to figure it out [13:35:54] thank you! [13:36:05] it can be ferm needing update [13:37:50] pretty sure it's not a connectivity issue because cli mysql works fine on the app server [13:37:59] but please double-check my work :) [13:42:01] also feel free to scap pull or alter config or whatever on cloudweb [14:23:28] Amir1: s2 went again [14:23:30] PROBLEM - MariaDB Replica SQL: s2 on dbstore1007 is CRITICAL: CRITICAL slave_sql_state Slave_SQL_Running: No, Errno: 1054, Errmsg: Error Unknown column tl_namespace in field list on query. Default database: itwiki. [Query snipped] [14:24:41] I check it [14:24:41] s4 also has ~13 hour lag [14:24:51] That's intentional [14:24:59] one thing at a time [14:27:28] ok, i see why for s4 [15:22:10] sigh, it would be nice if any given swift cluster could have a full complement of working drives :( [15:38:38] or they boot at first try [15:38:40] :D [15:50:12] I'm not that optimistic ;-) [16:25:01] Amir1: what's your gerrit username, please? I want to tag you in a CR... [16:25:57] ladsgroup [16:26:28] that looks to be a gmail address not a wmf one? [16:27:01] Yeah, it's right though [16:27:04] Lots of people do that [16:27:35] OK [17:28:15] Amir1: no luck? [18:48:27] ok, back now, I check Empero.r and Andrew's issues [18:54:03] Emperor: you're now an owner, do you wanna try again?