[06:56:53] sigh, I woke up a bit late, Doing topology changes now [07:21:38] what's wrong with clouddb1021? any task where I can read? [08:11:42] marostegui: what is wrong about it? [08:12:03] it is not accessible from cumin [08:12:56] I will create a task [08:13:30] actually any of them are anymore [08:13:55] it is related to ipv6 I think [08:14:16] yeah, I can't connect to it either [08:16:28] marostegui: okay if I schedule switchover tickets for the next week? [08:16:32] s1 and s3 [08:16:38] sure [08:16:43] Awesome. Thanks [08:16:49] I have created https://phabricator.wikimedia.org/T323550 [08:19:34] the issue is that mysql_root_clients (which is used for the firewall rules) is for ipv4 only, so you can't connect via ipv6 [08:19:46] but a better question is why do those hosts suddenly have ipv6 records [08:20:27] taavi: Thanks for the insights. Yeah, I don't know what could have changed lately (as I have been away pretty much since Wed past week) [08:21:04] I note that maintain-dbusers is also failing because the grants are for ipv4 only [08:21:51] taavi: Yeah, we don't have grants for ipv6 https://phabricator.wikimedia.org/T270101 [08:28:00] https://wikis.world/@nixCraft@mastodon.social/109385543340690248 xD [08:30:12] :) [08:30:19] 6a) restore costs extra [08:49:09] quick reminder: I'm going to reboot cumin2002 at 9:00 UTC, i.e. in ten minutes [09:10:20] moritzm: let me know when finished [09:15:31] jynus: reboot is done, the only thing missing is to rearm the keyholder for homer, which should also be done soon [09:15:40] but cookbooks etc can resume [09:27:14] thanks [10:28:39] I am going to test a new version of transfer.py on cumin2002, please don't use it for any transfer in the following 2 hours or so [10:32:18] jynus: can we use it on 1001 though?= [10:32:23] yep [10:32:32] technically you can use it on cumin2002 too [10:32:40] but I cannot guarantee yet it will work [10:32:58] I am checking I haven't broken compatibility and options [10:33:13] no worries, will stick to 1001 [10:36:56] I will got you a manpage too, what is this, the 80s!?? [10:39:07] did you upload transfer.py to freshmeat.net too? [10:40:13] I don't get that reference, was that like softonic? [10:41:35] https://www.rigacci.org/docs/biblio/online/debian_survival/freshmeat-page.png [10:41:38] don't remember this site? [10:42:16] that was part of the conglomerate of slashdot, maybe? [10:42:25] I think sourceforge owned it [10:42:28] but I don't remember it [10:51:46] in any case, this is the highlights on the changelog: https://gerrit.wikimedia.org/r/c/operations/software/transferpy/+/859455/2/debian/changelog [11:16:48] marostegui: if you were attached to your cumin1001 screen session, I may have detached accidentally (screen itself wasn't affected) [11:16:55] *detached you [11:17:08] jynus: nope, you didn't apparently [11:17:09] I got confused between my and your screen session, that's all [11:17:46] and forced the -d later to realize I was on the wrong session [12:59:34] marostegui: I am now confident enough to recommend using transfer.py on cumin2002 [13:00:09] I will leave it like that for a day so it has a full rotation of backups using it [13:00:18] and then upgrade cumin1001 too [13:00:40] cool jynus [13:01:23] it should not change anything for you, but you know how it is - it has so many dependencies that it is difficult to test every running part [14:18:46] would tomorrow morning be okay to reboot cumin1001 from a backup and DB perspective? [14:19:15] moritzm: From a DB point of view, maybe, I will only be able to tell you tomorrow morning :) [14:26:06] okay, given that most people tend to use cumin1001 over 2002 and if it's fine for Jaime I'd announce something like 10 UTC to the ops mailing list? Then we have time in the morning to figure out whether we can proceed or not? [14:30:11] let me give you an approximate time on when backups finish [14:30:30] ack [14:31:36] moritzm: I think 10 UTC should be fine for me, but I will be able to tell you tomorrow morning [14:32:06] 6h30 - 19h should be backup-free on eqiad [14:34:16] ok, thanks. I'll send a mail announcing a reboot for tomorrow 10 UTC; and if that needs to be moved, so be it [14:36:07] I always love dropping stuff https://grafana.wikimedia.org/d/000000377/host-overview?orgId=1&var-server=db1122&var-datasource=thanos&var-cluster=wmcs&from=1669084515584&to=1669127715584&viewPanel=28 [14:48:32] moritzm: I have seven screens in cumin1001 right now :P [14:48:39] but it should be done by tomorrow [14:49:01] ack :-) [16:38:33] hmm, it's the combination of doing alter on externallinks plus dropping a field with forty bytes from index. Commons is 70GB smaller because of that change only https://grafana.wikimedia.org/d/000000377/host-overview?orgId=1&var-server=db2119&var-datasource=thanos&var-cluster=wmcs&from=1667972777502&to=1668222126671&viewPanel=28 [16:38:40] the rest is still to be done