[05:50:41] I am going to switch sanitarium masters in s4 codfw, so going to stop s4 codfw for a bit [05:50:44] (the master that is) [06:02:00] All done, I am going to check that the data is being sanitized correctly [07:52:34] I see still some swift nodes need rebooting for T310483, le sigh [08:01:16] (PrometheusMysqldExporterFailed) firing: Prometheus-mysqld-exporter failed (db2156:9104) - https://grafana.wikimedia.org/d/000000278/mysql-aggregated - https://alerts.wikimedia.org/?q=alertname%3DPrometheusMysqldExporterFailed [08:06:16] (PrometheusMysqldExporterFailed) firing: (2) Prometheus-mysqld-exporter failed (db2156:9104) - https://grafana.wikimedia.org/d/000000278/mysql-aggregated - https://alerts.wikimedia.org/?q=alertname%3DPrometheusMysqldExporterFailed [08:09:24] ^ that's me [08:09:30] I am setting up new hosts [08:16:16] (PrometheusMysqldExporterFailed) firing: (2) Prometheus-mysqld-exporter failed (db2156:9104) - https://grafana.wikimedia.org/d/000000278/mysql-aggregated - https://alerts.wikimedia.org/?q=alertname%3DPrometheusMysqldExporterFailed [09:46:17] I belive that is the case, because the 2 runners are not from the US [09:47:11] (I don't think it is a bad thing, given they are recorded and ones doesn't need to attend all of them) [10:21:37] We don't have any public holidays here until October 😢 [10:40:14] jynus: I can join that [10:40:21] It is at 18:00 CEST right? [10:40:59] you don't have to go if it is outside of your working hours, I am sure it will be recorded [10:41:44] e.g. maybe between you and Amir1 at least 1 can attend live [10:41:55] jynus: I will attend live :) [10:42:22] and note you are not the only one to mention the schedule is not great [10:42:30] I will attend as well [10:46:16] (PrometheusMysqldExporterFailed) resolved: Prometheus-mysqld-exporter failed (db2156:9104) - https://grafana.wikimedia.org/d/000000278/mysql-aggregated - https://alerts.wikimedia.org/?q=alertname%3DPrometheusMysqldExporterFailed [10:46:25] marostegui: backporting the rc optimizer hand-holding change. Stay tuned [10:46:31] XD [12:43:41] so I though I was doing a trivial alter on my misc service [12:44:02] but I just realized that the table has 274 million rows [12:44:19] maybe time to cleanup old backup records? [12:45:17] I remember we had a task to implement a clean up script [12:45:42] let me try to find it [12:46:28] T205627 I think [12:46:29] T205627: Purge and monitor old metadata for the mariadb backups database - https://phabricator.wikimedia.org/T205627 [12:46:37] yeah that one [12:48:30] I think I will look into it, if not deletion, at least move historical records to a separate table [12:50:08] I think I may create lag on m1 replicas because of this big alter in some minutes [12:50:20] that's fine, just downtime them [12:50:34] doing for 1 hour [12:58:13] Amir1: we are 1.38 now [12:58:24] also we are struggling with the actor migration [12:58:48] it could take months on miraheze and are setup is not designed for that [13:20:12] Query OK, 0 rows affected (40 min 49.843 sec) [13:20:28] I think it was the fact that it was compressed + online that took so long [13:26:03] Should be finished in 1 hour, I have downtime'd the alerts for a couple: https://grafana.wikimedia.org/goto/9j3thSenz?orgId=1 [15:00:35] I'm fixing some misc drifts live, you might see some large writes on s7 and s5 [15:31:01] marostegui: would you have 5 minutes of time for a feedback question tomorrow morning? [15:32:27] sure thing [15:33:50] I would want Amir's opinion, too, as it is about presenting backup statistics, although probably not in (my) morning :-) [15:44:05] jynus: let me know when you want it :) [15:44:15] marostegui: I brought gifts for you T312028 [15:44:16] T312028: Random drifts on s3 part 2 - https://phabricator.wikimedia.org/T312028 [15:45:54] esus [15:45:55] Jesus [15:46:00] I will get them done before the switchover [15:46:17] Or at least...try to :p [15:46:40] :D [15:50:19] Amir1: I assumed the first part field-unsigned-mismatch means that `bp_user` int(11) NOT NULL, needs to be unsigned right? [15:50:38] yeah [15:50:41] oki [15:51:18] I think it was probably even requested but bot_passwords doesn't exist in every wiki (only non-SUL wikis) [16:01:07] Amir1: where can I find the canonical definitions for the tables? [16:01:14] So I don't have to bother you about the changes :p [16:01:24] marostegui: https://github.com/wikimedia/mediawiki/blob/master/maintenance/tables-generated.sql [16:01:27] I remember tables.sql was deprecated right [16:01:28] There yo go [16:01:30] Thanks <3 [16:03:26] Thanks for fixing them!