[01:20:29] could someone with a working email please send a message to whatever mailing list it is that dupdet is crashing, and that anyone with knowledge of php is welcome to query Gryllida on-wiki to request access to the tool for debugging. Source of the tool: https://github.com/jamesryanalexander/Duplication-Detector. May be an issue with the php or webserver version. Thanks
[12:30:40] !log tools.wmde-graphql-demo disabled tool per T305687
[12:30:43] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.wmde-graphql-demo/SAL
[12:30:45] T305687: Archive/delete tool wmde-graphql-demo - https://phabricator.wikimedia.org/T305687
[12:31:54] !log tools.fc-importer disabled tool per T305404
[12:31:55] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.fc-importer/SAL
[12:31:56] T305404: Archive/delete tool fc-importer - https://phabricator.wikimedia.org/T305404
[12:32:38] !log tools.welcomer disabled tool per T305388
[12:32:40] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.welcomer/SAL
[12:32:40] T305388: Delete tool welcomer - https://phabricator.wikimedia.org/T305388
[12:33:40] !log tools.wlm-de-redirect disabled tool per T305377
[12:33:41] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.wlm-de-redirect/SAL
[12:33:42] T305377: Archive/delete tool wlm-de-redirect - https://phabricator.wikimedia.org/T305377
[12:34:28] !log tools.wikidiff2-dev-test disabled tool per T305376
[12:34:30] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.wikidiff2-dev-test/SAL
[12:34:30] T305376: Archive/delete tool wikidiff2-dev-test - https://phabricator.wikimedia.org/T305376
[12:35:59] !log tools.catgraph disabled tool per T305374
[12:36:02] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.catgraph/SAL
[12:36:02] T305374: Archive/delete tools catgraph, catgraph-jsonp & cgstat - https://phabricator.wikimedia.org/T305374
[12:36:22] !log tools.catgraph-jsonp disabled tool per T305374
[12:36:24] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.catgraph-jsonp/SAL
[12:36:50] !log tools.cgstat disabled tool per T305374
[12:36:51] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.cgstat/SAL
[12:38:05] !log tools.james disabled tool per T305289
[12:38:07] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.james/SAL
[12:38:07] T305289: Archive/delete tool “james” - https://phabricator.wikimedia.org/T305289
[12:39:00] !log tools.hoo-propertysuggester-test disabled tool per T303597
[12:39:02] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.hoo-propertysuggester-test/SAL
[12:39:02] T303597: Delete tool hoo-propertysuggester-test - https://phabricator.wikimedia.org/T303597
[14:15:02] !log tools.shex-simple toolforge-jobs run update --command '~/update.sh' --image tf-php74 --schedule '0 * * * *' # T305944; php74 image chosen because tf-bullseye-std doesn’t have git
[14:15:11] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.shex-simple/SAL
[14:15:30] !log tools.shex-simple crontab -r # T305944
[14:15:32] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.shex-simple/SAL
[15:28:18] !log toolhub Updated demo server to 5c2ef1
[15:28:20] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Toolhub/SAL
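
(Note: a minimal sketch of the shex-simple migration logged at 14:15 above, moving an hourly cron job to the Toolforge jobs framework. The crontab entry is a hypothetical reconstruction of what `crontab -r` removed, and the `list`/`show` verification subcommands are assumed to be part of the standard toolforge-jobs CLI.)

    # Hypothetical crontab entry the scheduled job replaces (removed by `crontab -r`):
    #   0 * * * * ~/update.sh
    # Equivalent hourly job under the Toolforge jobs framework, as logged at 14:15:02:
    toolforge-jobs run update --command '~/update.sh' --image tf-php74 --schedule '0 * * * *'
    # Verify that the job was registered and inspect its configuration (assumed subcommands):
    toolforge-jobs list
    toolforge-jobs show update
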
[16:23:02] !log tools.shex-simple updated public_html worktrees and update.sh to change master to main (public_html/master is now a symlink to main)
[16:23:04] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.shex-simple/SAL
[16:34:23] !log gitlab-runners pausing runner-1013, then will remove it and create new bullseye runner to replace it
[16:34:25] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Gitlab-runners/SAL
[17:02:51] !log gitlab-runners pausing runner-1014, then will remove it and create new bullseye runner runner-1025 to replace it
[17:02:53] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Gitlab-runners/SAL
[17:35:57] hmmm.. some issue with cinder volumes that I'm not sure how to debug
[17:36:40] I have been creating new instances in gitlab-runners and 4 out of 5 just worked and automatically got a /var/lib/docker mount in /etc/fstab
[17:36:53] also I can see the volumes in Horizon
[17:37:23] there are 10 volumes and 10 instances, though they are not mapped to instances but to the project as such
[17:37:42] now with my latest new instance I get "no volumes are available to mount"
[17:38:04] I deleted an old instance, then created a new instance to replace it, as before
[17:38:36] maybe the only difference was I did it a bit faster.. so maybe there is some race about releasing the volume
[17:40:19] maybe because I did the 'switch puppetmasters' process faster.. but there is that mechanism that prevents me from running puppet manually before bootstrap is done.. so..
[17:43:31] rebooting the instance fixed it! :)
[17:43:55] didn't have to do that before, but.. just randomly tried it. good
[17:44:32] have you tried turning it off and on again? :-P
[17:44:38] lol, precisely
[17:45:42] or of course it's possible it was just "the 5 minutes are over" and the reboot was pure coincidence.. but I'm ok with not knowing
[17:47:06] but no.. I created it 20 min ago, and yesterday I also did it within 20 min, anyways :)
[18:07:04] 'no volumes are available to mount' is a message from Horizon?
[18:07:36] mutante: ^
[18:08:21] andrewbogott: no, it's a message you see in puppet output when it tries to use "cinderutils"
[18:08:24] file: /etc/puppet/modules/cinderutils/manifests/ensure.pp
[18:08:45] Error: Could not retrieve catalog from remote server: Error 500 on SERVER ..
[18:08:52] To proceed, create and attach a Cinder volume ..
[18:09:29] that then says it's because "No mount at /var/lib/docker and no volumes are available to mount."
[18:09:38] after reboot or in the other cases.. it "just works"
[18:09:47] ok.
[18:10:03] I don't know, that sounds like it wasn't seeing the volume in lsblk, so that's what I would check (if we had a misbehaving VM now, which we don't)
[18:10:07] an instance in that state does not have "docker" in /etc/fstab
[18:10:11] while the working ones do
[18:10:36] I have a few more to do over the coming days
[18:10:47] so let's see if it happens again
[18:10:54] one out of 10 does not count yet
[18:12:13] I feel like it is about timing between deleting an old instance and creating a new one, and if you don't rush it and give it time then it's not an issue.
[18:12:25] but will try to confirm that
[18:13:13] ACK @ lsblk, thanks
[18:17:06] mutante: that puppet manifest is using wmcs-prepare-cinder-volume as far as I know, so all the interesting stuff is probably happening in there.
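
(Note: a minimal sketch of the checks discussed above for an instance stuck in the "no volumes are available to mount" state. It assumes shell access to the affected VM; the example device name and the run-puppet-agent wrapper are assumptions about the usual Cloud VPS setup, not taken from the log.)

    # Does the VM see the attached Cinder volume as an extra block device (e.g. sdb)?
    lsblk
    # Did cinderutils already write the mount into fstab, and is it actually mounted?
    grep docker /etc/fstab
    findmnt /var/lib/docker
    # If the volume only shows up after a delay, a fresh puppet run (or a reboot,
    # as above) lets cinderutils/wmcs-prepare-cinder-volume create the mount:
    sudo run-puppet-agent
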
[18:17:46] andrewbogott: *nod*, thank you
[18:23:07] Hi all, I started the reimaging of clouddb hosts for T299480, and clouddb1013 completed successfully; however there's an error about `/usr/bin/wmf-pt-kill` not found, and sure enough, it is missing on the newly reimaged clouddb1013, while I see `/usr/bin/wmf-pt-kill` on clouddb1017 which I haven't reimaged yet. I guess this script is missing from puppet? Anybody have context? No rush, the whole host is downtimed currently
[18:23:08] T299480: Upgrade clouddb* hosts to Bullseye - https://phabricator.wikimedia.org/T299480
[18:34:34] razzi: is it still gone after running puppet a second or third time?
[18:34:43] or maybe just on the first run
[18:35:31] it is fairly common that a puppet role works after the 2nd run and then it's fine
[18:35:47] except when using a cookbook and the role is already applied.. then that would fail
[18:36:52] razzi: It looks to me like that should get installed by apt. Try apt-get update and then re-run puppet?
[18:36:52] oh wait, ignore that, I found something different, there is code that absents the wmf-pt-kill stuff
[18:37:17] oh, uhoh
[18:38:13] "if $instances" then it removes it
[18:38:58] it does a lookup('profile::wmcs::db::wikireplicas::mariadb_multiinstance::instances')
[18:39:08] to then decide based on that
[18:39:19] whether it should stop and mask wmf-pt-kill or not
[18:40:27] hosts/clouddb1013.yaml:profile::wmcs::db::wikireplicas::mariadb_multiinstance::instances:
[18:40:38] clouddb1013 has its own hosts file
[18:40:42] that sets this value
[18:42:04] but the other hosts do as well..
[18:43:21] razzi: sorry for distracting with that.. do what andrewbogott said, he is right. the reason is:
[18:43:27] [apt1001:~] $ sudo -E reprepro ls wmf-pt-kill
[18:43:27] wmf-pt-kill | 2.2.20-1+wmf5 | stretch-wikimedia | amd64, i386, source
[18:43:27] wmf-pt-kill | 3.1.0-1+wmf6 | buster-wikimedia | amd64, i386
[18:43:36] if that is for bullseye.. it's just not in APT
[18:44:58] oh, the package isn't there at all? Maybe we can just copy it over.
[18:45:04] if it isn't python2 :/
[18:45:11] * taavi guesses Perl
[18:45:21] "but maybe you can copy it over" was about to type that.. but depends
[18:47:28] https://gerrit.wikimedia.org/r/q/project:operations/debs/wmf-pt-kill
[18:48:06] lib/Percona/Toolkit.pm
[18:48:59] maybe ask the owner of https://gerrit.wikimedia.org/r/c/operations/debs/wmf-pt-kill/+/585495 about it
[18:56:48] sorry, I'm in too many conversations at once. Trying to circle back to this one...
[18:59:04] razzi: T305974
[18:59:05] T305974: Provide wmf-pt-kill on Debian Bullseye - https://phabricator.wikimedia.org/T305974
[18:59:37] OK if we block on that until Manuel expresses an opinion? If you're super blocked I can try a package copy right now, but no idea if it'll work (and I'd have to do some research even to know how to test)
[19:00:37] also razzi your downtime on 1013 just expired
[19:57:10] razzi, mutante, I copied that package from the buster repo to the bullseye one and everything seems fine *shrug*
[20:00:21] andrewbogott: cool, confirmed it's on apt1001. ahh.. and just read the ticket, even a DBA already chimed in, how nice and quick
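
(Note: a minimal sketch of the package check and copy described above. The reprepro `ls` invocation is taken from the log; the `copy <dest-distribution> <src-distribution> <package>` form and the bullseye-wikimedia distribution name are assumptions inferred from the stretch-wikimedia/buster-wikimedia names shown, and the run-puppet-agent wrapper is likewise assumed.)

    # On the apt server, see which distributions already carry the package:
    sudo -E reprepro ls wmf-pt-kill
    # Copy the existing buster build into the bullseye distribution:
    sudo -E reprepro copy bullseye-wikimedia buster-wikimedia wmf-pt-kill
    # Then, on the reimaged clouddb host, refresh apt metadata and re-run puppet
    # so the package gets installed:
    sudo apt-get update
    sudo run-puppet-agent
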
[20:20:46] Can I bribe a merge of https://gerrit.wikimedia.org/r/c/labs/tools/wikibugs2/+/779112
[21:19:48] !log tools.grid-deprecation Added komla as co-maintainer
[21:19:51] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.grid-deprecation/SAL
[21:24:01] !log tools Add komla as projectadmin (T305986)
[21:24:03] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL
[21:24:03] T305986: Grant Komla Sapaty tools admin rights - https://phabricator.wikimedia.org/T305986
[21:27:07] !log tools Added komla to 'roots' sudoers policy (T305986)
[21:27:10] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL
[21:31:16] !log tools.admin Added komla as co-maintainer (T305986)
[21:31:18] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.admin/SAL
[21:31:19] T305986: Grant Komla Sapaty tools admin rights - https://phabricator.wikimedia.org/T305986
[21:32:44] !log tools Added komla to Gerrit group 'toollabs-trusted' (T305986)
[21:32:47] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools/SAL
[22:44:45] Hi there! I was looking to do some AI stuff on Toolforge/Cloud VPS. Are the servers CUDA-enabled (a.k.a. have GPUs)? I'm 99% leaning towards no but just wanted to make sure before I ask for a grant for APIs :) Thanks!
[22:47:47] Nope
[22:52:21] darn, there go my plans to run a bitcoin mining op
[22:54:01] hasn't stopped people from trying
[22:58:34] I think we only have 8 GPUs anywhere, and certainly none in Cloud VPS at this point. It turns out that finding a GPU with open source firmware is hard.