[00:05:37] !log tools.bash Updated to ff8328d [00:05:38] Logged the message at https://wikitech.wikimedia.org/wiki/Nova_Resource:Tools.bash/SAL [01:25:24] legoktm: I see some task/patches around codesearch mention disk space concerns. Anything I can do to help with that? Do we need to reprovision to a larger? I guess nowadays we use detacheable volume, so maybe it can be resized or we can move from one mount to another and temporarily attach 2 or something? [01:26:04] did it run out of space again? [01:26:09] no, not yet :) [01:26:37] but it seems there is hesitancy to add "a lot" more repos, which I've personally ignored several times already [01:26:43] but not sure how many times to do that. [01:26:48] https://grafana-labs.wikimedia.org/d/000000059/cloud-vps-project-board?orgId=1&var-project=codesearch&from=now-30d&to=now&var-server=codesearch8 [01:27:01] 18GB left, using ~36GB [01:28:48] the main issue is that git repos get larger over time (duh) and said growth is multiplied by the number of search profiles the repo is indexed in. I personally wouldn't be hesitant in adding new repos as long as someone or some monitoring is keeping an eye on disk space [01:29:03] I think the simplest and best option is to throw more disk space at it [01:30:02] probably some upstream issues could be filed with hound like running `git gc` or redoing the initial shallow clone ocassionally [01:32:33] (AFAIK WMCS is not short on disk space, and given the value of codesearch vs maintainer time, it seems reasonable to ask for more space) [01:33:58] it's only short of disk space in the right places [01:39:42] /me seconds the immense value of codesearch (as if there were any doubts about that ^^) [03:04:21] What's the difference between `login.toolforge.org` and `dev.toolforge.org`? [03:04:55] It seems to be the same SSH host. [03:12:45] shouldn't be [03:14:16] login.toolforge.org goes to tools-sgebastion-10, dev.toolforge.org is -11 [03:15:13] there's nothing significantly different between the two hosts, except that if you slow dev. down fewer people will complain immediately [03:17:13] e [03:17:14] thanks [13:55:43] I am deploying a continuous deployment. I tried to run a script directly on the tool account through SSH, but found that I could not enter the tool directory. What's going on? [13:55:43] ``` [13:55:45] ssh login.toolforge.org become new-userbox-log bash -c "cd src && pwd" [13:55:46] /mnt/nfs/labstore-secondary-tools-home/xtex``` [13:57:16] add another set of quotes (i.e. `bash -c '"cd src && pwd"'`) [13:57:19] because ssh is awesome [14:01:11] thanks [14:01:56] also use /data/project/ instead of the /mnt/nfs/... patch, the latter one is an implementation detail [14:02:47] okay, thanks [17:32:25] I'm debugging a weird situation on labswiki around the "Redirect fixer" user [17:32:27] $ mwscript sql.php --wiki labswiki [17:32:27] > SELECT * FROM user WHERE user_name='Redirect fixer'; [17:32:32] ( [17:32:32] [user_id] => 31136 [17:32:33] [user_name] => Redirect fixer [17:32:42] So this exists, clearly, and even has non-zero edits [17:32:56] $user = User::newFromName( 'Redirect fixer' ); [17:33:00] > echo $user->getName(); [17:33:00] Redirect fixer [17:33:04] > echo $user->isItemLoaded('id'); [17:33:04] 1 [17:33:08] > echo $user->getId(); [17:33:08] 0 [17:33:11] isRegistered(); false [17:33:22] hence addToDatabase() fails with a key conflict [17:35:29] it seems when load() is lazily triggered, it ends up convinced the user doesn't exist and so is fully loaded / like an anon. [17:35:37] > SELECT * FROM actor WHERE actor_name='Redirect fixer'; [17:35:38] [actor_id] => 14640 [17:35:38] [actor_user] => [17:35:38] [actor_name] => Redirect fixer [17:36:23] Dont' we have User::newSystemUser( 'usernaname', [ 'steal' => true ] ) [17:36:25] for stuff like this [17:36:40] https://wikitech.wikimedia.org/wiki/Special:Contributions/Redirect_fixer [17:36:46] edits are not imported either [17:37:02] it seems like they were made as recnet as March 2022 in a way that acted as a named anon user somehow. [17:39:05] Reedy: looks like steal isn't effective either [17:39:08] gets fooled by the same bug [17:43:44] only 2 results for this query: [17:43:57] > SELECT * FROM actor WHERE actor_user IS NULL AND actor_name RLIKE '^[A-Z]' AND actor_name NOT LIKE '%>%' LIMIT 10; [17:43:57] [actor_id] => 14640 [actor_user] => [actor_name] => Redirect fixer [17:43:57] [actor_id] => 37817 [actor_user] => [actor_name] => Unknown user [17:48:44] Depending on whether the "bug" is the logic in User::load or the fact that such rows can be created in the first place - if we believe the former, then it was introduced in https://gerrit.wikimedia.org/r/c/mediawiki/core/+/525605 where the case for load by 'name' was changed from using the `user` table to using the `actor` table as definitive and only source. [18:09:57] write up at T326431 [18:09:58] T326431: Some system users have invalid 'actor' database rows - https://phabricator.wikimedia.org/T326431 [18:10:43] I'm going to fix labswiki manually for now