[04:29:25] (PS3) Santhosh: WIP - Community-defined campaign translations [research/recommendation-api] - https://gerrit.wikimedia.org/r/1059945 (https://phabricator.wikimedia.org/T371515) (owner: Eamedina)
[09:12:14] (CR) Klausman: [C:+1] langid: match python module usage with other isvcs [machinelearning/liftwing/inference-services] - https://gerrit.wikimedia.org/r/1060471 (https://phabricator.wikimedia.org/T369344) (owner: Kevin Bazira)
[09:34:47] Machine-Learning-Team, DC-Ops, ops-codfw: hw troubleshooting: Memory issues (ECC) with ml-serve2004.codfw.wmnet - https://phabricator.wikimedia.org/T372036 (klausman) NEW
[10:07:10] (CR) Kevin Bazira: [C:+2] "Thanks for the review :)" [machinelearning/liftwing/inference-services] - https://gerrit.wikimedia.org/r/1060471 (https://phabricator.wikimedia.org/T369344) (owner: Kevin Bazira)
[10:14:17] (Merged) jenkins-bot: langid: match python module usage with other isvcs [machinelearning/liftwing/inference-services] - https://gerrit.wikimedia.org/r/1060471 (https://phabricator.wikimedia.org/T369344) (owner: Kevin Bazira)
[11:08:33] * klausman lunch
[13:29:21] Machine-Learning-Team, MW-1.43-notes (1.43.0-wmf.17; 2024-08-06), OKR-Work: Deploy Modernized Recommendation API to LiftWing - https://phabricator.wikimedia.org/T371465#10051178 (kevinbazira) @santhosh, thank you for adding the cxserver host header config. We deployed the new image and tested the sec...
[13:35:12] Machine-Learning-Team, MW-1.43-notes (1.43.0-wmf.17; 2024-08-06), OKR-Work: Deploy Modernized Recommendation API to LiftWing - https://phabricator.wikimedia.org/T371465#10051222 (klausman) One thing that occurred to me: in filesystem-land, many languages/stdlibs have a path joining function (in Pytho...
[13:48:57] Machine-Learning-Team, Automoderator, Moderator-Tools-Team: Use multilingual revert risk model in Automoderator on supported wikis - https://phabricator.wikimedia.org/T365581#10051273 (Samwalton9-WMF) @diego I'd like to continue talking about the possibility of using RRML for Automoderator. When we d...
[14:04:26] Good morning all
[14:08:50] o/ hi from Katowice :D
[14:25:15] o/
[14:25:37] klausman: thanks for the reviews :)
[14:25:37] the langid pod is up and running in staging: https://phabricator.wikimedia.org/P67245#269285
[14:25:37] going to deploy to prod
[14:25:58] :+1:
[15:00:38] Machine-Learning-Team, Automoderator, Moderator-Tools-Team: Use multilingual revert risk model in Automoderator on supported wikis - https://phabricator.wikimedia.org/T365581#10051580 (diego) Hi @Samwalton9-WMF , we choose RRLA because it was more stable, but since then, we made some updates to RRML...
[15:04:26] Machine-Learning-Team: Upgrade Knative control plane Docker images to Bullseye/Bookworm - https://phabricator.wikimedia.org/T368359#10051603 (elukey) Open→Resolved
[15:06:33] Machine-Learning-Team, Automoderator, Moderator-Tools-Team: Use multilingual revert risk model in Automoderator on supported wikis - https://phabricator.wikimedia.org/T365581#10051610 (diego) @Samwalton9-WMF , just keep in mind that the scores from RRML and RRLA are different. This means that you may...
[15:18:14] Machine-Learning-Team, DC-Ops, ops-codfw, SRE: hw troubleshooting: Memory issues (ECC) with ml-serve2004.codfw.wmnet - https://phabricator.wikimedia.org/T372036#10051654 (Jhancock.wm) a:Papaul→Jhancock.wm @klausman I'm onsite and can do a DIMM swap on this if you have time.
[15:22:04] Machine-Learning-Team, DC-Ops, ops-codfw, SRE: hw troubleshooting: Memory issues (ECC) with ml-serve2004.codfw.wmnet - https://phabricator.wikimedia.org/T372036#10051680 (ops-monitoring-bot) Icinga downtime and Alertmanager silence (ID=773e931e-1862-4d24-b0bd-52100c4ad9bb) set by klausman@cumin20...
[15:22:53] Lift-Wing, Machine-Learning-Team: Request to host article-country model on Lift Wing - https://phabricator.wikimedia.org/T371897#10051683 (Isaac)
[15:23:59] (PS1) Jsn.sherman: FetchScoreJob: use setLastError() for job errors [extensions/ORES] - https://gerrit.wikimedia.org/r/1060871
[15:28:38] (CR) Jsn.sherman: "While working on something else, I noticed that ORES score fetch errors were not getting passed to the job runner log. This little patch j" [extensions/ORES] - https://gerrit.wikimedia.org/r/1060871 (owner: Jsn.sherman)
[17:17:02] (CR) Scardenasmolinar: [C:+2] "I have tested this locally and it works as expected! Thanks for adding this." [extensions/ORES] - https://gerrit.wikimedia.org/r/1060871 (owner: Jsn.sherman)
[17:34:24] (Merged) jenkins-bot: FetchScoreJob: use setLastError() for job errors [extensions/ORES] - https://gerrit.wikimedia.org/r/1060871 (owner: Jsn.sherman)
[18:18:13] Machine-Learning-Team, ORES, Discovery-Search, Growth-Team: Investigate what would be required to include countries in ORES and accessible via a search keyword - https://phabricator.wikimedia.org/T301671#10052411 (Isaac) @EBernhardson I want to re-invigorate this task as I have started to make pr...
[19:27:52] Machine-Learning-Team, DC-Ops, ops-codfw, SRE: Q1:rack/setup/install ml-serve20[09-11] - https://phabricator.wikimedia.org/T371920#10052625 (Jhancock.wm) @klausman these servers are ready!
[19:29:51] Machine-Learning-Team, DC-Ops, ops-codfw, SRE: Q1:rack/setup/install ml-serve20[09-11] - https://phabricator.wikimedia.org/T371920#10052622 (Jhancock.wm) Open→Resolved