[00:06:21] 06Data-Engineering-Radar, 06Traffic: 14Lock-in Varnish and VarnishKafka versions - 14https://phabricator.wikimedia.org/T304617#9691260 (10BCornwall) 05In progress→03Resolved 14Thanks, @ssingh for the patch! [02:30:48] 10Data-Engineering (Q4 2024 April 1st - June 30th): [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator - https://phabricator.wikimedia.org/T361853#9691366 (10Ottomata) [06:05:53] (HiveServerHeapUsage) firing: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [06:42:16] 10Data-Engineering (Q4 2024 April 1st - June 30th): [Datasets Config][Spike] Understand and document the details and conflicts between Datasets Config, Refine refactor, Dynamic EventStreamConfig, and Metrics Platform Instrumentation Configurator - https://phabricator.wikimedia.org/T361853#9691672 (10gmodena) [06:42:17] 10Data-Engineering (Q4 2024 April 1st - June 30th), 10Event-Platform, 07Spike: [SPIKE] Can we express Event Platform configs in config store? - https://phabricator.wikimedia.org/T361017#9691673 (10gmodena) [07:05:53] (HiveServerHeapUsage) resolved: Hive Server JVM Heap usage is above 80% on an-coord1003:10100 - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/Hive/Alerts#Hive_Server_Heap_Usage - https://grafana.wikimedia.org/d/000000379/hive?panelId=7&fullscreen&orgId=1&var-instance=an-coord1003:10100 - https://alerts.wikimedia.org/?q=alertname%3DHiveServerHeapUsage [08:31:23] (03CR) 10Aqu: [C:03+2] Add refinery-source jars for v0.2.34 to artifacts [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1017074 (owner: 10Maven-release-user) [09:04:19] (03CR) 10Joal: "Aren't you missing the refined table schema?" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1012656 (https://phabricator.wikimedia.org/T314956) (owner: 10Gmodena) [09:06:47] (03CR) 10Joal: [C:03+1] "LGTM! Merge at will." [analytics/refinery] - 10https://gerrit.wikimedia.org/r/983926 (https://phabricator.wikimedia.org/T314956) (owner: 10Ottomata) [09:27:05] 06Data-Engineering, 13Patch-For-Review: Extract refine schema management into a dedicated tool - https://phabricator.wikimedia.org/T356762#9692034 (10JAllemandou) I prefer the "by functionality" organization, for separating schema vs data code. I think we need the 2 different functions to make the Iceberg one... [09:28:59] 06Data-Engineering, 06Data-Platform-SRE, 10Scap, 07git-lfs, and 2 others: analytics/refinery: Stop using git-fat - https://phabricator.wikimedia.org/T328472#9692043 (10JAllemandou) Thank you so much @hashar for unblocking us! [10:10:00] 14Analytics, 06Data-Engineering-Icebox, 10CX-analytics, 10Language-analytics, and 2 others: Special:ContentTranslationStats is slow and getting crowded - https://phabricator.wikimedia.org/T325790#9692181 (10Pginer-WMF) [14:20:03] brouberol, btullis o/ [14:20:20] if you have a minute lemme know what you think about https://phabricator.wikimedia.org/T353705 [14:20:51] I left a proposed plan a while ago but the task is in watching :D [14:21:04] I can take care of it, very quick, but I'd need a green light [14:43:21] btullis is out, but I can have a look at it [14:43:31] thanks! [14:45:28] when you say "all the clusters except Wikikube have /64s allocated, so we should drop them and allocate new /116 instead": what happens to the pods while we do so? [14:46:03] my expectation would be: nothing, and then we get IPv6 within the new /116 pool when we redeploy, but that might not be the case [15:03:58] ah yes sorry I forgot to mention - ipv6 is only enabled in wikikube [15:04:20] so this is just to be able to enable ipv6 via hiera for DSE if we want in the futue [15:04:23] *future [15:04:34] if we do it with the current settings the k8s control plan will refuse [15:13:07] brouberol: --^ [15:23:00] {{done}}, it is easily revertable in case [15:45:31] (03PS2) 10Xcollazo: WIP: Clean up and parameterize SQL code for Common Impact Metrics. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1016796 (https://phabricator.wikimedia.org/T358681) [15:52:42] (03PS3) 10Aleksandar Mastilovic: Adding report updater CX queries [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1002588 (https://phabricator.wikimedia.org/T356424) [15:54:00] ooh, gotcha [15:54:09] (sorry, I was interrupted by baby-oncall) [15:55:42] I just approved https://gerrit.wikimedia.org/r/c/operations/puppet/+/1017311 [15:56:08] ahahha yes don't worry, those interrupts take priority :) [15:56:10] thanks for the review! [15:57:07] for sure [16:02:21] (03Abandoned) 10Aleksandar Mastilovic: Adding report updater CX queries [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1002588 (https://phabricator.wikimedia.org/T356424) (owner: 10Aleksandar Mastilovic) [16:02:56] (03PS2) 10Aleksandar Mastilovic: WMCS HQL scripts [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1017161 [16:21:39] 10Data-Engineering (Q4 2024 April 1st - June 30th): [Dataset Config Store] Setup initial CI checks - https://phabricator.wikimedia.org/T357468#9693274 (10tchin) Config store repo does CI checks for jsonschema correctness and config values against its jsonschema. The Datasets Config service repo has dockerized CI... [16:46:06] 10Data-Engineering (Q4 2024 April 1st - June 30th): [Dataset Config Store] Setup initial CI checks - https://phabricator.wikimedia.org/T357468#9693328 (10lbowmaker) I think if we organized files by dataset we discussed at one point requiring things like config for deleting data after a certain time (for example)... [16:47:16] (03PS3) 10Milimetric: WIP: Clean up and parameterize SQL code for Common Impact Metrics. [analytics/refinery] - 10https://gerrit.wikimedia.org/r/1016796 (https://phabricator.wikimedia.org/T358681) (owner: 10Xcollazo) [17:34:36] 06Data-Engineering, 06Data-Platform-SRE, 10Scap, 07git-lfs, and 2 others: analytics/refinery: Stop using git-fat - https://phabricator.wikimedia.org/T328472#9693446 (10hashar) Do celebrate @dancy who has done the first pass of the migration and reviewed+deployed the patches I have sent :-] Hopefully that... [17:38:09] 10Data-Engineering (Q4 2024 April 1st - June 30th): Replace service runner with a simplified library to better support metrics and debugging - https://phabricator.wikimedia.org/T360924#9693464 (10Ottomata) Curious! What's the status on collaboration with rest of org on NodeJS services and library support? IIUC... [17:39:04] 06Data-Engineering, 06Web-Team-Backlog: Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962 (10KSarabia-WMF) 03NEW [17:45:39] 06Data-Engineering, 13Patch-For-Review: Extract refine schema management into a dedicated tool - https://phabricator.wikimedia.org/T356762#9693483 (10Ottomata) > I prefer the "by functionality" organization Yap cool with me. Let the namingbikeshed begin. > I think we need the 2 different functions to make th... [17:46:25] 06Data-Engineering, 06Data Products, 10Observability-Logging, 06Traffic, 13Patch-For-Review: Move analytics log from Varnish to HAProxy - https://phabricator.wikimedia.org/T351117#9693484 (10Ottomata) Very cool! [20:05:54] (03PS4) 10Mforns: Productionize CommonsCategoryGraphBuilder for CIM project [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1015013 (https://phabricator.wikimedia.org/T358681) [20:07:11] (03CR) 10Mforns: "This code works and is the most optimized so far. It is still missing lots of comments to explain the details hidden behind scala's compac" [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1015013 (https://phabricator.wikimedia.org/T358681) (owner: 10Mforns) [21:07:09] 06Data-Engineering, 06Web-Team-Backlog, 13Patch-For-Review: Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962#9694154 (10KSarabia-WMF) @phuedx - @jwang and I were discussing whether it is possible to accurately capture the correct sample rate instead of returning N... [22:12:21] 06Data-Engineering, 06Movement-Insights, 06Product-Analytics, 06Research-Freezer: Investigate relation of UA deprecation to increase in automated traffic and reduction in unique devices - https://phabricator.wikimedia.org/T336715#9694304 (10Mayakp.wiki) > -- [x] Dig into our data on user traffic and how...