[00:03:00] 10Analytics-Clusters, 10Patch-For-Review: Move Superset and Turnilo to an-tool1010 - https://phabricator.wikimedia.org/T268219 (10razzi) Ok, this is done, with one last bit of cleanup: I'd like to rename role::analytics_cluster::ui::dashboards to role::analytics_cluster::ui::superset since it's only hosting su... [06:54:56] hello folks [06:55:20] I was reading the linkedin blog post for hadoop, and I was wondering if we have a breakdown of RPC calls for the namenode [06:55:32] READ vs WRITE etc.. [07:02:14] Hi elukey [07:02:54] I don't think we currently have that info - but you're the expert :) [07:09:16] bonjour joal, I see that there is a rpc detailed that should be exported by jmx, but I am trying to use jconsole on an-master1001 and it seems not working anymore [07:09:19] ufffffffff [07:09:35] :( [07:09:39] it would be great to have a breakdown [07:09:52] the Observer Namenode could be a nice change for the cluster [07:11:52] elukey: Possible, depending on how many reads vs rights (I get your need for the split now :) [07:13:54] :D [07:14:27] elukey: I got stuck in some hadoop logging issues - /me has learnt quite some in that regard in the past days [07:17:58] anything interesting for the cluster? [07:18:26] not really - mostly intricate hadoop-application related logging [07:19:42] and, I'll have confirmation in about 1h, but I think I got the full gobblin feature working :) [07:20:38] ack :) [07:20:48] I am seeing A LOT of interesting metrics for RPC [07:21:28] \o/ [07:21:32] for example, for port 8040 GetBlockReportsOps [07:21:40] ehehe [07:21:43] sorry, BlocReportsOps [07:21:49] and avg time [07:22:17] GetServiceStatus is probably what zkfc uses [07:23:27] but yes there is a nice breakdown as well [07:23:49] I think that it would be valuable before the maintenance [07:55:19] 10Analytics, 10Analytics-Kanban, 10Analytics-Wikistats: Top edited pages list on enwiktionary contains nonexistent pages with titles made up of question marks - https://phabricator.wikimedia.org/T284623 (10JAllemandou) I ran a query and found that the problem is due to some page-titles not being correctly de... [10:56:40] 10Analytics-Radar, 10Product-Analytics, 10Product-Data-Infrastructure, 10Language-Team (Language-2021-April-June): All events in the contenttranslationabusefilter data stream failing validation - https://phabricator.wikimedia.org/T283872 (10ngkountas) a:03ngkountas [12:26:29] Ok - working version of gobblin is here: https://github.com/jobar/gobblin/tree/wmf [12:26:54] this version needs the new release of WikimediaEventUtilities (0.7.0) [12:27:38] It has been tested successfully on webrequest and eventlogging - currently doing eventstream (for use of EventStreamConfig as topic list source) [12:35:29] wow [12:40:39] this took longer than expected elukey, but here we are :) [13:09:01] great work :) [13:09:35] elukey: there are a couple of changes I made for our version that I'll submit upstream - I think they'll be valuable [13:10:11] hm - not working with event-stream - trying to fix [13:20:08] joal: <3 [13:20:30] if I can add an extra, I'd be soooo happy if gobblin worked with TLS since the beginning :D [13:20:43] but it can be done later on of course [13:20:59] elukey: full TLS on kafka for all streams [13:21:13] woooooowwwwwwwwwwwwwwwwww [13:21:16] * elukey dances [13:21:36] elukey: let's triple check and validate first :) [13:23:14] 10Analytics, 10Better Use Of Data, 10Event-Platform, 10Product-Data-Infrastructure, 10Platform Team Initiatives (Modern Event Platform (TEC2)): Allow disabling/enabling configured streams via wgEventStreams config - https://phabricator.wikimedia.org/T259712 (10Ottomata) T284620 is about removing a stream... [13:23:42] nice! [13:32:01] elukey: o/ qq [13:32:18] why do we have choices in kafka cookbook cli opts? [13:32:32] of kafka cluster names / cumin aliases [13:32:33] just for extra safety? [13:32:55] it just means we have to manually keep in sync multiple definitions of list of kafka clusters [13:34:20] ottomata: o/ I am not getting the question, the cumin aliases are needed to target a specific cluster [13:34:41] cumin aliases yes, but does the cookbook need to restrict possible values? [13:35:06] i could, but since the cookbook hasn't been updated with that option [13:35:17] i have to make a patch to make the cookbook's list of possible cluster choices up to date [13:36:02] yes yes now I get it, it is true but I am not sure if it is possible to get the list somewhere else [13:36:09] do we need a list, i mean? [13:36:19] and if you mistype the name of the cluster the cookbook fails etc.. [13:36:45] sure, but so would a cumin command that targeted the wrong hosts [13:37:03] and it think it asks you before proceedding, right? [13:37:08] or mirror cookbook doesn't, we could add that [13:37:11] if you have a proposal in mind speak up, I am not getting your point :D [13:37:12] even better... [13:37:23] would be to render the choices from puppet [13:37:37] I"m just trying to DRY up the configs so they dont' get out of sync [13:37:49] if we remove the choices list restriction, there's no reason need to DRY [13:37:59] sure but it seem a little overkill, we add/remove clusters rarely [13:38:00] :) [13:38:03] yaeh that's tru [13:38:23] ok, i'll leave them, just was in there and thought i'd ask [13:38:27] will add testt [13:38:32] ack :) [13:41:11] we probably run mirromaker in k8s :) [13:41:44] it is a perfect candidate yes [13:50:24] (03CR) 10Mforns: [C: 03+1] Add editoractivation schema to analytics/legacy [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/699051 (https://phabricator.wikimedia.org/T284679) (owner: 10Ottomata) [13:52:14] ottomata: one possible way to dry up could be to move the function that returns the list of clusters to __init__.py [13:52:27] we do something similar for hadoop IIRC [13:53:01] oo that would be nice, cool idea. will do that after https://phabricator.wikimedia.org/T279342 is done [15:04:51] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Patch-For-Review: Replace Content::getNativeData() calls with TextContent::getText() in EventLogging - https://phabricator.wikimedia.org/T283671 (10Ottomata) I opened a can of worms: https://phabricator.wikimedia.org/T284345#7151622 [15:37:15] 10Analytics, 10Event-Platform, 10Services: EventGate should use recent service-runner (^2.8.1) with Prometheus support - https://phabricator.wikimedia.org/T272714 (10Ottomata) cc Michael: I know you were looking for some event platform tasks..this one would be really helpful! [15:50:03] Hi ottomata - Would you have a minute for me? [15:50:10] sure joal [15:50:21] i'm already in bc for fun time :) [15:50:29] ottomata: I have a failing gobblin on getting EventStreamConfig [15:50:37] joining [17:51:24] 10Analytics-Radar, 10Readers-Web-Backlog, 10Performance-Team (Radar), 10Vue.js (Vue.js Search Experience (Vector modern)): Revise schema and performance dashboards for Vue.js search - https://phabricator.wikimedia.org/T250336 (10Jdlrobson) a:03nray @nray this is done if I'm not mistaken with the dashboar... [18:07:08] joal: https://gerrit.wikimedia.org/r/c/wikimedia-event-utilities/+/699450 [18:15:45] 10Analytics, 10Event-Platform, 10Performance-Team, 10Patch-For-Review: EditorActivation Event Platform Migration - https://phabricator.wikimedia.org/T284679 (10Ottomata) 05Open→03Declined [18:15:51] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 5 others: Migrate legacy metawiki schemas to Event Platform - https://phabricator.wikimedia.org/T259163 (10Ottomata) [18:16:48] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 5 others: Migrate legacy metawiki schemas to Event Platform - https://phabricator.wikimedia.org/T259163 (10Ottomata) [18:16:52] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 4 others: Determine which remaining legacy EventLogging schemas need to be migrated or decommissioned - https://phabricator.wikimedia.org/T282131 (10Ottomata) [18:17:07] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 4 others: Determine which remaining legacy EventLogging schemas need to be migrated or decommissioned - https://phabricator.wikimedia.org/T282131 (10Ottomata) [18:17:27] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 4 others: Determine which remaining legacy EventLogging schemas need to be migrated or decommissioned - https://phabricator.wikimedia.org/T282131 (10Ottomata) p:05Triage→03High [18:17:57] 10Analytics, 10Event-Platform, 10Patch-For-Review: WikipediaPortal Event Platform Migration - https://phabricator.wikimedia.org/T282012 (10Ottomata) Ping @eyener? [18:18:05] 10Analytics, 10Event-Platform, 10Patch-For-Review: WikipediaPortal Event Platform Migration - https://phabricator.wikimedia.org/T282012 (10Ottomata) p:05Triage→03Medium [18:18:40] 10Analytics, 10Event-Platform: mediawiki/page/properties-change schema should use map type for added and removed page properties - https://phabricator.wikimedia.org/T281483 (10Ottomata) p:05Triage→03Medium [18:18:58] 10Analytics, 10Event-Platform: Deploy schema repos to analytics cluster and use local uris for analytics jobs - https://phabricator.wikimedia.org/T280017 (10Ottomata) p:05Triage→03Low [18:19:31] 10Analytics, 10Better Use Of Data, 10Event-Platform, 10Metrics-Platform: wgEventStreams (EventStreamConfig) should support per wiki overrides - https://phabricator.wikimedia.org/T277193 (10Ottomata) p:05Triage→03Medium [18:19:44] 10Analytics, 10Event-Platform, 10Release-Engineering-Team (Radar): Stop using puppet + git pull for auto deployment of schema repos - https://phabricator.wikimedia.org/T274901 (10Ottomata) p:05Triage→03Low [18:23:55] 10Analytics, 10Better Use Of Data, 10Event-Platform, 10Gerrit-Privilege-Requests, and 2 others: Create or identify an appropriate Gerrit group for +2 rights on schemas/event/secondary - https://phabricator.wikimedia.org/T279089 (10Ottomata) FYI, we just changed +2 to ldap/wmf and ldap/wmde instead. https:... [18:26:09] 10Analytics, 10Analytics-Kanban, 10Event-Platform, 10Fundraising-Backlog, 10MW-1.37-notes (1.37.0-wmf.4; 2021-05-04): CentralNoticeBannerHistory and CentralNoticeImpression Event Platform Migration - https://phabricator.wikimedia.org/T271168 (10Ottomata) p:05Triage→03High @AndyRussG hi! Did ^ happen? [18:26:34] 10Analytics, 10Event-Platform: Schema compatibility check for changing event schemas fails when adding to the middle of an array - https://phabricator.wikimedia.org/T270470 (10Ottomata) p:05Triage→03Low [18:29:06] 10Analytics, 10Better Use Of Data, 10Product-Analytics, 10Product-Data-Infrastructure: Schema repository structure, naming - https://phabricator.wikimedia.org/T269936 (10Ottomata) p:05Triage→03Medium Hi, any follow up here? Should we close this ticket? [18:29:43] 10Analytics, 10Better Use Of Data, 10Event-Platform, 10Product-Analytics, 10Product-Data-Infrastructure: [MEP] Determine how stream configuration is authored and deployed - https://phabricator.wikimedia.org/T269774 (10Ottomata) p:05Triage→03Low [18:34:02] 10Analytics, 10Analytics-EventLogging, 10Better Use Of Data, 10Event-Platform, and 4 others: KaiOS / Inuka Event Platform client - https://phabricator.wikimedia.org/T273219 (10Ottomata) p:05Triage→03High [18:35:07] 10Analytics, 10Event-Platform: Figure out where stream/schema annotations belong (for sanitization and other use cases) - https://phabricator.wikimedia.org/T263672 (10Ottomata) 05Open→03Resolved p:05Triage→03Medium a:03Ottomata I'm going to close this task. We certainly have determined we don't thin... [18:35:13] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 5 others: Modern Event Platform: Schema Guidelines and Conventions - https://phabricator.wikimedia.org/T214093 (10Ottomata) [18:35:19] 10Analytics-EventLogging, 10Analytics-Kanban, 10Event-Platform, 10Goal, and 3 others: Modern Event Platform - https://phabricator.wikimedia.org/T185233 (10Ottomata) [18:36:45] 10Analytics, 10Better Use Of Data, 10Event-Platform, 10Product-Data-Infrastructure, 10Performance-Team (Radar): Research and consider network connections made due to Event Platform - https://phabricator.wikimedia.org/T263049 (10Ottomata) p:05Triage→03Low [18:37:00] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Better Use Of Data, and 5 others: Migrate legacy metawiki schemas to Event Platform - https://phabricator.wikimedia.org/T259163 (10Ottomata) p:05Triage→03High [18:38:55] 10Analytics, 10Analytics-EventLogging, 10QuickSurveys, 10WMDE-TechWish, 10Readers-Web-Backlog (Tracking): QuickSurveys should show an error when response is blocked - https://phabricator.wikimedia.org/T256463 (10Ottomata) p:05Triage→03Low [18:39:53] 10Analytics, 10User-Elukey: Deprecation (if possible) of the #central channel on irc.wikimedia.org - https://phabricator.wikimedia.org/T242712 (10Ottomata) p:05Triage→03Low [18:40:06] 10Analytics, 10Analytics-EventLogging, 10Analytics-Kanban, 10Patch-For-Review: Replace Content::getNativeData() calls with TextContent::getText() in EventLogging - https://phabricator.wikimedia.org/T283671 (10Ottomata) p:05Triage→03Medium [18:40:18] (03Abandoned) 10Ottomata: Add editoractivation schema to analytics/legacy [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/699051 (https://phabricator.wikimedia.org/T284679) (owner: 10Ottomata) [18:42:37] 10Analytics, 10Platform Engineering, 10Research: Create airflow instances for Platform Engineering and Research - https://phabricator.wikimedia.org/T284225 (10Ottomata) [18:43:11] 10Analytics, 10Analytics-Kanban, 10Platform Engineering, 10Research: Create airflow instances for Platform Engineering and Research - https://phabricator.wikimedia.org/T284225 (10Ottomata) p:05Triage→03High @razzi, I think we can go ahead and create an-airflow1002 for platform eng. Could you make that... [18:43:26] 10Analytics, 10Analytics-Kanban, 10Platform Engineering, 10Research, 10User-razzi: Create airflow instances for Platform Engineering and Research - https://phabricator.wikimedia.org/T284225 (10Ottomata) [18:44:11] 10Analytics, 10Analytics-Kanban: [SPIKE] analytics-airflow jobs development - https://phabricator.wikimedia.org/T284172 (10Ottomata) p:05Triage→03High [19:09:34] 10Analytics, 10Event-Platform, 10Patch-For-Review: WikipediaPortal Event Platform Migration - https://phabricator.wikimedia.org/T282012 (10EYener) Thank you! Yes we can get this done in Q1. [19:10:07] 10Analytics, 10Event-Platform, 10Patch-For-Review: WikipediaPortal Event Platform Migration - https://phabricator.wikimedia.org/T282012 (10Ottomata) Great thanks, we'll aim to migrate this schema early in Q2 then.