[00:23:22] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [04:24:41] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [07:36:25] 10Data-Engineering, 10Data-Engineering-Wikistats: Monthly pageview stats for March 2023 missing - https://phabricator.wikimedia.org/T333923 (10Antoine_Quhen) 05Open→03Resolved a:03Antoine_Quhen I can confirm that now the data looks good in both: * https://dumps.wikimedia.org/other/pageview_complete/month... [08:23:29] (03PS1) 10Aqu: Improve documentation of create_disallowed_cassandra_articles_table script [analytics/refinery] - 10https://gerrit.wikimedia.org/r/906707 (https://phabricator.wikimedia.org/T333950) [08:24:41] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [08:27:14] (03PS2) 10Aqu: Fix doc of create_disallowed_cassandra_articles_table [analytics/refinery] - 10https://gerrit.wikimedia.org/r/906707 (https://phabricator.wikimedia.org/T333950) [10:33:08] (03CR) 10Aqu: [V: 03+2 C: 03+2] "Thanks for the reviews!" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/894661 (https://phabricator.wikimedia.org/T327073) (owner: 10Aqu) [10:34:30] !log About to deploy analytics/refinery in test cluster [10:34:31] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [10:35:56] Hi aqu and mforns - Thanks for handling the emergency of AQS [10:36:29] joal glad you are back! [10:36:47] aqu, mforns: I have a question on the filtering approach: Do we wish to filter only the top result? Wouldn't it be better to filter the pageview-per-article as well? [10:37:32] aqu: Not back for real, still off toda, but it's getting better and I opened my computer today :) [10:38:01] pageview-per-article is filtered also: https://github.com/wikimedia/analytics-refinery/blob/master/hql/cassandra/daily/load_cassandra_pageview_per_article_daily.hql#L79 [10:38:58] aqu: You guys rock :0 I haven't looked at the code so just seeing the mentions of top-pageviews in the slack thread I figured out I'd better ask :) [10:39:13] :) [10:40:33] aqu: I imagine we could be better at filtering only in case the number of views is above a threshold, to not remove the article permanently, but that's detail for later - the solution fixes the problem we have now, whichi is perfect - thanks a lot [11:26:16] steve_munene are you around? I'm trying to deploy analytics/refinery and I get stuck at sending the files to HDFS: [11:26:16] aqu@an-test-coord1001:/srv/deployment/analytics/refinery$ sudo -u hdfs kerberos-run-command hdfs /srv/deployment/analytics/refinery/bin/refinery-deploy-to-hdfs --verbose --no-dry-run [11:26:16] 2023-04-07T11:19:32+00:00 Error: Cannot describe current version [11:26:16] It could come from a modified local file "artifacts/org/wikimedia/gobblin-wmf/gobblin-wmf-core-1.0.1-jar-with-dependencies.jar" [11:26:16] Do you know more about it ? Could you revert the local change ? [11:27:56] My bad, you are off today. It can wait till next week don't worry. [12:24:41] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [13:56:24] 10Data-Engineering-Planning, 10XTools, 10Chinese-Sites: Run maintain-views on zhwiki, newiki - https://phabricator.wikimedia.org/T334041 (10lbowmaker) [13:58:08] 10Data-Engineering-Planning, 10Event-Platform Value Stream: Define Service Level Objective (SLO) - https://phabricator.wikimedia.org/T333833 (10lbowmaker) [13:59:14] 10Data-Engineering-Planning, 10Event-Platform Value Stream: Event Catalog: Standardize Options Handling - https://phabricator.wikimedia.org/T333795 (10lbowmaker) [14:08:56] 10Data-Engineering-Planning, 10Event-Platform Value Stream: mediwiki-event-enrichment in k8s should use mwapi-async envoy listener for stream config in - https://phabricator.wikimedia.org/T333575 (10lbowmaker) [14:11:05] 10Data-Engineering: Create HDFS folder wmf/data/research - https://phabricator.wikimedia.org/T332926 (10lbowmaker) 05Open→03Resolved a:03lbowmaker [14:18:34] heya joal :] glad you're feeling a bit better [14:30:32] (03PS1) 10Mforns: Update pageview allowlist with new wiki ckb.wiktionary [analytics/refinery] - 10https://gerrit.wikimedia.org/r/906733 [14:31:38] (03CR) 10Mforns: [V: 03+2 C: 03+2] "Self-merging the quick fix :P" [analytics/refinery] - 10https://gerrit.wikimedia.org/r/906733 (owner: 10Mforns) [14:55:12] 10Data-Engineering, 10Data-Engineering-Wikistats: Monthly pageview stats for March 2023 missing - https://phabricator.wikimedia.org/T333923 (10Radim.kubacki) Indeed, it is there. There is one question: previously the filename contained year and month, now there is also 01 (day?). Is it going to remain as patte... [15:45:38] 10Data-Engineering, 10Machine-Learning-Team, 10Research, 10Event-Platform Value Stream (Sprint 11), 10Patch-For-Review: Design event schema for ML scores/recommendations on current page state - https://phabricator.wikimedia.org/T331401 (10Isaac) > Q: Will it be useful to have the 'prior state' of predict... [15:49:29] 10Data-Engineering, 10Machine-Learning-Team, 10Research, 10Event-Platform Value Stream (Sprint 11), 10Patch-For-Review: Design event schema for ML scores/recommendations on current page state - https://phabricator.wikimedia.org/T331401 (10diego) I would say that it would be nice to have it but not a must... [16:24:41] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [16:34:41] (SystemdUnitFailed) firing: (7) jupyter-dsaez-singleuser-conda-analytics.service Failed on stat1004:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [17:40:02] 10Data-Engineering, 10Research-Backlog: [Open question] Improve bot identification at scale - https://phabricator.wikimedia.org/T138207 (10leila) [17:46:03] 10Data-Engineering, 10Research-Backlog: [Open question] Improve bot identification at scale - https://phabricator.wikimedia.org/T138207 (10leila) @Ladsgroup I'm with you on the overall theme that better bot detection is possible and an important project to work on. :) @odimitrijevic I saw your note in T333950... [18:20:42] 10Data-Engineering, 10Product-Analytics: Job Failed: product-analytics-movement-metrics - https://phabricator.wikimedia.org/T334302 (10Mayakp.wiki) [19:41:59] 10Data-Engineering-Planning, 10Data Pipelines, 10WMF-General-or-Unknown, 10Editing-team (Kanban Board), and 2 others: "Invalid revision ID -1" error for VisualEditorFeatureUse events, mostly from officewiki - https://phabricator.wikimedia.org/T322602 (10matmarex) (check logs after deployment) [20:38:22] (SystemdUnitFailed) firing: (6) jupyter-aqu-singleuser-conda-analytics.service Failed on stat1005:9100 - https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state - https://grafana.wikimedia.org/d/g-AaZRFWk/systemd-status - https://alerts.wikimedia.org/?q=alertname%3DSystemdUnitFailed [21:11:08] 10Data-Engineering, 10Abstract Wikipedia team, 10Anti-Harassment, 10Cloud-Services, and 16 others: Migrate PipelineLib repos to GitLab - https://phabricator.wikimedia.org/T332953 (10thcipriani)