[01:28:05] Data-Engineering-Icebox, Movement-Insights, Movement-Metrics: Consider recalculating revert rate - https://phabricator.wikimedia.org/T267053#9802701 (nshahquinn-wmf) Open→Declined Revert rate has been removed from our current set of movement metrics (T359692).
[09:25:36] (CR) Gmodena: Update script importing XML dumps onto HDFS (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/1032018 (https://phabricator.wikimedia.org/T364045) (owner: Joal)
[09:31:16] (CR) Gmodena: "LGTM. Left you a nit question (non blocking)." [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1032020 (https://phabricator.wikimedia.org/T364045) (owner: Joal)
[09:50:30] Data-Engineering, Data Products, Observability-Logging, Traffic: HAProxy log format doesn't support "invalid" request path - https://phabricator.wikimedia.org/T365117 (Fabfur) NEW
[10:03:06] (CR) Joal: Update MediawikiXMLDumpsConverter (1 comment) [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1032020 (https://phabricator.wikimedia.org/T364045) (owner: Joal)
[10:06:16] (CR) Joal: Update script importing XML dumps onto HDFS (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/1032018 (https://phabricator.wikimedia.org/T364045) (owner: Joal)
[11:08:31] (CR) Gmodena: Update script importing XML dumps onto HDFS (1 comment) [analytics/refinery] - https://gerrit.wikimedia.org/r/1032018 (https://phabricator.wikimedia.org/T364045) (owner: Joal)
[12:03:23] Data-Engineering, Dumps-Generation, SRE, Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228#9804192 (BTullis) @xcollazo added [[https://gerrit.wikimedia.org/r/c/operations/puppet/+/102...
[12:16:09] Data-Engineering, Dumps-Generation, SRE, Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228#9804237 (BTullis) Here is a one-liner to list the next scheduled runs of all of the timers f...
[12:23:05] Quarry, Internet-Archive: [bug] Lot of queries stuck in queued state for hours and days (with stop actions leading to HTTP 500) - https://phabricator.wikimedia.org/T365136 (Teslaton) NEW
[12:41:00] Data-Engineering: Reset kerberos password for WMDE-leszek - https://phabricator.wikimedia.org/T365137 (WMDE-leszek) NEW
[13:01:51] Data-Engineering, Data-Platform: Add MW table 'cu_log' to data lake - https://phabricator.wikimedia.org/T364398#9804345 (lbowmaker)
[14:17:53] Data-Engineering (Q4 2024 April 1st - June 30th): [Spike] [Refine Refactoring] List out all production Refine datasets that need to be migrated to the config store (Airflow and Iceberg) - https://phabricator.wikimedia.org/T361498#9804869 (Ahoelzl) configured event streams and last commits for identifying dep...
[14:18:09] Data-Engineering (Q4 2024 April 1st - June 30th): [Spike] [Refine Refactoring] List out all production Refine datasets that need to be migrated to the config store (Airflow and Iceberg) - https://phabricator.wikimedia.org/T361498#9804872 (Ahoelzl) a:lbowmaker
[14:42:52] Data-Engineering (Q4 2024 April 1st - June 30th), Patch-For-Review: [Refine refactoring] Extract refine schema management into a dedicated tool - https://phabricator.wikimedia.org/T356762#9805088 (Antoine_Quhen) Moreover, one more conf for a dag to execute in a depth-first manner is to add in its default...
[14:47:22] Data-Engineering, Dumps-Generation, SRE, Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228#9805121 (xcollazo) > Xabriel, what do you think? Is this workable to try to get the host rol...
[15:19:25] Quarry: Error 500 when clicking "stop query" - https://phabricator.wikimedia.org/T362213#9805326 (Oudedutchman) The bug could not be reproduced locally on Quarry when running with `docker-compose up`.
[15:52:58] !log moving the `dumps::generation::worker::dumper_misc_crons` role from snapshot1008 to snapshot1017 for T325228
[15:53:01] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log
[15:53:02] T325228: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228
[15:53:53] btullis: o/
[15:54:05] elukey: Hi, how can I help.
[15:54:06] ?
[15:54:14] I saw the code review for the new chart but I didn't have time to review it yet, will try to do it tomorrow :)
[15:55:33] elukey: <3 Many thanks. There is a stack of patches which should make sense and the helm-lint on this one is the one which is the most valuable, I believe: https://gerrit.wikimedia.org/r/c/operations/deployment-charts/+/1031589/7
[15:57:58] ack!
[16:00:15] Data-Engineering, Dumps-Generation, SRE, Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228#9805537 (BTullis) I have disabled the timers on snapshot1008 with the following. ` btullis@s...
[16:06:32] Data-Engineering, Dumps-Generation, SRE, Data-Platform-SRE (2024.05.06 - 2024.05.26), Patch-For-Review: Migrate Dumps Snapshot hosts from Buster to Bullseye - https://phabricator.wikimedia.org/T325228#9805593 (BTullis) I stopped the timers with: ` btullis@snapshot1008:~$ for t in $(cat timers...
[16:38:25] random question, does it make sense to put our own Matomo beacon on www.wikimediastatus.net ?
[16:44:04] Data-Engineering, Librarization, MediaWiki-extensions-EventLogging, MediaWiki-extensions-JsonData: Librarise Libs/JsonSchemaValidation or replace - https://phabricator.wikimedia.org/T303131#9805960 (Aklapper) a:Ottomata→None @Ottomata: Removing task assignee as this open task has been ass...
[16:44:10] Data-Engineering, Event-Platform: Allow disabling/enabling configured streams via wgEventStreams config - https://phabricator.wikimedia.org/T259712#9805969 (Aklapper) a:Ottomata→None @Ottomata: Removing task assignee as this open task has been assigned for more than two years - see the email sent...
[16:44:55] Data-Engineering, tech-decision-forum, Event-Platform: MediaWiki Event Carried State Transfer - Problem Statement - https://phabricator.wikimedia.org/T291120#9805965 (Aklapper) a:Ottomata→None @Ottomata: Removing task assignee as this open task has been assigned for more than two years - see...
[16:54:47] Data-Engineering, Data Pipelines: Data pipelines should support conda environment files and integrate conda dist - https://phabricator.wikimedia.org/T303839#9806099 (Aklapper) a:gmodena→None @gmodena: Removing task assignee as this open task has been assigned for more than two years - see the ema...
[17:03:43] Data-Engineering, Data-Engineering-Kanban, Data-Engineering-Wikistats, Product-Analytics: Wikistats reports no mobile unique devices for Wikidata and MediaWiki.org - https://phabricator.wikimedia.org/T299559#9806216 (Aklapper) a:JAllemandou→None @JAllemandou: Removing task assignee as thi...
[17:06:51] Data-Engineering-Radar, MediaWiki-General: Update pingback "PHP Version" dashboards - https://phabricator.wikimedia.org/T298922#9806253 (Aklapper) a:mforns→None @mforns: Removing task assignee as this open task has been assigned for more than two years - see the email sent to all task assignees o...
[18:10:42] Data-Engineering, Data Products, Observability-Logging, Traffic, Patch-For-Review: HAProxy log format doesn't support "invalid" request path - https://phabricator.wikimedia.org/T365117#9806504 (Fabfur) Some other information about this: * In HAProxy replacing `%HPO` with `%HP` logs the whole...
[18:49:07] Data-Engineering, Data Pipelines: Drop MediaViewer and MultimediaViewer* tables - https://phabricator.wikimedia.org/T311229#9806647 (TheDJ) I'm just wondering why we are not just dropping this ? Cleanup is really important and help avoid problems in the future. How many high priority / fire fighting issu...
[19:27:33] Analytics-Data-Problem, Data-Platform-SRE, Patch-Needs-Improvement: Pageview definition relies on X-Analytics to determine special pages - https://phabricator.wikimedia.org/T304362#9806766 (nshahquinn-wmf) p:Triage→Low
[19:38:14] (PS3) Xcollazo: SQL queries that format the base Commons Impact Metrics datasets into the expected shape for Cassandra. [analytics/refinery] - https://gerrit.wikimedia.org/r/1023461 (https://phabricator.wikimedia.org/T358707)
[19:53:08] Data-Engineering: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log - https://phabricator.wikimedia.org/T364487#9806823 (amastilovic) This task can be closed as the issue has been fixed and changes to the DAG have been merged.
[19:55:12] (PS1) Gehel: fix(ISPDatabaseReader): avoid null pointer exception when reading MaxMind [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1032562
[19:56:43] (PS2) Gehel: fix(ISPDatabaseReader): avoid null pointer exception when reading MaxMind [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1032562
[20:07:21] Data-Engineering: ISPDatabaseReader null pointer exception - https://phabricator.wikimedia.org/T365197 (CDanis) NEW
[20:07:35] (PS3) CDanis: fix(ISPDatabaseReader): avoid null pointer exception when reading MaxMind [analytics/refinery/source] - https://gerrit.wikimedia.org/r/1032562 (https://phabricator.wikimedia.org/T365197) (owner: Gehel)
[20:35:52] Data-Engineering, Patch-For-Review: ISPDatabaseReader null pointer exception - https://phabricator.wikimedia.org/T365197#9806960 (CDanis) I can confirm that refinery-hive-0.2.31-shaded.jar does not show the issue on the same dataset.
[20:57:36] Data-Engineering-Radar, MediaWiki-General: Update pingback "PHP Version" dashboards - https://phabricator.wikimedia.org/T298922#9807039 (Reedy) Open→Resolved
[20:57:50] Data-Engineering, MediaWiki-General: PHP 8.3 missing (showing as other?) on https://pingback.wmflabs.org/#php-version - https://phabricator.wikimedia.org/T365201 (Reedy) NEW
[20:58:29] Data-Engineering, Patch-For-Review: ISPDatabaseReader null pointer exception - https://phabricator.wikimedia.org/T365197#9807041 (mpopov) @CDanis: Can you please paste the Spark code / Spark SQL query you used for reproducibility? Also, I want to say I've sometimes run into this kind of error message wh...
[21:01:50] Data-Engineering, MediaWiki-General: PHP 8.3 missing (showing as other?) on https://pingback.wmflabs.org/#php-version - https://phabricator.wikimedia.org/T365201#9807077 (Reedy) It seems {8c9b658eae394dfd82dad53f7c55b9f34c0c5837} should've been the fix by @CCicalese_WMF, but I'm guessing this wasn't depl...
[21:01:54] (CR) Reedy: "This doesn't seem to have been deployed/things re-generated. T365201 filed" [analytics/refinery] - https://gerrit.wikimedia.org/r/1024530 (owner: Cicalese)
[21:02:06] Data-Engineering, Patch-For-Review: ISPDatabaseReader null pointer exception - https://phabricator.wikimedia.org/T365197#9807078 (CDanis) Sure @mpopov! I was running over today's subset (about 228k events). `lang=python import wmfdata spark = wmfdata.spark.create_session(type='yarn-regular') import pys...
[21:10:48] Data-Engineering: [Data Quality] Implement completeness check for MediaWiki History - https://phabricator.wikimedia.org/T365203 (mpopov) NEW
[21:10:54] Analytics-Radar, Data-Engineering-Icebox, Discovery-Search, Research, and 3 others: Image Classification Research and Development - https://phabricator.wikimedia.org/T215413#9807107 (dr0ptp4kt)
[21:12:40] Data-Engineering: [Data Quality] Implement wiki completeness check for MediaWiki History - https://phabricator.wikimedia.org/T365203#9807141 (mpopov)
[21:57:46] Data-Engineering, FY2023-24-WE 2.1 Typography and palette customizations, Data Products (Data Products Sprint 13), Patch-For-Review, Web-Team-Backlog (FY2023-24 Q4 Sprint 3): Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962#9807247 (KSarabia-WMF) a:...
[21:57:49] Data-Engineering, FY2023-24-WE 2.1 Typography and palette customizations, Data Products (Data Products Sprint 13), Patch-For-Review, Web-Team-Backlog (FY2023-24 Q4 Sprint 3): Update Sample Rates for Metrics Platform Events - https://phabricator.wikimedia.org/T361962#9807248 (KSarabia-WMF) a:...
[23:32:26] Data-Engineering, Data-Engineering-Wikistats: Add azerbaijani language to Wikistats - https://phabricator.wikimedia.org/T365209 (NMW03) NEW