[01:34:58] 06Data-Engineering, 06DBA, 10MediaWiki-Core-Revision-backend, 07Schema-change: Rethink rev_sha1 field - https://phabricator.wikimedia.org/T389026#10644894 (10Bugreporter) rev_sha1 is actually a derived field and if it is dropped it can still be computed on-the-fly without redoing hash from scratch (see htt... [05:28:22] 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 07Epic: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162 (10tchin) 03NEW [05:28:31] 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 07Epic: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10645132 (10tchin) [05:28:35] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE-Mediawiki-Content, 13Patch-For-Review: Implement alerting for wmf_content.mediawiki_content_history_v1 - https://phabricator.wikimedia.org/T384962#10645133 (10tchin) [05:31:46] 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 07Epic: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10645136 (10tchin) [05:33:30] 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content, 07Epic: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10645137 (10tchin) [06:35:34] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Data-Engineering-Roadmap, 10DPE-Mediawiki-Content: Add ability to add tags to alerts - https://phabricator.wikimedia.org/T389162#10645228 (10tchin) [07:59:06] 06Data-Engineering, 06DBA, 10MediaWiki-Core-Revision-backend, 07Schema-change: Rethink rev_sha1 field - https://phabricator.wikimedia.org/T389026#10645369 (10daniel) > Store the checksum as an unsigned bigint instead. That sounds good to me. > Alternatively we can just introduce a revision_hash table wit... [08:06:43] 10Data-Engineering (Q3 2025 January 1st - March 31th), 07Essential-Work, 13Patch-For-Review: Migrate analytics Airflow DAGs to k8s Airflow deployment - https://phabricator.wikimedia.org/T386282#10645395 (10brouberol) [08:16:27] 06Data-Engineering, 06DBA, 10MediaWiki-Core-Revision-backend, 07Schema-change: Rethink rev_sha1 field - https://phabricator.wikimedia.org/T389026#10645415 (10Ladsgroup) >>! In T389026#10641812, @Ottomata wrote: > As this is being considered, please keep in mind that rev_sha1 is used in downstream data pipe... [08:29:11] 06Data-Engineering, 06DBA, 10MediaWiki-Core-Revision-backend, 07Schema-change: Rethink rev_sha1 field - https://phabricator.wikimedia.org/T389026#10645441 (10Bugreporter) This tasks saves 32 bytes per revision. However I have a series of other ideas that can save more: * {T384129} (saves 17 bytes per revis... [09:12:19] (03PS1) 10Aqu: Refine webrequest - uri_host with trailing dot [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/1128801 (https://phabricator.wikimedia.org/T354694) [09:27:57] 06Data-Engineering, 07Epic, 13Patch-For-Review: [Maintenance] Safeguard VarnishKafka to HAProxy analytics transition - https://phabricator.wikimedia.org/T354694#10645578 (10Antoine_Quhen) I’ve created a patch addressing the issue of trailing dots, which can be found here: https://gerrit.wikimedia.org/r/11288... [11:55:33] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Growth-Structured-Tasks, 06Growth-Team, 10Image-Suggestions, and 6 others: wmf.wikidata_item_page_link and wmf.wikidata_entity snapshots stuck at 2025-01-20 - https://phabricator.wikimedia.org/T386255#10646186 (10BTullis) I have good news and bad new... [13:34:13] 06Data-Engineering, 06DBA, 10MediaWiki-Core-Revision-backend, 07Schema-change: Rethink rev_sha1 field - https://phabricator.wikimedia.org/T389026#10646525 (10Ottomata) > My question is that whether a bigint or an int field would be "good enough" for your usecases. Oh ya these would probably work fine, jus... [13:51:47] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Data Pipelines, 10Observability-Metrics, 07Essential-Work, and 2 others: Disable Data Platform Engineering generated graphite metrics and dashboards - https://phabricator.wikimedia.org/T372855#10646678 (10AndrewTavis_WMDE) See mentioned tasks above f... [14:17:33] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE-Mediawiki-Content, 10Image-Suggestions, 10Section-Level-Image-Suggestions, 10Structured-Data-Backlog (Current Work): [SPIKE] Check the Wikimedia content history dataset - https://phabricator.wikimedia.org/T385787#10646866 (10Cparle) a:03Cparle [16:16:34] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE-Mediawiki-Content, 10Image-Suggestions, 10Section-Level-Image-Suggestions, 10Structured-Data-Backlog (Current Work): [SPIKE] Check the Wikimedia content history dataset - https://phabricator.wikimedia.org/T385787#10647494 (10Cparle) [16:26:57] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10DPE-Mediawiki-Content, 10Image-Suggestions, 10Section-Level-Image-Suggestions, 10Structured-Data-Backlog (Current Work): [SPIKE] Check the Wikimedia content history dataset - https://phabricator.wikimedia.org/T385787#10647603 (10Cparle) Replacing `... [17:39:23] 06Data-Engineering, 06Data-Engineering-Radar, 10MediaWiki-Blocks, 10Multiblocks, and 4 others: Add a unique index to the block_target table - https://phabricator.wikimedia.org/T389028#10647970 (10Ahoelzl) [17:40:48] 06Data-Engineering: test_produced_by_config SLA miss configured to be too small for upstream dataset run time - https://phabricator.wikimedia.org/T388861#10647981 (10Ahoelzl) @amastilovic [17:46:30] 06Data-Engineering, 06Movement-Insights: Add MW History data quality checks output to Superset dashboard - https://phabricator.wikimedia.org/T365238#10648003 (10Ahoelzl) 05Open→03Declined [17:47:59] 06Data-Engineering, 10DPE-Mediawiki-Content: When doing ADD COLUMN to a struct under a map, Iceberg fails to SELECT it - https://phabricator.wikimedia.org/T388793#10648019 (10Ahoelzl) [17:54:03] 06Data-Engineering, 06Data-Engineering-Icebox, 06Research-Freezer: [Open question] Improve bot identification at scale - https://phabricator.wikimedia.org/T138207#10648060 (10Ottomata) [18:01:22] 10Data-Engineering (Q3 2025 January 1st - March 31th), 10Growth-Structured-Tasks, 06Growth-Team, 10Image-Suggestions, and 6 others: wmf.wikidata_item_page_link and wmf.wikidata_entity snapshots stuck at 2025-01-20 - https://phabricator.wikimedia.org/T386255#10648216 (10xcollazo) > The bad news is that the... [18:21:40] 10Data-Engineering-Wikistats: Incorrect