[06:45:47] 10Data-Engineering-Planning: Cleanup User Hive Databases - https://phabricator.wikimedia.org/T323884 (10odimitrijevic) [09:04:45] 10Data-Engineering, 10Event-Platform Value Stream, 10SRE, 10Traffic, and 2 others: Incident: 2022-03-4 Banner sampling leading to a relatively wide site outage (mostly esams) - https://phabricator.wikimedia.org/T303036 (10Marostegui) @lmata what should we do with this follow up task? [09:40:58] 10Data-Engineering, 10Event-Platform Value Stream, 10SRE, 10Traffic, and 2 others: Incident: 2022-03-4 Banner sampling leading to a relatively wide site outage (mostly esams) - https://phabricator.wikimedia.org/T303036 (10BTullis) I'm not sure that there's much more to do, is there? From a technical perspe... [10:04:56] 10Data-Engineering-Planning, 10Shared-Data-Infrastructure (EQ2 Kanban (Sprints 04-05)): Add an-presto10[06-15] to the presto cluster - https://phabricator.wikimedia.org/T323783 (10BTullis) [10:05:12] 10Data-Engineering-Planning, 10Shared-Data-Infrastructure (EQ2 Kanban (Sprints 04-05)): Add an-presto10[06-15] to the presto cluster - https://phabricator.wikimedia.org/T323783 (10BTullis) a:03Stevemunene [10:38:33] 10Data-Engineering-Planning, 10Shared-Data-Infrastructure, 10Data Pipelines (Sprint 04): Create Plan for Spark 2 Deprecation - https://phabricator.wikimedia.org/T318367 (10EChetty) 05Open→03Resolved [10:38:38] 10Data-Engineering-Kanban, 10Data-Engineering-Planning, 10Cassandra, 10Shared-Data-Infrastructure, 10User-Eevans: Properly add aqsloader user (w/ secrets) - https://phabricator.wikimedia.org/T305600 (10EChetty) [10:38:40] 10Data-Engineering-Planning, 10Cassandra, 10Data Pipelines (Sprint 04), 10Patch-For-Review: Write dedicated cassandra authorization code to read password from file when loading - https://phabricator.wikimedia.org/T306895 (10EChetty) 05Open→03Resolved [10:38:46] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 04): Allow Cormac Parle and Marco Fossati to deploy analytics-platform-eng Airflow instance - https://phabricator.wikimedia.org/T321925 (10EChetty) 05Open→03Resolved a:03EChetty [10:45:47] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Bug: User History has mismatching order of fields in Parquet vs. Hive - https://phabricator.wikimedia.org/T321231 (10EChetty) [10:47:39] 10Data-Engineering-Planning, 10Shared-Data-Infrastructure: NEW FEATURE REQUEST: Upgrade superset to 1.5.2 - https://phabricator.wikimedia.org/T323458 (10EChetty) [11:06:14] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10EChetty) [11:06:17] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Airflow Upgrade Compatibility with V2.3.2 - https://phabricator.wikimedia.org/T309552 (10EChetty) [11:06:19] 10Analytics-Jupyter, 10Data-Engineering-Planning, 10Data Pipelines, 10Product-Analytics, 10Patch-For-Review: Add support for jupyterlab on conda-analytics - https://phabricator.wikimedia.org/T321088 (10EChetty) [11:34:11] (03PS1) 10Volans: oozie, druid: fix aggregated_time_firstbyte [analytics/refinery] - 10https://gerrit.wikimedia.org/r/861365 [11:35:17] joal: o/ We spoke about trying to work together on a test case for T321960 - Do you think that there is perhaps an chance we could find one for data on the test cluster? [11:35:17] T321960: Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 [11:36:57] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10BTullis) [11:50:41] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10BTullis) I have replicated the original query without using wmfdata python. ` btullis@stat1004:~$ presto --catalog analytics_hive... [12:00:06] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10BTullis) For now, I have manually created a new catalog on an-coord1001 with the single parameter change under test. ` btullis@an-... [12:00:49] !log restarted presto-server on an-coord1001 to test T321960 [12:00:51] Logged the message at https://www.mediawiki.org/wiki/Analytics/Server_Admin_Log [12:00:51] T321960: Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 [12:25:19] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10BTullis) Apart from the fact that it didn't like the uppercase `T` in the catalog name, this seems to work and the behaviour with... [12:33:36] Hi btullis - I assume you have already deployed the parameter I wanted to test, right? [12:34:59] joal: Yes. I'm not sure whether it's working or not. You can test it out with `presto --catalog analytics_hive_t321960`from a stat box. [12:35:23] Oh wow - already deployed on the prod cluster! [12:35:24] Adding that parameter is the only change between that and the `analytics_hive` catalog. [12:35:38] testing now :) [12:35:46] 10Data-Engineering, 10Event-Platform Value Stream, 10SRE, 10Traffic, and 2 others: Incident: 2022-03-4 Banner sampling leading to a relatively wide site outage (mostly esams) - https://phabricator.wikimedia.org/T303036 (10lmata) 05Open→03Resolved a:03lmata Thank you @BTullis for T303036#8423773. I th... [12:37:01] joal: I'm going to clean up after myself, but I though that this was an expedient way of testing and v. low risk. I have restarted the presto-server service on an-coord1001 (required) and an-presto1001 (as a test). [12:37:20] That's awesome btullis :) [12:38:13] (VarnishkafkaNoMessages) firing: varnishkafka on cp5029 is not sending enough cache_upload requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_upload&var-instance=cp5029%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [12:39:41] btullis: actually queries give errors with the new catalog AFAICS [12:40:26] btullis: sortedCandidates is null or empty for ModularHashingNodeProvider [12:40:57] Hmm, I suspected as much. I was just researching what `sortedCandidates is null or empty for ModularHashingNodeProvider` means. [12:41:13] As simple query like this worked: `SHOW TABLES FROM event;` [12:41:32] this only queries metastore, and works [12:41:40] I tried a query reading data - failed [12:43:13] (VarnishkafkaNoMessages) resolved: varnishkafka on cp5029 is not sending enough cache_upload requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_upload&var-instance=cp5029%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [12:43:43] OK, thanks. I wonder if there is anything else I need to do to get the a new catalog to work, or whether it's actually the new parameter that made it fail. Perhaps I should try a different parameter in the temporary catalog? [12:44:04] possible btullis - we can try [12:49:18] Thanks. Leave it with me, I will experiment some more. [12:54:34] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Presto returns incorrect data for an added field - https://phabricator.wikimedia.org/T321960 (10BTullis) After a bit more testing it seems that this new catalog isn't working for data queries. The error `sortedCandidates is nu... [12:58:44] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 04), 10Spike: Easy Flink Python UDF + SQL enrichment - https://phabricator.wikimedia.org/T320968 (10JArguello-WMF) 05Open→03Resolved [12:58:57] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 04): [Shared Event Platform] Mediawiki Stream Enrichment should consume the consolidated page-change stream. - https://phabricator.wikimedia.org/T311084 (10JArguello-WMF) [13:39:14] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 04): [Shared Event Platform] Mediawiki Stream Enrichment should consume the consolidated page-change stream. - https://phabricator.wikimedia.org/T311084 (10JArguello-WMF) 05Open→03Resolved [13:39:16] 10Data-Engineering-Planning, 10Event-Platform Value Stream, 10Epic: [Shared Event Platform] Design and Implement POC Flink Service to Combine Existing Streams, Enrich and Output to New Topic - https://phabricator.wikimedia.org/T307959 (10JArguello-WMF) [13:40:14] 10Data-Engineering-Planning, 10Event-Platform Value Stream, 10Shared-Data-Infrastructure: [SPIKE] Deploy event driven stateless Flink service to DSE cluster - https://phabricator.wikimedia.org/T320812 (10JArguello-WMF) [13:43:43] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 05): Flink SQL queries should access Kafka topics from a Catalog - https://phabricator.wikimedia.org/T322022 (10JArguello-WMF) [14:05:01] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 05), 10Patch-For-Review: Create a shared flink docker image - https://phabricator.wikimedia.org/T316519 (10Ottomata) [14:05:57] 10Data-Engineering-Planning, 10Event-Platform Value Stream, 10EventStreams: EventStreams doesn't show the Wikistories-* streams - https://phabricator.wikimedia.org/T307679 (10Ottomata) In stream-beta they should show up automatically. I do see https://stream-beta.wmflabs.org/v2/ui/#/?streams=mediawiki.wikist... [14:07:13] 10Data-Engineering: Move archiva to private IPs + CDN - https://phabricator.wikimedia.org/T317182 (10Ottomata) @echetty I don't think this task belongs in Event Platform. Removing tag. [14:08:37] 10Analytics-Clusters, 10Data-Engineering-Planning, 10Voice & Tone: Rename geoeditors_blacklist_country - https://phabricator.wikimedia.org/T259804 (10Ottomata) [14:09:03] 10Data-Engineering-Planning, 10Projects-Cleanup: Clean up wikimetrics - https://phabricator.wikimedia.org/T318193 (10Ottomata) [14:11:25] 10Data-Engineering-Planning, 10Shared-Data-Infrastructure: Move archiva to private IPs + CDN - https://phabricator.wikimedia.org/T317182 (10BTullis) Dropping it into #shared-data-infrastructure [14:11:47] (03CR) 10Matthias Mullie: "All instrumentation has been implemented; this schema is good to go." [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/845498 (https://phabricator.wikimedia.org/T321069) (owner: 10Matthias Mullie) [14:11:55] (03CR) 10Matthias Mullie: [C: 03+1] Add schema for Extension:SearchVue actions [schemas/event/secondary] - 10https://gerrit.wikimedia.org/r/845498 (https://phabricator.wikimedia.org/T321069) (owner: 10Matthias Mullie) [14:13:07] (03CR) 10Elukey: [C: 03+1] oozie, druid: fix aggregated_time_firstbyte [analytics/refinery] - 10https://gerrit.wikimedia.org/r/861365 (owner: 10Volans) [14:40:13] (VarnishkafkaNoMessages) firing: varnishkafka on cp5020 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_text&var-instance=cp5020%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [14:45:13] (VarnishkafkaNoMessages) resolved: varnishkafka on cp5020 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_text&var-instance=cp5020%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [14:51:55] 10Data-Engineering, 10Event-Platform Value Stream, 10Patch-For-Review: Design Schema for page state and page state with content (enriched) streams - https://phabricator.wikimedia.org/T308017 (10Ottomata) > build increasingly complex code to not fall out of sync with Mediawiki (akin to the heroic scale of wha... [14:53:18] 10Data-Engineering-Kanban, 10Data-Engineering-Planning, 10Data Pipelines: Optimization of conda-analytics deb package - https://phabricator.wikimedia.org/T318397 (10Ottomata) I'm fine either way. I think I prefer two packages if we want to keep the worker installed size smaller, if we don't care, then let's... [15:01:18] 10Data-Engineering, 10Event-Platform Value Stream (Sprint 05): [SPIKE] Evaluate a pyflink version of Mediawiki Stream Enrichment - https://phabricator.wikimedia.org/T323217 (10gmodena) [15:27:46] 10Data-Engineering-Planning, 10Event-Platform Value Stream (Sprint 05): Flink SQL queries should access Kafka topics from a Catalog - https://phabricator.wikimedia.org/T322022 (10Ottomata) Very cool! Code? :) > Kafka stretch could maybe help here. It will help, but I don't think it will eliminate the need f... [15:46:13] (VarnishkafkaNoMessages) firing: varnishkafka on cp5018 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_text&var-instance=cp5018%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [15:51:13] (VarnishkafkaNoMessages) resolved: varnishkafka on cp5018 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://grafana.wikimedia.org/d/000000253/varnishkafka?orgId=1&var-datasource=eqsin%20prometheus/ops&var-cp_cluster=cache_text&var-instance=cp5018%3A9132&viewPanel=14 - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [15:55:59] 10Data-Engineering-Planning: Cleanup User Hive Databases - https://phabricator.wikimedia.org/T323884 (10mpopov) > Identify and archive/delete any databases that are no longer in use I've listed some on Slack: https://wikimedia.slack.com/archives/CSV483812/p1669220529284269, but there are more [15:58:18] 10Data-Engineering, 10Equity-Landscape: Population input metrics - https://phabricator.wikimedia.org/T309279 (10JAnstee_WMF) p:05Medium→03High [16:10:35] 10Analytics-Jupyter, 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06), 10Patch-For-Review: Add support for jupyterlab on conda-analytics - https://phabricator.wikimedia.org/T321088 (10Ottomata) > can you please install the latest conda deb package on an-test-client1001 @x... [16:11:27] 10Analytics-Jupyter, 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06), 10Patch-For-Review: Add support for jupyterlab on conda-analytics - https://phabricator.wikimedia.org/T321088 (10EChetty) [16:54:13] (VarnishkafkaNoMessages) firing: (2) varnishkafka on cp5019 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [16:59:13] (VarnishkafkaNoMessages) resolved: (2) varnishkafka on cp5019 is not sending enough cache_text requests - https://wikitech.wikimedia.org/wiki/Analytics/Systems/Varnishkafka - https://alerts.wikimedia.org/?q=alertname%3DVarnishkafkaNoMessages [17:07:12] (03PS16) 10Aqu: Add HdfsXMLFsImageConverter to refinery-job [analytics/refinery/source] - 10https://gerrit.wikimedia.org/r/852315 (https://phabricator.wikimedia.org/T321168) [18:18:40] 10Data-Engineering, 10API Platform, 10AQS 2.0 Roadmap, 10Epic, and 2 others: Create k8s deployment of AQS 2.0 - https://phabricator.wikimedia.org/T288661 (10BPirkle) There is likely overlap between this task and the direction that it looks like {T323190} is headed. I feel they should remain separate tasks... [18:37:18] 10Data-Engineering, 10AQS 2.0 Roadmap, 10API Platform (API Platform Roadmap), 10Epic, and 2 others: AQS 2.0:Wikistats 2 service - https://phabricator.wikimedia.org/T288301 (10odimitrijevic) I am arriving at the conversation a little late. I am curious about the reason to separate geoeditor from the editin... [19:01:35] 10Analytics-Radar, 10Data-Engineering, 10Product-Analytics, 10Wmfdata-Python: wmfdata cannot recover from a crashed Spark session - https://phabricator.wikimedia.org/T245713 (10nshahquinn-wmf) 05Open→03Resolved a:03nshahquinn-wmf Thanks to T273210, Wmfdata now has the ability to recreate Spark sessio... [19:01:37] 10Analytics-Radar, 10Data-Engineering, 10Product-Analytics, 10Wmfdata-Python, 10Epic: Analysts cannot reliably use wmfdata to run SQL queries against Hive databases - https://phabricator.wikimedia.org/T245891 (10nshahquinn-wmf) [19:51:33] PROBLEM - Check systemd state on an-launcher1002 is CRITICAL: CRITICAL - degraded: The following units failed: produce_canary_events.service https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state [19:55:45] 10Data-Engineering, 10Patch-For-Review, 10Product-Analytics (Kanban): Add mediawiki_web_ab_test_enrollment to the allowlist - https://phabricator.wikimedia.org/T323664 (10mforns) Hi @MNeisler! The existing data in the event table only goes back 90 days from the current date. At the time of this message, the... [20:10:57] RECOVERY - Check systemd state on an-launcher1002 is OK: OK - running: The system is fully operational https://wikitech.wikimedia.org/wiki/Monitoring/check_systemd_state [20:16:14] 10Data-Engineering-Planning, 10Data Pipelines: NEW FEATURE REQUEST: sqoop (all) user properties from mariadb to wmf_raw.mediawiki_user_properties - https://phabricator.wikimedia.org/T323456 (10EChetty) [20:16:45] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): NEW FEATURE REQUEST: sqoop (all) user properties from mariadb to wmf_raw.mediawiki_user_properties - https://phabricator.wikimedia.org/T323456 (10EChetty) [20:16:47] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): NEW FEATURE REQUEST: sqoop (all) user properties from mariadb to wmf_raw.mediawiki_user_properties - https://phabricator.wikimedia.org/T323456 (10EChetty) [20:18:11] (03CR) 10Mforns: "Thanks for putting this together! Left a comment." [analytics/refinery] - 10https://gerrit.wikimedia.org/r/859652 (https://phabricator.wikimedia.org/T323664) (owner: 10MNeisler) [20:18:33] 10Data-Engineering-Planning, 10Data Pipelines: Implement periodical cleaning of Airflow databases - https://phabricator.wikimedia.org/T322036 (10EChetty) [20:18:43] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Implement periodical cleaning of Airflow databases - https://phabricator.wikimedia.org/T322036 (10EChetty) [20:19:06] 10Data-Engineering-Planning, 10Data Pipelines: Back-fill Wikidata reliability Graphite metrics - https://phabricator.wikimedia.org/T321838 (10EChetty) [20:19:13] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Back-fill Wikidata reliability Graphite metrics - https://phabricator.wikimedia.org/T321838 (10EChetty) [20:19:44] 10Data-Engineering-Planning, 10Data Pipelines, 10Product-Analytics: [REQUEST] Add new Fundraising dimensions to druid.pageviews_daily & druid.pageviews_hourly - https://phabricator.wikimedia.org/T304571 (10EChetty) [20:20:15] 10Data-Engineering, 10Data Pipelines, 10Platform Engineering: Catalog, Categorize, and Templetize existing scheduled workflows - https://phabricator.wikimedia.org/T282035 (10EChetty) 05Open→03Resolved [20:20:18] 10Data-Engineering, 10Data Pipelines, 10Epic, 10Platform Team Workboards (Image Suggestion API): Airflow collaborations - https://phabricator.wikimedia.org/T282033 (10EChetty) [20:20:42] 10Data-Engineering, 10Data Pipelines: Alarms for virtualpageview should exist (probably in oozie) for jobs that have been idle too long - https://phabricator.wikimedia.org/T213716 (10EChetty) 05Open→03Resolved [20:20:44] 10Analytics, 10Analytics-Kanban: virtualpageview_hourly lacks data from December 17 on - https://phabricator.wikimedia.org/T213602 (10EChetty) [20:21:09] 10Data-Engineering-Planning, 10Data Pipelines, 10Product-Analytics: Review why total_edits on Mediawiki_History differs from the total_edits on Editors_Daily - https://phabricator.wikimedia.org/T316896 (10EChetty) [20:21:22] 10Data-Engineering-Planning, 10Product-Analytics, 10Data Pipelines (Sprint 05-06): Review why total_edits on Mediawiki_History differs from the total_edits on Editors_Daily - https://phabricator.wikimedia.org/T316896 (10EChetty) [20:22:03] 10Data-Engineering-Planning, 10Data-Engineering-Radar, 10Data Pipelines: Create LVS endpoint for druid-public-overlord (for oozie job indexing) - https://phabricator.wikimedia.org/T180971 (10EChetty) [20:22:10] 10Data-Engineering-Radar: Create LVS endpoint for druid-public-overlord (for oozie job indexing) - https://phabricator.wikimedia.org/T180971 (10EChetty) [20:22:31] 10Data-Engineering, 10Data Pipelines, 10Shared-Data-Infrastructure: Airflow scheduler and webserver logs should be readable by airflow instance admins - https://phabricator.wikimedia.org/T304615 (10EChetty) [20:22:37] 10Data-Engineering, 10Patch-For-Review, 10Product-Analytics (Kanban): Add mediawiki_web_ab_test_enrollment to the allowlist - https://phabricator.wikimedia.org/T323664 (10MNeisler) Hi @mforns, thank for the update! Based on this, I don't think it's worth the additional effort to backfill. The primary data I... [20:22:53] 10Data-Engineering-Planning, 10Data Pipelines, 10Product-Analytics, 10Research: Update HDFS links tables as Mediawiki changes - https://phabricator.wikimedia.org/T304979 (10EChetty) [20:23:05] 10Data-Engineering-Planning, 10Data Pipelines, 10SRE, 10Traffic-Icebox: Mobile redirects drop provenance parameters - https://phabricator.wikimedia.org/T252227 (10EChetty) [20:25:11] 10Data-Engineering-Kanban, 10Data-Engineering-Planning, 10Data Pipelines: Optimization of conda-analytics deb package - https://phabricator.wikimedia.org/T318397 (10EChetty) 05Open→03Declined [20:25:35] 10Data-Engineering-Planning, 10Data Pipelines: Automatically monitor schema changes that would break sqoop - https://phabricator.wikimedia.org/T310824 (10EChetty) [20:29:43] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Add Python Linter Checks to CI - https://phabricator.wikimedia.org/T318346 (10EChetty) p:05Triage→03Medium [20:29:54] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Add Python Linter Checks to CI - https://phabricator.wikimedia.org/T318346 (10EChetty) [20:32:56] 10Data-Engineering-Planning, 10Data Pipelines, 10Epic: Migrate all Cassandra Jobs - https://phabricator.wikimedia.org/T309995 (10EChetty) 05Open→03Resolved [20:38:16] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06), 10Epic: Build Druid Operator - https://phabricator.wikimedia.org/T309996 (10EChetty) [20:38:48] 10Data-Engineering-Planning, 10Data Pipelines (Sprint 05-06): Build Druid Operator - https://phabricator.wikimedia.org/T309996 (10EChetty) [20:38:50] 10Data-Engineering, 10Data Pipelines (Sprint 05-06): Migrate 1+ Druid load jobs - https://phabricator.wikimedia.org/T307508 (10EChetty) [20:48:59] 10Data-Engineering, 10Patch-For-Review, 10Product-Analytics (Kanban): Add mediawiki_web_ab_test_enrollment to the allowlist - https://phabricator.wikimedia.org/T323664 (10mforns) Oh, I see @MNeisler. (Then @EChetty please ignore my previous ping to you!) Yes, let's add the sanitization spec to the allow-lis... [21:03:54] 10Data-Engineering, 10AQS 2.0 Roadmap, 10API Platform (API Platform Roadmap), 10Epic, and 2 others: AQS 2.0:Wikistats 2 service - https://phabricator.wikimedia.org/T288301 (10BPirkle) >>! In T288301#8425714, @odimitrijevic wrote: > I am arriving at the conversation a little late. I am curious about the rea...