[00:40:33] spam link
[00:49:17] I wasn't quite sure with the logo having the meta logo in the O of their logo
[00:51:50] oh true. could be legit maybe, sry
[00:52:48] that would be https://afrocrowd.org/ then...
[00:54:07] still wondering what's their affiliation with Wikimedia though to use that logo...
[00:54:34] yeah, https://twitter.com/afroCROWDit implies that there is some connection, eh
[00:54:34] mmm
[00:54:42] It feels offtopic for the channel, anyway
[00:54:51] afrocrowd is a recognized affiliate: [[m:AfroCROWD]]
[00:55:02] but the poster was still offtopic for this group, yeah
[09:42:10] https://www.mediawiki.org has a section "Set up and run MediaWiki". I'm curious what's unclear. (re @ofnotbeing: I would be very grateful if anyone can help)
[16:12:23] I made a thing! It's a Wikipedia Year In Review tool. Apologies in advance for those of you who have half a million edits - it will be a bit slow :) https://wikipediayir.netlify.com/ Let me know what you think!
[16:41:43] I created this datamodel today for the Riksdagen open data to sentences project and I would like some feedback from the community.
[16:41:44]
[16:41:45] Basically the idea is to analyze all 160k documents and store every single unique rawtoken and sentence in a database.
[16:41:47] This is going to be a huge database which I'm not sure ToolsDB can handle (WMF recommends Trove for databases >125 GB).
[16:41:48]
[16:41:50] I want to store normalized tokens and later I want to link the raw tokens to Wikidata Lexeme Form IDs.
[16:41:51]
[16:41:53] I'm curious to see:
[16:41:54] * How many unique rawtokens vs normalized tokens we see on average in the documents.
[16:41:56] * How many of the raw tokens can be found in Wikidata currently (lexeme form coverage)
[16:41:57] * Which are the most common raw tokens which are currently missing in Wikidata as forms?
[16:41:59]
[16:42:00] The different tables are explained in the UML here:
[16:42:02] https://github.com/dpriskorn/riksdagen_sentences/blob/save_to_database/diagrams/datamodel.puml
[16:42:03]
[16:42:05] See https://github.com/dpriskorn/riksdagen_sentences/discussions/5 for an open space for discussions
[16:43:52] I created this datamodel today for the Riksdagen open data to sentences project and I would like some feedback from the community. See https://github.com/dpriskorn/riksdagen_sentences/discussions/5 for the model : https://tools-static.wmflabs.org/bridgebot/a2fdb698/file_55957.jpg
[16:54:23] Jdlrobson, it's lovely! Any chance it could be localized?
[16:55:07] It also has a few minor RTL bugs I can possibly fix if I can see the source.
[17:20:26] https://github.com/jdlrobson/wikipedia-year-in-review (re @amire80: It also has a few minor RTL bugs I can possibly fix if I can see the source.)
[18:06:59] @amire80 yeh it could be localized, I just haven't got round to that yet (setting it up on translatewiki would be a huge help if someone wants to kick that off)
[18:08:31] It is a static web app so the build script would need to build various versions
[18:50:57] This could probably be moved to Toolforge?
[19:24:07] very nice! (re @wmtelegram_bot: I made a thing! It's a Wikipedia Year In Review tool. Apologies in advance for those of you who have half a million ...)
[19:54:40] @Ladsgroup yes eventually but right now I need the hassle free instant deploys with Netlify. I do need to make use of the Redis server to reduce the amount of API hits needed.
[20:15:31] Generate your own Year!
wikipediayir.netlify.app : https://tools-static.wmflabs.org/bridgebot/829324f7/file_55963.jpg
[21:13:10] Nice! I was wondering whether it would work on other projects and tried with Wikidata - happy to say it works just fine! :) (re @wmtelegram_bot: I made a thing! It's a Wikipedia Year In Review tool. Apologies in advance for those of you who have half a million ...)
[22:35:51] Oh, I'd love to use it for Wikidata (re @JeanFred: Nice ! I was wondering whether it would work on other projects and tried with Wikidata - happy to say it works just fine ! :))