[00:05:21] does wikidata have edit filters like WP does [00:08:45] Dragonfly6-7: yes, it does [00:09:10] thank you [00:09:45] you are welcome [12:16:24] Jarnsax: oh god this is a mess [12:16:39] maybe the field should just be free text [12:16:53] or something like [A-Z][A-Za-z. [12:16:54] whoops [12:17:15] [A-Z][A-Za-z. ]+[\d-]+\d [14:42:39] inductiveload, Yeah, you're looking at what, half a dozen different systems used over time? [14:45:16] Anyway, normalising later would be easy enough [14:45:41] The important thing is that we have some place, any place, to store the data in the first place [14:46:07] Oh yes.. something I've wished existed before. [14:46:40] It also seems the loc themselves don't have a canonical format [14:47:25] inductiveload, It raises the odds that I'll go back to transcribing the early CCE and copying the data here. :) [14:57:51] https://www.wikidata.org/wiki/Q754656 <- I think this (and it's parts) are about as OCD as I can get about describing a university. [15:01:11] Hopefully giving targets for the 'affiliation' of journal article authors. [15:01:51] "RAIC Centre for Architecture" or whatever [19:11:12] Jarnsax: hmm, I guess we also need a VCC catalog entry property [19:11:43] I'd wait on that till they get a final version, there are no stable links yet [19:11:46] annoyingly their beta webapp thingy doesn't give you an obvious URL for a specific card [19:11:55] lol [19:11:58] indeed [19:12:04] This is interesting... https://www.nypl.org/blog/2019/05/31/us-copyright-history-1923-1964 [19:12:18] as xml on github, apparently. [19:12:24] i kinda thought they'd have a unique ID already [19:12:50] https://github.com/NYPL/catalog_of_copyright_entries_project [19:12:59] https://github.com/NYPL/cce-renewals [19:13:29] cool [19:13:47] i guess what we really need is to make sure we have a Q-id for every volume of the CCE [19:13:59] then we can reference to the volume + page [19:14:09] I think they are all on commons. [19:14:27] yes I think they are [19:14:40] but they're not organised in any major way AFAIK [19:14:57] Yeah, I just meant so 'they exist' on wd [19:16:18] i guess we need an "edition" item for each one [19:16:30] well a "volume" actually [19:16:55] I'd be dubious about any attempt to injest that without proofreading it, and linking authors publisher etc, otherwise just more stuff to get misconnected later. [19:16:56] (c.f. pending questions on how that even gets modelled) [19:17:21] ingest what? [19:17:27] the xml [19:17:27] the list of volumes? [19:17:40] you mean the NYPL XML? [19:17:44] yeah [19:18:11] indeed [19:18:24] adding thousands (millions) of entries with 'author name string' would suck. [19:18:31] i just mean we should have a WD item for each physical volume of the CCE [19:18:38] Oh, yeah [19:18:54] then we can add the commons files, IA IDs, hathi IDs, etc [19:19:13] and then we can use them as references to other claims [19:19:25] most volumes of the CCE include the LCCN, usually in some old obscure format [19:21:58] If doing this for the CCE, it would be nice to do it for the NUC as well, since everything in the CCE should be in it, and have actual biblio data [19:22:29] is there an online portal? [19:22:35] No, lol. [19:22:48] i'm not really after a massive import project (not that I'd say no) [19:22:57] https://en.wikipedia.org/wiki/National_Union_Catalog#/media/File:NUC-Folger.jpg [19:23:00] (lol) [19:23:12] what I really want is to support WS operatons [19:23:54] Yeah, I just meant, if looking it up in the CCE, might as well also look it up in the NUC scans (which are much easier to work with) [19:24:13] where are the NUC scans? [19:24:30] we should totally have a portal for that at WS [19:24:34] https://catalog.hathitrust.org/Record/000140237 [19:25:29] the, lol, 'portal' for the NUC is basically the 'from old catalog' entries in the loc catalog [19:25:36] fffffffffffffffffffffffffff [19:26:24] i mean Portal: at WS [19:26:57] https://crownlibrary.wordpress.com/2014/12/03/national-union-catalog-christmas-tree-2014/ [19:27:04] how much did that catalog cost [19:27:23] Oh gawd, I can't imagine, the compilation project took decades. [19:27:46] "as of 2005, an estimated 27% of books from before 1956 only be found in the National Union Catalog!" [19:28:14] stuff that people didn't register, so the loc didn't acquire it [19:40:12] bloody deep state librarians [19:40:19] seeking grants [23:47:24] I'm happy to report that the Alberta School of Dog Grooming does not exist on wikidata, lol.