Lankide eztabaida:VIGNERON
Gehitu atalaTriple dictionary
[aldatu]Hello Nicolas! As a research project with the University, @DL2204: has been working in a OCR process for the first Basque Dictionary, the Hiztegi_Hirukoitza by Larramendi. It contains translation of words from Spanish into Basque and Latin. Currently he is uploading every page from the OCR and many of the words will be normalized using some external software and linked via Wikidata Lexemes (d:Lexeme:L48666#F2). I write you because I want your expertise both on Lexemes and Wikisource, because I think we can use this work in a variety of forms that currently are not being used. Can you look into any page of the dictionary and give us some ideas? I can make mass changes using bots, so we can imagine new ways of representing words.
Thanks! -Theklan (eztabaida) 09:45, 30 urria 2020 (UTC)
- Egunon Theklan,
- Wonderful idea. I'm working on some similar idea for Breton dictionaries on Wikisource too but I do almost everything by hand. A more automated process would be wonderful! For the Wikisource part, I'll look into it.
- For the Lexeme part, the model of d:Lexeme:L48666#F2 is interresting and good. I entered the same kind of information but the other way around, see d:Lexeme:L69#P1343 (Lexeme P1343 "the dictionary where it's found", P5830 as qualifier for the form and more qualifier like "page" P304). What do you think? Both are equivalent, which model should we prefer? (or maybe both? but that would be redundant and overkill, no?).
- Cheers, VIGNERON (eztabaida) 10:03, 30 urria 2020 (UTC)
- I have this dream: d:Lexeme:L48666#F2. But I don't know if it would be feasible now that we have the text uploaded. -Theklan (eztabaida) 14:43, 30 urria 2020 (UTC)
- Ah nope nope, P973 is really overkill, the same link is already on d:Q65216433, no need to put it twice (and if you really want to see it directly, a simple js script can retrieve and display it).
- I took a look at Orrialde:Larramendi 1745 dictionary body.pdf/2. The formatting is a bit weird. All the
</br>
are not needed and the indentation with : is weird. On the French Wikisource, we have this template fr:Modèle:AlinéaNégatif to to such indentation, see fr:Page:Henry - Lexique étymologique du breton moderne.djvu/37 for how I used it. - Cheers VIGNERON (eztabaida) 14:53, 30 urria 2020 (UTC)
- If you click on the link it goes directly to the word in this case. -Theklan (eztabaida) 14:56, 30 urria 2020 (UTC)
- Yes, I did noticed the anchor, but you can still infer and create it automatically.
- I didn't saw the def template here, this is the ideal formatting.
- Cheers, VIGNERON (eztabaida) 15:02, 30 urria 2020 (UTC)
- If you click on the link it goes directly to the word in this case. -Theklan (eztabaida) 14:56, 30 urria 2020 (UTC)
- I have this dream: d:Lexeme:L48666#F2. But I don't know if it would be feasible now that we have the text uploaded. -Theklan (eztabaida) 14:43, 30 urria 2020 (UTC)