
User talk:VIGNERON


Triple dictionary


Hello Nicolas! As part of a research project with the University, @DL2204 has been working on an OCR process for the first Basque dictionary, the Hiztegi_Hirukoitza by Larramendi. It contains translations of words from Spanish into Basque and Latin. He is currently uploading every page from the OCR, and many of the words will be normalized using external software and linked via Wikidata Lexemes (d:Lexeme:L48666#F2). I am writing to you because I would like your expertise on both Lexemes and Wikisource: I think we can use this work in a variety of ways that are not being used at the moment. Could you look at any page of the dictionary and give us some ideas? I can make mass changes using bots, so we can imagine new ways of representing words.

Thanks! -Theklan (talk) 09:45, 30 October 2020 (UTC)

Egunon Theklan,
Wonderful idea. I'm working on a similar idea for Breton dictionaries on Wikisource too, but I do almost everything by hand. A more automated process would be wonderful! For the Wikisource part, I'll look into it.
For the Lexeme part, the model of d:Lexeme:L48666#F2 is interesting and good. I entered the same kind of information but the other way around, see d:Lexeme:L69#P1343 (on the Lexeme, P1343 for "the dictionary where it's found", with P5830 as a qualifier for the form and further qualifiers such as "page", P304). What do you think? Both are equivalent, so which model should we prefer? (Or maybe both? But that would be redundant and overkill, no?)
Cheers, VIGNERON (talk) 10:03, 30 October 2020 (UTC)
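For readers who want to see the shape of the second model (P1343 on the Lexeme, qualified with P5830 and P304), here is a minimal Python sketch of such a statement in Wikibase JSON. It is only an illustration, not what either editor actually ran: the function name is made up, Q4115189 (the Wikidata sandbox item) stands in for the dictionary item, and the form ID "L69-F1" and page "37" are placeholder values. It has not been checked against the live API, which also expects a statement GUID when saving.

```python
import json


def described_by_source_claim(dictionary_qid, form_id, page):
    """Build a P1343 statement with P5830 (form) and P304 (page) qualifiers.

    Hypothetical helper: it only assembles the JSON structure and does not
    talk to Wikidata.
    """
    def snak(prop, value_type, value):
        # One "value" snak in the Wikibase JSON layout.
        return {
            "snaktype": "value",
            "property": prop,
            "datavalue": {"value": value, "type": value_type},
        }

    return {
        "type": "statement",
        "rank": "normal",
        # Main value: the dictionary in which the lexeme is found.
        "mainsnak": snak(
            "P1343", "wikibase-entityid",
            {"entity-type": "item", "id": dictionary_qid},
        ),
        "qualifiers": {
            # Which form of the lexeme appears in the dictionary.
            "P5830": [snak(
                "P5830", "wikibase-entityid",
                {"entity-type": "form", "id": form_id},
            )],
            # Page number, stored as a plain string.
            "P304": [snak("P304", "string", page)],
        },
    }


# Placeholder values only (sandbox item, assumed form ID and page).
claim = described_by_source_claim("Q4115189", "L69-F1", "37")
print(json.dumps(claim, indent=2))
```

The first model (a statement on the form pointing back to the dictionary) would carry the same three pieces of information, just attached to the form instead of the lexeme.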
I have this dream: d:Lexeme:L48666#F2. But I don't know if it would be feasible now that we have the text uploaded. -Theklan (talk) 14:43, 30 October 2020 (UTC)
Ah nope nope, P973 is really overkill: the same link is already on d:Q65216433, so there is no need to put it twice (and if you really want to see it directly, a simple JS script can retrieve and display it).
I took a look at Orrialde:Larramendi 1745 dictionary body.pdf/2. The formatting is a bit odd. The </br> tags are not needed, and the indentation with : is unusual. On the French Wikisource, we have the template fr:Modèle:AlinéaNégatif to do this kind of indentation; see fr:Page:Henry - Lexique étymologique du breton moderne.djvu/37 for how I used it.
Cheers, VIGNERON (talk) 14:53, 30 October 2020 (UTC)
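As a rough illustration of the "retrieve it with a script" idea, the sketch below pulls the P973 values out of an entity's JSON. It is written in Python rather than JS purely for illustration, the function name is hypothetical, and the sample entity and its URL are invented. A real script would first fetch the entity JSON (for example from Special:EntityData) before calling it.

```python
def described_at_urls(entity):
    """Return every P973 URL found in a Wikibase entity dict.

    Hypothetical helper: expects the usual entity JSON layout with a
    "claims" mapping, and skips "novalue"/"somevalue" snaks.
    """
    urls = []
    for statement in entity.get("claims", {}).get("P973", []):
        snak = statement.get("mainsnak", {})
        if snak.get("snaktype") == "value":
            # For URL properties the datavalue's value is a plain string.
            urls.append(snak["datavalue"]["value"])
    return urls


# Invented sample data in the entity JSON layout (not the real item).
sample_entity = {
    "claims": {
        "P973": [
            {"mainsnak": {"snaktype": "value", "property": "P973",
                          "datavalue": {"value": "https://example.org/entry",
                                        "type": "string"}}},
            {"mainsnak": {"snaktype": "novalue", "property": "P973"}},
        ]
    }
}
print(described_at_urls(sample_entity))
```

Because the link lives on the item, a display script of this kind avoids copying the same P973 value onto every form.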
If you click on the link, it goes directly to the word in this case. -Theklan (talk) 14:56, 30 October 2020 (UTC)
Yes, I did notice the anchor, but you can still infer and create it automatically.
I hadn't seen the def template here; this is the ideal formatting.
Cheers, VIGNERON (talk) 15:02, 30 October 2020 (UTC)