Should LLM tech be used to optimize the Gramps Weblate glossary?

The Weblate glossary for Gramps has grown to over 13 thousand strings. That should be enough to communicate most concepts, yet it continues to grow. (When I last checked the count, sometime last year, wasn’t it around 8 thousand?)

Should there be a new step in the source validation process that compares new translatable strings against the existing ones, flags new entries, and recommends a possible equivalent string from the existing glossary?
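Even without an LLM, a first pass could be simple fuzzy matching at validation time. A rough, untested sketch (the example strings and the 0.75 cutoff are just placeholders):

```python
# Suggest an existing string when a new translatable string is close to one
# already in the catalog. difflib is a character-level heuristic; it will not
# catch true re-wordings, but it is cheap and needs no extra dependencies.
from difflib import get_close_matches

existing = [
    "Select the source for the repository",
    "Add a new person",
    "Remove the selected citation",
]
new_string = "Select the source for this repository"

matches = get_close_matches(new_string, existing, n=3, cutoff=0.75)
if matches:
    print("New string:", new_string)
    print("Possible existing equivalents:", matches)
```

Catching genuine re-wordings (different words, same meaning) would need something semantic, as in the next sketch.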

Another LLM-based process might find phrases that are re-wordings of each other and recommend consolidations.
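A minimal sketch of that idea, assuming a sentence-embedding model such as `all-MiniLM-L6-v2` (the model choice, example strings, and the 0.9 threshold are all placeholders, not a tested recipe):

```python
# Compare every pair of existing strings and list near-synonymous pairs as
# consolidation candidates for a human to review.
from itertools import combinations
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

strings = [
    "Remove the selected citation",
    "Delete the selected citation",
    "Open an existing database",
    "Open an existing family tree",
]
embeddings = model.encode(strings, convert_to_tensor=True)
similarity = util.cos_sim(embeddings, embeddings)

for i, j in combinations(range(len(strings)), 2):
    score = float(similarity[i][j])
    if score > 0.9:  # candidate re-wordings; threshold would need tuning
        print(f"{score:.2f}  {strings[i]!r}  <->  {strings[j]!r}")
```

Any matches would only be suggestions; a maintainer would still have to decide whether a wording difference is intentional.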

When looking at the “components” breakdown, the string counts look more like what I had recalled:

  • Gramps 7.2k strings
  • Addons 5.2k strings
  • Web (GrampsWeb) 227 strings
  • Glossary 680 strings

In theory this is a good idea, as long as the processes used can understand the semantics of the strings in question.

Within the Gramps core code, de-duplicating strings makes sense, since it would promote consistency. In the addons repo it may not be as easy, since addon authors might want more autonomy to change strings without affecting all other addons.
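For the core repo, even a non-LLM pass over the generated template might surface the easy cases. A rough sketch using polib (the path, the normalisation rules, and treating `_` as a mnemonic marker are assumptions):

```python
# Group msgids in gramps.pot that differ only in case, whitespace, trailing
# punctuation, or mnemonic underscores; these are the cheapest
# de-duplication candidates to review for consistency.
from collections import defaultdict
import polib

pot = polib.pofile("po/gramps.pot")  # path is a guess for this sketch

def normalise(msgid: str) -> str:
    text = msgid.replace("_", " ").lower()
    return " ".join(text.split()).rstrip(".:")

groups = defaultdict(list)
for entry in pot:
    groups[normalise(entry.msgid)].append(entry.msgid)

for msgids in groups.values():
    if len(set(msgids)) > 1:
        print("Near-duplicates:", sorted(set(msgids)))
```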

Semantic context is an interesting point. Does Weblate provide such context for translators?

I noticed some strings are duplicated with a numeration annotation. Does that mean the string sometimes requires a different translation?
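No idea if that is what the numbering means, but if those entries map to gettext message contexts (msgctxt), then yes: the same English string can need different translations depending on where it appears, and the context keeps them apart. A tiny illustration (the domain, localedir, and context names are made up):

```python
# "Home" as a place type and "Home" as a navigation label may need different
# words in some languages; pgettext selects the right one by context.
import gettext

t = gettext.translation("gramps", localedir="locale",
                        languages=["de"], fallback=True)
print(t.pgettext("place type", "Home"))
print(t.pgettext("navigation", "Home"))
```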
