(function(doc, html, url) { var widget = doc.createElement("div"); widget.innerHTML = html; var script = doc.currentScript; // e = a.currentScript; if (!script) { var scripts = doc.scripts; for (var i = 0; i < scripts.length; ++i) { script = scripts[i]; if (script.src && script.src.indexOf(url) != -1) break; } } script.parentElement.replaceChild(widget, script); }(document, '

Making Dictionaries and Glossaries Work Together: Towards a Unified Language Data Framework

What is it about?

This paper looks at how we can make digital dictionaries and term databases (like glossaries) work better together across different computer systems and languages. Two important international standards—LMF (Lexical Markup Framework) and TMF (Terminological Markup Framework)—already help with organising and sharing this kind of language data. However, they were created for slightly different purposes. Our paper shows how these two systems can be brought closer together and proposes a new, unified framework to make them easier to use together. This would help researchers, translators, and software developers to build better tools for language learning, translation, and information management in multiple languages. We also highlight the benefits and challenges of combining these systems and suggest ways forward for future improvements.

Why is it important?

As language technologies rapidly evolve, there is an urgent need for consistent and interoperable ways to represent lexical and terminological data. While ISO standards such as LMF and TMF provide robust frameworks individually, their coexistence has led to fragmentation in how dictionaries and terminological databases are developed and maintained. This paper is the first to offer a systematic proposal for a Unified Markup Framework (UMF) that bridges the gap between these two standards. By aligning LMF and TMF through a shared meta-model and real-world use cases, our work provides a timely solution to enhance interoperability across digital language resources. This has the potential to improve the efficiency and scalability of multilingual applications, including machine translation, language learning tools, and digital lexicography, ultimately making language data more accessible, reusable, and future-proof.

Read more on Kudos…
The following have contributed to this page:
Federica Vezzani, Giorgio Maria Di Nunzio, and Ana Salgado
' ,"url"));