Proceedings of the 5th Conference on Language, Data and Knowledge: The 5th OntoLex Workshop

Katerina Gkirtzou, Slavko Žitnik, Jorge Gracia, Dagmar Gromann, Maria Pia di Buono, Johanna Monti, Maxim Ionov (Editors)


Anthology ID:
2025.ontolex-1
Month:
September
Year:
2025
Address:
Naples, Italy
Venues:
ontolex | WS
SIG:
Publisher:
Unior Press
URL:
https://aclanthology.org/2025.ontolex-1/
DOI:
ISBN:
978-88-6719-334-9
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
https://aclanthology.org/2025.ontolex-1.pdf

pdf bib
Proceedings of the 5th Conference on Language, Data and Knowledge: The 5th OntoLex Workshop
Katerina Gkirtzou | Slavko Žitnik | Jorge Gracia | Dagmar Gromann | Maria Pia di Buono | Johanna Monti | Maxim Ionov

pdf bib
Inferring Adjective Hypernyms with Language Models to Increase the Connectivity of Open English Wordnet
Lorenzo Augello | John Philip McCrae

Open English Wordnet is a key resource published in OntoLex-lemon as part of the linguistic linked open data cloud. There are, however, many links missing in the resource, and in this paper, we look at how we can establish hyper-ymy between adjectives. We present a theoretical discussion of the hypernymy relation and how it differs for adjectives in contrast to nouns and verbs. We develop a new resource for adjective hypernymy and fine-tune large language models to predict adjective hypernymy, showing that the methodology of TaxoLLaMa can be adapted to this task.

pdf bib
Bringing IATE into the Semantic Web Family
Paula Diez Ibarbia | Patricia Martín Chozas | Elena Montiel Ponsoda

This paper is an extension of previous work by the authors and other researchers that studies the application of the OntoLex-lemon model for representing the InterActive Terminology for Europe (IATE) database in the Semantic Web. While traditional XML-based approaches have been effective for multilingual terminological work, the Semantic Web enables richer, more interoperable representations. The study evaluates the suitability of OntoLex-lemon for modeling IATE’s complex structure and identifies limitations in existing vocabularies. To address these, this paper tries to identify orher existing vocabularies and ontologies that could satisfy those limitations, which include term reliability, regional usage, lifecycle statuses, lookup forms, and concept cross-references. Still, some representation requirements are not covered by existing vocabularies and may need to be further discussed within the community.

pdf bib
Ontologies for historical languages: using the LiLa and OntoLex-Lemon framework to build a Lemma Bank for Old Irish
Theodorus Fransen

This paper presents a Linked Data approach to digitising and structuring Old Irish linguistic resources using the Lila (Linking Latin) ontology, which is itself largely based on the OntoLex-Lemon framework (Cimiano et al., 2016). Old Irish, as an ancient Celtic language with fragmented textual traditions, presents unique challenges for the creation and interoperability of digital resources. This work is part of the MOLOR project, whose aim is to create a knowledge base for Old Irish by interlinking texts, lexicons, and inflectional data. The first step in this ambitious endeavour is described here: the creation of an RDF linguistic Linked Data hub known as a Lemma Bank, similar to the one created as part of the LiLa project, addressing specific linguistic challenges and opportunities while adhering to the LiLa ontology.

pdf bib
A Lightweight String Based Method of Encoding Etymologies in Linked Data Lexical Resources
Anas Fahad Khan | Maxim Ionov | Paola Marongiu | Ana Salgado

In this submission we propose an approach to encoding etymological information as strings (“etymology strings”). We begin by discussing the advantages of such an approach with respect to one in which etymologies and etymons are explicitly represented as RDF individuals. Next we give a formal description of the regular language underlying our approach as an Extended Backus-Naur Form grammar (EBNF). We use the Chamuça Hindi lexicon as a test case for our approach and show some of the kinds of SPARQL queries which can be made using etymological strings.

pdf bib
Ontolex-Lemon in Wikidata and other Wikibase instances
David Lindemann

This paper provides insight into how the core elements of the Ontolex-Lemon model are inte- grated in the Wikibase Ontology, the data model fundamental to any instance of the Wikibase software, including Wikidata lexemes, which today is probably the largest Ontolex-Lemon use case, a dataset collaboratively built by the community of Wikidata users. We describe how lexical entries are modeled on a Wikibase, including the linguistic description of lexemes, the linking of lexical entries, lexical senses and lexical forms across resources, and links across the domain of lexemes and the ontological part of a Wikibase knowledge graph. Our aim is to present Wikibase as a solution for storing and collaboratively editing lexical data follow- ing Semantic Web standards, and to identify relevant research questions to be addressed in future work.

pdf bib
Philosophising Lexical Meaning as an OntoLex-Lemon Extension
Veruska Zamborlini | Jiaqi Zhu | Marieke van Erp | Arianna Betti

OntoLex-Lemon is a model for representing lexical information, focusing on the use of lexical entries in texts rather than their definitions. This work proposes an extension to the model that aims to capture the definition of senses attributed to lexical entries. We explicitly represent a conceptual setup authored by an agent that operates on lexical content. It either proposes new senses for existing lexical entries in a language or coins new terms to express proposed senses. It provides textual and/or formal definitions to senses/concepts, and can serve as an interpretation of other senses/concepts through rephrasing, translation, formalization, or comparison. Because a conceptual setup and its interpretations may not be unanimously accepted, it is important to support the selection of relevant meanings, as for example, those proposed by a certain author. We illustrate the application of our proposed extension with two case studies, one about the philosophical definition of the concept of idea and its interpretations, and one about historical attributions of meaning to the Dutch East India Company (VOC).