Rute Costa


2024

pdf bib
CHAMUÇA: Towards a Linked Data Language Resource of Portuguese Borrowings in Asian Languages
Fahad Khan | Ana Salgado | Isuri Anuradha | Rute Costa | Chamila Liyanage | John P. McCrae | Atul Kr. Ojha | Priya Rani | Francesca Frontini
Proceedings of the 9th Workshop on Linked Data in Linguistics @ LREC-COLING 2024

This paper presents the development of CHAMUÇA, a novel lexical resource designed to document the influence of the Portuguese language on various Asian languages, with an initial focus on the languages of South Asia. Through the utilization of linked open data and the OntoLex vocabulary, CHAMUÇA offers structured insights into the linguistic characteristics, and cultural ramifications of Portuguese borrowings across multiple languages. The article outlines CHAMUÇA’s potential contributions to the linguistic linked data community, emphasising its role in addressing the scarcity of resources for lesser-resourced languages and serving as a test case for organising etymological data in a queryable format. CHAMUÇA emerges as an initiative towards the comprehensive catalogization and analysis of Portuguese borrowings, offering valuable insights into language contact dynamics, historical evolution, and cultural exchange in Asia, one that is based on linked data technology.

2022

pdf bib
Proceedings of the Workshop on Terminology in the 21st century: many faces, many places
Rute Costa | Sara Carvalho | Ana Ostroški Anić | Anas Fahad Khan
Proceedings of the Workshop on Terminology in the 21st century: many faces, many places

2020

pdf bib
Challenges of Word Sense Alignment: Portuguese Language Resources
Ana Salgado | Sina Ahmadi | Alberto Simões | John Philip McCrae | Rute Costa
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)

This paper reports on an ongoing task of monolingual word sense alignment in which a comparative study between the Portuguese Academy of Sciences Dictionary and the Dicionário Aberto is carried out in the context of the ELEXIS (European Lexicographic Infrastructure) project. Word sense alignment involves searching for matching senses within dictionary entries of different lexical resources and linking them, which poses significant challenges. The lexicographic criteria are not always entirely consistent within individual dictionaries and even less so across different projects where different options may have been assumed in terms of structure and especially wording techniques of lexicographic glosses. This hinders the task of matching senses. We aim to present our annotation workflow in Portuguese using the Semantic Web technologies. The results obtained are useful for the discussion within the community.

2019

pdf bib
TemPO : towards a conceptualisation of pathology in speech and language therapy
Frédérique Brin-Henry | Rute Costa | Sylvie Desprès
Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Terminologie et Intelligence Artificielle (atelier TALN-RECITAL \& IC)

pdf bib
Les noms des artefacts d’éclairage : extraction et représentation des termes portugais et espagnols dans l’archéologie de l’al-Andalus [The names of lighting artefacts: extraction and representation of Portuguese and Spanish terms in the archaeology of al-Andalus]
Bruno Almeida | Rute Costa | Christophe Roche
Traitement Automatique des Langues, Volume 60, Numéro 3 : TAL et humanités numériques [NLP and Digital Humanities]

2004

bib
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)
Maria Teresa Lino | Maria Francisca Xavier | Fátima Ferreira | Rute Costa | Raquel Silva
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

pdf bib
The Verb in the Terminological Collocations. Contribution to the Development of a Morphological Analyser: MorphoCom
Rute Costa | Raquel Silva
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)