Multilingual Definition Modeling

Edison Marrese-Taylor; Erica K. Shimomoto; Alfredo Solano; Enrique Reid

doi:10.18653/v1/2025.findings-acl.1328

Multilingual Definition Modeling

Edison Marrese-Taylor, Erica K. Shimomoto, Alfredo Solano, Enrique Reid

Abstract

In this paper, we propose the first multilingual study on definition modeling. We use monolingual dictionary data for four new languages (Spanish, French, Portuguese, and German) and perform an in-depth empirical study to test the performance of pre-trained multilingual language models on definition modeling of monosemic words when finetuned on this data. Furthermore, we use a zero-shot approach to test the multilingual capabilities of two popular chat-based Large Language Models (LLMs) in the task. Results show that multilingual language models can perform on-pair with English but cannot leverage potential cross-lingual synergies, with LLMs generally offering better performance overall. A comprehensive human evaluation of the LLM-generated definition highlights the zero and few-shot capabilities of these models in this new task, also showing their shortcomings. Finally, we show that performance on our task via BERTScore strongly correlates to the performance on multilingual LLM benchmarks, suggesting that our task offers a viable compute-constrained, stable and natural alternative to these.

Anthology ID:: 2025.findings-acl.1328
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 25888–25906
Language:
URL:: https://aclanthology.org/2025.findings-acl.1328/
DOI:: 10.18653/v1/2025.findings-acl.1328
Bibkey:
Cite (ACL):: Edison Marrese-Taylor, Erica K. Shimomoto, Alfredo Solano, and Enrique Reid. 2025. Multilingual Definition Modeling. In Findings of the Association for Computational Linguistics: ACL 2025, pages 25888–25906, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Multilingual Definition Modeling (Marrese-Taylor et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.1328.pdf

PDF Cite Search Fix data