GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary

Fatemah Yousef Almeman; Luis Espinosa Anke

GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary

Fatemah Yousef Almeman, Luis Espinosa Anke

Abstract

Reverse Dictionary (RD) is the task of obtaining the most relevant word or set of words given a textual description or dictionary definition. Effective RD methods have applications in accessibility, translation or writing support systems. Moreover, in NLP research we find RD to be used to benchmark text encoders at various granularities, as it often requires word, definition and sentence embeddings. In this paper, we propose a simple approach to RD that leverages LLMs in combination with embedding models. Despite its simplicity, this approach outperforms supervised baselines in well studied RD datasets, while also showing less overfitting. We also conduct a number of experiments on different dictionaries and analyze how different styles, registers and target audiences impact the quality of RD systems. We conclude that, on average, untuned embeddings alone fare way below an LLM-only baseline (although they are competitive in highly technical dictionaries), but are crucial for boosting performance in combined methods.

Anthology ID:: 2025.coling-main.549
Volume:: Proceedings of the 31st International Conference on Computational Linguistics
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8242–8254
Language:
URL:: https://aclanthology.org/2025.coling-main.549/
DOI:
Bibkey:
Cite (ACL):: Fatemah Yousef Almeman and Luis Espinosa Anke. 2025. GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary. In Proceedings of the 31st International Conference on Computational Linguistics, pages 8242–8254, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary (Almeman & Espinosa Anke, COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-main.549.pdf

PDF Cite Search Fix data