Definition generation for lexical semantic change detection

Mariia Fedorova, Andrey Kutuzov, Yves Scherrer


Abstract
We use contextualized word definitions generated by large language models as semantic representations in the task of diachronic lexical semantic change detection (LSCD). In short, generated definitions are used as ‘senses’, and the change score of a target word is retrieved by comparing their distributions in two time periods under comparison. On the material of five datasets and three languages, we show that generated definitions are indeed specific and general enough to convey a signal sufficient to rank sets of words by the degree of their semantic change over time. Our approach is on par with or outperforms prior non-supervised sense-based LSCD methods. At the same time, it preserves interpretability and allows to inspect the reasons behind a specific shift in terms of discrete definitions-as-senses. This is another step in the direction of explainable semantic change modeling.
Anthology ID:
2024.findings-acl.339
Volume:
Findings of the Association for Computational Linguistics ACL 2024
Month:
August
Year:
2024
Address:
Bangkok, Thailand and virtual meeting
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5712–5724
Language:
URL:
https://aclanthology.org/2024.findings-acl.339
DOI:
Bibkey:
Cite (ACL):
Mariia Fedorova, Andrey Kutuzov, and Yves Scherrer. 2024. Definition generation for lexical semantic change detection. In Findings of the Association for Computational Linguistics ACL 2024, pages 5712–5724, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
Definition generation for lexical semantic change detection (Fedorova et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-acl.339.pdf