Adapting Measures of Literality for Use with Historical Language Data

Adam Roussel


Abstract
This paper concerns the adaptation of two existing computational measures relating to the estimation of the literality of expressions to enable their use in scenarios where data is scarce, as is usually the case with historical language data. Being able to determine an expression’s literality via statistical means could support a range of linguistic annotation tasks, such as those relating to metaphor, metonymy, and idiomatic expressions, however making this judgment is especially difficult for modern annotators of historical and ancient texts. Therefore we re-implement these measures using smaller corpora and count-based vectors more suited to these amounts of training data. The adapted measures are evaluated against an existing data set of particle verbs annotated with degrees of literality. The results were inconclusive, yielding low correlations between 0.05 and 0.10 (Spearman’s ρ). Further work is needed to determine which measures and types of data correspond to which aspects of literality.
Anthology ID:
2024.nlp4dh-1.20
Volume:
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities
Month:
November
Year:
2024
Address:
Miami, USA
Editors:
Mika Hämäläinen, Emily Öhman, So Miyagawa, Khalid Alnajjar, Yuri Bizzoni
Venue:
NLP4DH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
209–215
Language:
URL:
https://aclanthology.org/2024.nlp4dh-1.20
DOI:
Bibkey:
Cite (ACL):
Adam Roussel. 2024. Adapting Measures of Literality for Use with Historical Language Data. In Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, pages 209–215, Miami, USA. Association for Computational Linguistics.
Cite (Informal):
Adapting Measures of Literality for Use with Historical Language Data (Roussel, NLP4DH 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nlp4dh-1.20.pdf