The ‘aftermath’ of compounds: Investigating Compounds and their Semantic Representations

Swarang Joshi


Abstract

This study investigates how well computational embeddings align with human semantic judgments in the processing of English compound words. We compare static word vectors (GloVe) and contextualized embeddings (BERT) against human ratings of lexeme meaning dominance (LMD) and semantic transparency (ST) drawn from a psycholinguistic dataset. Using measures of association strength (Edinburgh Associative Thesaurus), frequency (BNC), and predictability (LaDEC), we compute embedding-derived LMD and ST metrics and assess their relationships with human judgments via Spearman’s correlation and regression analyses. Our results show that BERT embeddings capture compositional semantics better than GloVe, and that predictability ratings are strong predictors of semantic transparency in both human and model data. These findings advance computational psycholinguistics by clarifying the factors that drive compound word processing, and they offer insights into embedding-based semantic modeling.
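
As a rough illustration of the kind of comparison the abstract describes, the sketch below scores each compound with an embedding-derived transparency proxy and correlates those scores with human ratings using Spearman's rank correlation. The proxy (mean cosine similarity between the compound vector and its two constituent vectors) is an assumption made for illustration and may differ from the paper's actual ST and LMD definitions. The function names, the toy 3-dimensional vectors, and the ratings are all hypothetical and are not taken from the paper or its datasets.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def transparency_proxy(compound_vec, first_vec, second_vec):
    """Hypothetical ST proxy: mean similarity of the compound to each constituent."""
    return (cosine(compound_vec, first_vec) + cosine(compound_vec, second_vec)) / 2

def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(ranks(x), ranks(y))

# Made-up toy data: (compound, first constituent, second constituent) vectors.
toy_vectors = [
    ([0.9, 0.1, 0.2], [1.0, 0.0, 0.1], [0.8, 0.2, 0.3]),
    ([0.1, 0.9, 0.3], [0.9, 0.1, 0.0], [0.2, 0.8, 0.4]),
    ([0.2, 0.1, 0.9], [0.9, 0.2, 0.1], [0.8, 0.3, 0.0]),
]
toy_human_ratings = [6.5, 4.0, 1.5]  # made-up transparency ratings

model_scores = [transparency_proxy(c, a, b) for c, a, b in toy_vectors]
rho = spearman(model_scores, toy_human_ratings)
print(f"Spearman rho on toy data: {rho:.3f}")
```

In practice the vectors would come from a GloVe lookup or from pooled BERT hidden states, and a library routine such as `scipy.stats.spearmanr` would replace the hand-written rank correlation; the standard-library version here only keeps the sketch self-contained.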

Anthology ID: 2025.ijcnlp-srw.27
Volume: The 14th International Joint Conference on Natural Language Processing and The 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Month: December
Year: 2025
Address: Mumbai, India
Editors: Santosh T.y.s.s, Shuichiro Shimizu, Yifan Gong
Venue: IJCNLP
Publisher: Association for Computational Linguistics
Pages: 322–328
URL: https://aclanthology.org/2025.ijcnlp-srw.27/
Cite (ACL): Swarang Joshi. 2025. The ‘aftermath’ of compounds: Investigating Compounds and their Semantic Representations. In The 14th International Joint Conference on Natural Language Processing and The 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, pages 322–328, Mumbai, India. Association for Computational Linguistics.
Cite (Informal): The ‘aftermath’ of compounds: Investigating Compounds and their Semantic Representations (Joshi, IJCNLP 2025)
PDF: https://aclanthology.org/2025.ijcnlp-srw.27.pdf
