Word boundaries and the morphology-syntax trade-off

Pablo Mosteiro, Damián Blasi


Abstract
This paper investigates the relationship between syntax and morphology in natural languages, focusing on the relation between the amount of information stored by word structure on the one hand, and word order on the other. In previous work, a trade-off between these was observed in a large corpus covering over a thousand languages, suggesting a dynamic ‘division of labor’ between syntax and morphology, as well as yielding proof for the efficient coding of information in language. In contrast, we find that the trade-off can be explained by differing conventions in orthographic word boundaries. We do so by redefining word boundaries within languages either by increasing or decreasing the domain of wordhood implied by orthographic words. Namely, we paste frequent word-pairs together and split words into their frequently occurring component parts. These interventions yield the same trade-off within languages across word domains as what is observed across languages in the orthographic word domain. This allows us to conclude that the original claims on syntax-morphology trade-offs were spurious and that, more importantly, there does not seem to exist a privileged wordhood domain where within- and across-word regularities yield an optimal or optimized amount of information.
Anthology ID:
2025.clrel-1.9
Volume:
Proceedings of the New Horizons in Computational Linguistics for Religious Texts
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Sane Yagi, Sane Yagi, Majdi Sawalha, Bayan Abu Shawar, Abdallah T. AlShdaifat, Norhan Abbas, Organizers
Venues:
CLRel | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
86–93
Language:
URL:
https://aclanthology.org/2025.clrel-1.9/
DOI:
Bibkey:
Cite (ACL):
Pablo Mosteiro and Damián Blasi. 2025. Word boundaries and the morphology-syntax trade-off. In Proceedings of the New Horizons in Computational Linguistics for Religious Texts, pages 86–93, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Word boundaries and the morphology-syntax trade-off (Mosteiro & Blasi, CLRel 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.clrel-1.9.pdf