From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme

Marie Mikulova, Barbora Štěpánková, Jan Štěpánek


Abstract
In the last decades, computational linguistics has become increasingly interested in annotation schemes that aim at an adequate description of the meaning of the sentences and texts. Discussions are ongoing on an appropriate annotation scheme for a large and complex amount of diverse information. In this contribution devoted to description of polyfunctional uninflected words (namely particles), i.e. words which, although having only one paradigmatic form, can have several different syntactic functions and even express relatively different semantic distinctions, we argue that it is the multi-layer system (linked from meaning to text) that allows a comprehensive description of the relations between morphological properties, syntactic function and expressed meaning, and thus contributes to greater accuracy in the description of the phenomena concerned and to the overall consistency of the annotated data. These aspects are demonstrated within the Prague Dependency Treebank annotation scheme, whose pioneering proposal can be found in the first COLING proceedings from 1965 (Sgall 1965), and to this day, the concept has proved to be sound and serves very well for complex annotation.
Anthology ID:
2025.coling-main.147
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2163–2175
Language:
URL:
https://aclanthology.org/2025.coling-main.147/
DOI:
Bibkey:
Cite (ACL):
Marie Mikulova, Barbora Štěpánková, and Jan Štěpánek. 2025. From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme. In Proceedings of the 31st International Conference on Computational Linguistics, pages 2163–2175, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
From Form to Meaning: The Case of Particles within the Prague Dependency Treebank Annotation Scheme (Mikulova et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.147.pdf