A Universal Dependencies Treebank for Highland Puebla Nahuatl

Robert Pugh, Francis Tyers


Abstract
We present a Universal Dependencies (UD) treebank for Highland Puebla Nahuatl. The treebank is only the second such UD corpus for a Mexican language, and supplements an existing treebank for another Nahuatl variant. We describe the process of data collection, annotation decisions and interesting syntactic constructions, and discuss some similarities and differences between the Highland Puebla Nahuatl treebank and the existing Western Sierra Puebla Nahuatl treebank.
Anthology ID:
2024.naacl-long.76
Volume:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kevin Duh, Helena Gomez, Steven Bethard
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1393–1403
Language:
URL:
https://aclanthology.org/2024.naacl-long.76
DOI:
Bibkey:
Cite (ACL):
Robert Pugh and Francis Tyers. 2024. A Universal Dependencies Treebank for Highland Puebla Nahuatl. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 1393–1403, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
A Universal Dependencies Treebank for Highland Puebla Nahuatl (Pugh & Tyers, NAACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.naacl-long.76.pdf
Copyright:
 2024.naacl-long.76.copyright.pdf