Annotating and Disambiguating the Discourse Usage of the Enclitic dA in Turkish

Ebru Ersöyleyen, Deniz Zeyrek, Fırat Öter


Abstract
The Turkish particle dA is a focus-associated enclitic, and it can act as a discourse connective conveying multiple senses, like additive, contrastive, causal etc. Like many other linguistic expressions, it is subject to usage ambiguity and creates a challenge in natural language automatization tasks. For the first time, we annotate the discourse and non-discourse connnective occurrences of dA in Turkish with the PDTB principles. Using a minimal set of linguistic features, we develop binary classifiers to distinguish its discourse connective usage from its other usages. We show that despite its ability to cliticize to any syntactic type, variable position in the sentence and having a wide argument span, its discourse/non-discourse connective usage can be annotated reliably and its discourse usage can be disambiguated by exploiting local cues.
Anthology ID:
2023.law-1.5
Volume:
Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Jakob Prange, Annemarie Friedrich
Venue:
LAW
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
46–54
Language:
URL:
https://aclanthology.org/2023.law-1.5
DOI:
10.18653/v1/2023.law-1.5
Bibkey:
Cite (ACL):
Ebru Ersöyleyen, Deniz Zeyrek, and Fırat Öter. 2023. Annotating and Disambiguating the Discourse Usage of the Enclitic dA in Turkish. In Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII), pages 46–54, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Annotating and Disambiguating the Discourse Usage of the Enclitic dA in Turkish (Ersöyleyen et al., LAW 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.law-1.5.pdf
Video:
 https://aclanthology.org/2023.law-1.5.mp4