Annotating anaphoric phenomena in situated dialogue

Sharid Loáiciga, Simon Dobnik, David Schlangen


Abstract
In recent years several corpora have been developed for vision and language tasks. With this paper, we intend to start a discussion on the annotation of referential phenomena in situated dialogue. We argue that there is still significant room for corpora that increase the complexity of both visual and linguistic domains and which capture different varieties of perceptual and conversational contexts. In addition, a rich annotation scheme covering a broad range of referential phenomena and compatible with the textual task of coreference resolution is necessary in order to take the most advantage of these corpora. Consequently, there are several open questions regarding the semantics of reference and annotation, and the extent to which standard textual coreference accounts for the situated dialogue genre. Working with two corpora on situated dialogue, we present our extension to the ARRAU (Uryupina et al., 2020) annotation scheme in order to start this discussion.
Anthology ID:
2021.mmsr-1.7
Volume:
Proceedings of the 1st Workshop on Multimodal Semantic Representations (MMSR)
Month:
June
Year:
2021
Address:
Groningen, Netherlands (Online)
Editors:
Lucia Donatelli, Nikhil Krishnaswamy, Kenneth Lai, James Pustejovsky
Venue:
MMSR
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
78–88
Language:
URL:
https://aclanthology.org/2021.mmsr-1.7
DOI:
Bibkey:
Cite (ACL):
Sharid Loáiciga, Simon Dobnik, and David Schlangen. 2021. Annotating anaphoric phenomena in situated dialogue. In Proceedings of the 1st Workshop on Multimodal Semantic Representations (MMSR), pages 78–88, Groningen, Netherlands (Online). Association for Computational Linguistics.
Cite (Informal):
Annotating anaphoric phenomena in situated dialogue (Loáiciga et al., MMSR 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.mmsr-1.7.pdf