Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative Functions

Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos


Abstract
ISO 24617-2, the ISO standard for dialog act annotation, sets the ground for more comparable research in the area. However, the amount of data annotated according to it is still reduced, which impairs the development of approaches for automatic recognition. In this paper, we describe a mapping of the original dialog act labels of the LEGO corpus, which have been neglected, into the communicative functions of the standard. Although this does not lead to a complete annotation according to the standard, the 347 dialogs provide a relevant amount of data that can be used in the development of automatic communicative function recognition approaches, which may lead to a wider adoption of the standard. Using the 17 English dialogs of the DialogBank as gold standard, our preliminary experiments have shown that including the mapped dialogs during the training phase leads to improved performance while recognizing communicative functions in the Task dimension.
Anthology ID:
2020.lrec-1.67
Volume:
Proceedings of the 12th Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
531–539
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.67
DOI:
Bibkey:
Cite (ACL):
Eugénio Ribeiro, Ricardo Ribeiro, and David Martins de Matos. 2020. Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative Functions. In Proceedings of the 12th Language Resources and Evaluation Conference, pages 531–539, Marseille, France. European Language Resources Association.
Cite (Informal):
Mapping the Dialog Act Annotations of the LEGO Corpus into ISO 24617-2 Communicative Functions (Ribeiro et al., LREC 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.lrec-1.67.pdf