A New English-Dutch-NGT Corpus for the Hospitality Domain

Mirella De Sisto, Vincent Vandeghinste, Dimitar Shterionov


Abstract
One of the major challenges hampering the development of language technology which targets sign languages is the extremely limited availability of good quality data geared towards machine learning and deep learning approaches. In this paper we introduce the NGT-Dutch Hotel Review Corpus (NGT-HoReCo), which addresses this issue by providing multimodal parallel data in English, Dutch and Sign Language of the Netherlands (NGT). The corpus contains 283 hotel reviews in written English, translated into written Dutch and into NGT videos. It will be made publicly available through CLARIN and through the ELG platform.
Anthology ID:
2023.at4ssl-1.4
Volume:
Proceedings of the Second International Workshop on Automatic Translation for Signed and Spoken Languages
Month:
June
Year:
2023
Address:
Tampere, Finland
Editors:
Dimitar Shterionov, Mirella De Sisto, Mathias Muller, Davy Van Landuyt, Rehana Omardeen, Shaun Oboyle, Annelies Braffort, Floris Roelofsen, Fred Blain, Bram Vanroy, Eleftherios Avramidis
Venue:
AT4SSL
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
34–37
Language:
URL:
https://aclanthology.org/2023.at4ssl-1.4
DOI:
Bibkey:
Cite (ACL):
Mirella De Sisto, Vincent Vandeghinste, and Dimitar Shterionov. 2023. A New English-Dutch-NGT Corpus for the Hospitality Domain. In Proceedings of the Second International Workshop on Automatic Translation for Signed and Spoken Languages, pages 34–37, Tampere, Finland. European Association for Machine Translation.
Cite (Informal):
A New English-Dutch-NGT Corpus for the Hospitality Domain (Sisto et al., AT4SSL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.at4ssl-1.4.pdf