Building MT systems in low resourced languages for Public Sector users in Croatia, Iceland, Ireland, and Norway

Róisín Moran, Carla Para Escartín, Akshai Ramesh, Páraic Sheridan, Jane Dunne, Federico Gaspari, Sheila Castilho, Natalia Resende, Andy Way


Abstract
When developing Machine Translation engines, low resourced language pairs tend to be in a disadvantaged position: less available data means that developing robust MT models can be more challenging. The EU-funded PRINCIPLE project aims at overcoming this challenge for four low resourced European languages: Norwegian, Croatian, Irish and Icelandic. This presentation will give an overview of the project, with a focus on the set of Public Sector users and their use cases for which we have developed MT solutions. We will discuss the range of language resources that have been gathered through contributions from public sector collaborators, and present the extensive evaluations that have been undertaken, including significant user evaluation of MT systems across all of the public sector participants in each of the four countries involved.
Anthology ID:
2021.mtsummit-up.25
Volume:
Proceedings of Machine Translation Summit XVIII: Users and Providers Track
Month:
August
Year:
2021
Address:
Virtual
Editors:
Janice Campbell, Ben Huyck, Stephen Larocca, Jay Marciano, Konstantin Savenkov, Alex Yanishevsky
Venue:
MTSummit
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
353–381
Language:
URL:
https://aclanthology.org/2021.mtsummit-up.25
DOI:
Bibkey:
Cite (ACL):
Róisín Moran, Carla Para Escartín, Akshai Ramesh, Páraic Sheridan, Jane Dunne, Federico Gaspari, Sheila Castilho, Natalia Resende, and Andy Way. 2021. Building MT systems in low resourced languages for Public Sector users in Croatia, Iceland, Ireland, and Norway. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track, pages 353–381, Virtual. Association for Machine Translation in the Americas.
Cite (Informal):
Building MT systems in low resourced languages for Public Sector users in Croatia, Iceland, Ireland, and Norway (Moran et al., MTSummit 2021)
Copy Citation:
Presentation:
 2021.mtsummit-up.25.Presentation.pdf