Findings of the WMT 2024 Shared Task of the Open Language Data Initiative

Jean Maillard, Laurie Burchell, Antonios Anastasopoulos, Christian Federmann, Philipp Koehn, Skyler Wang


Abstract
We present the results of the WMT 2024 shared task of the Open Language Data Initiative. Participants were invited to contribute to the FLORES+ and MT Seed multilingual datasets, two foundational open resources that facilitate the organic expansion of language technology’s reach. We accepted ten submissions covering 16 languages, which extended the range of languages included in the datasets and improved the quality of existing data.
Anthology ID:
2024.wmt-1.4
Volume:
Proceedings of the Ninth Conference on Machine Translation
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
Publisher:
Association for Computational Linguistics
Pages:
110–117
URL:
https://aclanthology.org/2024.wmt-1.4
Cite (ACL):
Jean Maillard, Laurie Burchell, Antonios Anastasopoulos, Christian Federmann, Philipp Koehn, and Skyler Wang. 2024. Findings of the WMT 2024 Shared Task of the Open Language Data Initiative. In Proceedings of the Ninth Conference on Machine Translation, pages 110–117, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Findings of the WMT 2024 Shared Task of the Open Language Data Initiative (Maillard et al., WMT 2024)
PDF:
https://aclanthology.org/2024.wmt-1.4.pdf