DOLFIN - Document-Level Financial Test-Set for Machine Translation

Mariam Nakhle; Marco Dinarelli; Raheel Qader; Emmanuelle Esperança-Rodier; Hervé Blanchon

doi:10.18653/v1/2025.findings-naacl.307

DOLFIN - Document-Level Financial Test-Set for Machine Translation

Mariam Nakhle, Marco Dinarelli, Raheel Qader, Emmanuelle Esperança-Rodier, Hervé Blanchon

Abstract

Despite the strong research interest in document-level Machine Translation (MT), the test-sets dedicated to this task are still scarce. The existing test-sets mainly cover topics from the general domain and fall short on specialised domains, such as legal and financial. Also, despite their document-level aspect, they still follow a sentence-level logic that doesn’t allow for including certain linguistic phenomena such as information reorganisation. In this work, we aim to fill this gap by proposing a novel test-set : DOLFIN. The dataset is built from specialised financial documents and it makes a step towards true document-level MT by abandoning the paradigm of perfectly aligned sentences, presenting data in units of sections rather than sentences. The test-set consists of an average of 1950 aligned sections for five language pairs. We present the detailed data collection pipeline that can serve as inspiration for aligning new document-level datasets. We demonstrate the usefulness and the quality of this test-set with the evaluation of a series of models. Our results show that the test-set is able to discriminate between context-sensitive and context-agnostic models and shows the weaknesses when models fail to accurately translate financial texts. The test-set will be made public for the community.

Anthology ID:: 2025.findings-naacl.307
Volume:: Findings of the Association for Computational Linguistics: NAACL 2025
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5559–5571
Language:
URL:: https://aclanthology.org/2025.findings-naacl.307/
DOI:: 10.18653/v1/2025.findings-naacl.307
Bibkey:
Cite (ACL):: Mariam Nakhle, Marco Dinarelli, Raheel Qader, Emmanuelle Esperança-Rodier, and Hervé Blanchon. 2025. DOLFIN - Document-Level Financial Test-Set for Machine Translation. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 5559–5571, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: DOLFIN - Document-Level Financial Test-Set for Machine Translation (Nakhle et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-naacl.307.pdf

PDF Cite Search Fix data