2024
pdf
bib
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
Carolina Scarton
|
Charlotte Prescott
|
Chris Bayliss
|
Chris Oakley
|
Joanna Wright
|
Stuart Wrigley
|
Xingyi Song
|
Edward Gow-Smith
|
Rachel Bawden
|
Víctor M Sánchez-Cartagena
|
Patrick Cadwell
|
Ekaterina Lapshinova-Koltunski
|
Vera Cabarrão
|
Konstantinos Chatzitheodorou
|
Mary Nurminen
|
Diptesh Kanojia
|
Helena Moniz
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
pdf
bib
abs
Cultural Transcreation with LLMs as a new product
Beatriz Silva
|
Helena Wu
|
Yan Jingxuan
|
Vera Cabarrão
|
Helena Moniz
|
Sara Guerreiro de Sousa
|
João Almeida
|
Malene Sjørslev Søholm
|
Ana Farinha
|
Paulo Dimas
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2)
We present how at Unbabel we have been using Large Language Models to apply a Cultural Transcreation (CT) product on customer support (CS) emails and how we have been testing the quality and potential of this product. We discuss our preliminary evaluation of the performance of different MT models in the task of translating rephrased content and the quality of the translation outputs. Furthermore, we introduce the live pilot programme and the corresponding relevant findings, showing that transcreated content is not only culturally adequate but it is also of high rephrasing and translation quality.
pdf
bib
abs
Findings of the WMT 2024 Shared Task on Chat Translation
Wafaa Mohammed
|
Sweta Agrawal
|
Amin Farajian
|
Vera Cabarrão
|
Bryan Eikema
|
Ana C Farinha
|
José G. C. De Souza
Proceedings of the Ninth Conference on Machine Translation
This paper presents the findings from the third edition of the Chat Translation Shared Task. As with previous editions, the task involved translating bilingual customer support conversations, specifically focusing on the impact of conversation context in translation quality and evaluation. We also include two new language pairs: English-Korean and English-Dutch, in addition to the set of language pairs from previous editions: English-German, English-French, and English-Brazilian Portuguese.We received 22 primary submissions and 32 contrastive submissions from eight teams, with each language pair having participation from at least three teams. We evaluated the systems comprehensively using both automatic metrics and human judgments via a direct assessment framework.The official rankings for each language pair were determined based on human evaluation scores, considering performance in both translation directions—agent and customer. Our analysis shows that while the systems excelled at translating individual turns, there is room for improvement in overall conversation-level translation quality.
2023
pdf
bib
abs
Context-aware and gender-neutral Translation Memories
Marjolene Paulo
|
Vera Cabarrão
|
Helena Moniz
|
Miguel Menezes
|
Rachel Grewcock
|
Eduardo Farah
Proceedings of the 24th Annual Conference of the European Association for Machine Translation
This work proposes an approach to use Part-Of-Speech (POS) information to automatically detect context-dependent Translation Units (TUs) from a Translation Memory database pertaining to the customer support domain. In line with our goal to minimize context-dependency in TUs, we show how this mechanism can be deployed to create new gender-neutral and context-independent TUs. Our experiments, conducted across Portuguese (PT), Brazilian Portuguese (PT-BR), Spanish (ES), and Spanish-Latam (ES-LATAM), show that the occurrence of certain POS with specific words is accurate in identifying context dependency. In a cross-client analysis, we found that ~10% of the most frequent 13,200 TUs were context-dependent, with gender determining context-dependency in 98% of all confirmed cases. We used these findings to suggest gender-neutral equivalents for the most frequent TUs with gender constraints. Our approach is in use in the Unbabel translation pipeline, and can be integrated into any other Neural Machine Translation (NMT) pipeline.
pdf
bib
abs
Dialogue Quality and Emotion Annotations for Customer Support Conversations
John Mendonca
|
Patrícia Pereira
|
Miguel Menezes
|
Vera Cabarrão
|
Ana C Farinha
|
Helena Moniz
|
Alon Lavie
|
Isabel Trancoso
Proceedings of the Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)
Task-oriented conversational datasets often lack topic variability and linguistic diversity. However, with the advent of Large Language Models (LLMs) pretrained on extensive, multilingual and diverse text data, these limitations seem overcome. Nevertheless, their generalisability to different languages and domains in dialogue applications remains uncertain without benchmarking datasets. This paper presents a holistic annotation approach for emotion and conversational quality in the context of bilingual customer support conversations. By performing annotations that take into consideration the complete instances that compose a conversation, one can form a broader perspective of the dialogue as a whole. Furthermore, it provides a unique and valuable resource for the development of text classification models. To this end, we present benchmarks for Emotion Recognition and Dialogue Quality Estimation and show that further research is needed to leverage these models in a production setting.
2022
pdf
bib
abs
Fast-Paced Improvements to Named Entity Handling for Neural Machine Translation
Pedro Mota
|
Vera Cabarrão
|
Eduardo Farah
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
In this work, we propose a Named Entity handling approach to improve translation quality within an existing Natural Language Processing (NLP) pipeline without modifying the Neural Machine Translation (NMT) component. Our approach seeks to enable fast delivery of such improvements and alleviate user experience problems related to NE distortion. We implement separate NE recognition and translation steps. Then, a combination of standard entity masking technique and a novel semantic equivalent placeholder guarantees that both NE translation is respected and the best overall quality is obtained from NMT. The experiments show that translation quality improves in 38.6% of the test cases when compared to a version of the NLP pipeline with less-developed NE handling capability.
pdf
bib
abs
A Case Study on the Importance of Named Entities in a Machine Translation Pipeline for Customer Support Content
Miguel Menezes
|
Vera Cabarrão
|
Pedro Mota
|
Helena Moniz
|
Alon Lavie
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
This paper describes the research developed at Unbabel, a Portuguese Machine-translation start-up, that combines MT with human post-edition and focuses strictly on customer service content. We aim to contribute to furthering MT quality and good-practices by exposing the importance of having a continuously-in-development robust Named Entity Recognition system compliant with General Data Protection Regulation (GDPR). Moreover, we have tested semiautomatic strategies that support and enhance the creation of Named Entities gold standards to allow a more seamless implementation of Multilingual Named Entities Recognition Systems. The project described in this paper is the result of a shared work between Unbabel ́s linguists and Unbabel ́s AI engineering team, matured over a year. The project should, also, be taken as a statement of multidisciplinary, proving and validating the much-needed articulation between the different scientific fields that compose and characterize the area of Natural Language Processing (NLP).
2014
pdf
bib
abs
Revising the annotation of a Broadcast News corpus: a linguistic approach
Vera Cabarrão
|
Helena Moniz
|
Fernando Batista
|
Ricardo Ribeiro
|
Nuno Mamede
|
Hugo Meinedo
|
Isabel Trancoso
|
Ana Isabel Mata
|
David Martins de Matos
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules. The main focus of the revision process consisted on annotating and revising structural metadata events, such as disfluencies and punctuation marks. The resultant revised data is now being extensively used, and was of extreme importance for improving the performance of several modules, especially the punctuation and capitalization modules, but also the speech recognition system, and all the subsequent modules. The resultant data has also been recently used in disfluency studies across domains.