Comparing Multilingual NMT Models and Pivoting
Celia Soler Uguet, Fred Bane, Anna Zaretskaya, Tània Blanch Miró
Abstract
Following recent advancements in multilingual machine translation at scale, our team carried out tests to compare the performance of multilingual models (M2M from Facebook and multilingual models from Helsinki-NLP) with a two-step translation process using English as a pivot language. Direct assessment by linguists rated translations produced by pivoting as consistently better than those obtained from multilingual models of similar size, while automated evaluation with COMET suggested relative performance was strongly impacted by domain and language family.
- Anthology ID:
- 2022.eamt-1.26
- Volume:
- Proceedings of the 23rd Annual Conference of the European Association for Machine Translation
- Month:
- June
- Year:
- 2022
- Address:
- Ghent, Belgium
- Editors:
- Helena Moniz, Lieve Macken, Andrew Rufener, Loïc Barrault, Marta R. Costa-jussà, Christophe Declercq, Maarit Koponen, Ellie Kemp, Spyridon Pilos, Mikel L. Forcada, Carolina Scarton, Joachim Van den Bogaert, Joke Daems, Arda Tezcan, Bram Vanroy, Margot Fonteyne
- Venue:
- EAMT
- Publisher:
- European Association for Machine Translation
- Pages:
- 231–239
- URL:
- https://aclanthology.org/2022.eamt-1.26
- Bibkey:
- uguet-etal-2022-comparing
- Cite (ACL):
- Celia Soler Uguet, Fred Bane, Anna Zaretskaya, and Tània Blanch Miró. 2022. Comparing Multilingual NMT Models and Pivoting. In Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, pages 231–239, Ghent, Belgium. European Association for Machine Translation.
- Cite (Informal):
- Comparing Multilingual NMT Models and Pivoting (Uguet et al., EAMT 2022)
- PDF:
- https://aclanthology.org/2022.eamt-1.26.pdf
Export citation
@inproceedings{uguet-etal-2022-comparing,
    title = "Comparing Multilingual {NMT} Models and Pivoting",
    author = "Uguet, Celia Soler and Bane, Fred and Zaretskaya, Anna and Mir{\'o}, T{\`a}nia Blanch",
    editor = {Moniz, Helena and Macken, Lieve and Rufener, Andrew and Barrault, Lo{\"\i}c and Costa-juss{\`a}, Marta R. and Declercq, Christophe and Koponen, Maarit and Kemp, Ellie and Pilos, Spyridon and Forcada, Mikel L. and Scarton, Carolina and Van den Bogaert, Joachim and Daems, Joke and Tezcan, Arda and Vanroy, Bram and Fonteyne, Margot},
    booktitle = "Proceedings of the 23rd Annual Conference of the European Association for Machine Translation",
    month = jun,
    year = "2022",
    address = "Ghent, Belgium",
    publisher = "European Association for Machine Translation",
    url = "https://aclanthology.org/2022.eamt-1.26",
    pages = "231--239",
    abstract = "Following recent advancements in multilingual machine translation at scale, our team carried out tests to compare the performance of multilingual models (M2M from Facebook and multilingual models from Helsinki-NLP) with a two-step translation process using English as a pivot language. Direct assessment by linguists rated translations produced by pivoting as consistently better than those obtained from multilingual models of similar size, while automated evaluation with COMET suggested relative performance was strongly impacted by domain and language family.",
}