Kevin Tran - ACL Anthology

Kevin Tran

2023

We study speech-to-speech translation (S2ST) that translates speech from one language into another language and focuses on building systems to support languages without standard text writing systems. We use English-Taiwanese Hokkien as a case study, and present an end-to-end solution from training data collection, modeling choices to benchmark dataset release. First, we present efforts on creating human annotated data, automatically mining data from large unlabeled speech datasets, and adopting pseudo-labeling to produce weakly supervised data. On the modeling, we take advantage of recent advances in applying self-supervised discrete representations as target for prediction in S2ST and show the effectiveness of leveraging additional text supervision from Mandarin, a language similar to Hokkien, in model training. Finally, we release an S2ST benchmark set to facilitate future research in this field.

FINDINGS OF THE IWSLT 2023 EVALUATION CAMPAIGN
Milind Agarwal | Sweta Agrawal | Antonios Anastasopoulos | Luisa Bentivogli | Ondřej Bojar | Claudia Borg | Marine Carpuat | Roldano Cattoni | Mauro Cettolo | Mingda Chen | William Chen | Khalid Choukri | Alexandra Chronopoulou | Anna Currey | Thierry Declerck | Qianqian Dong | Kevin Duh | Yannick Estève | Marcello Federico | Souhir Gahbiche | Barry Haddow | Benjamin Hsu | Phu Mon Htut | Hirofumi Inaguma | Dávid Javorský | John Judge | Yasumasa Kano | Tom Ko | Rishu Kumar | Pengwei Li | Xutai Ma | Prashant Mathur | Evgeny Matusov | Paul McNamee | John P. McCrae | Kenton Murray | Maria Nadejde | Satoshi Nakamura | Matteo Negri | Ha Nguyen | Jan Niehues | Xing Niu | Atul Kr. Ojha | John E. Ortega | Proyag Pal | Juan Pino | Lonneke van der Plas | Peter Polák | Elijah Rippeth | Elizabeth Salesky | Jiatong Shi | Matthias Sperber | Sebastian Stüker | Katsuhito Sudoh | Yun Tang | Brian Thompson | Kevin Tran | Marco Turchi | Alex Waibel | Mingxuan Wang | Shinji Watanabe | Rodolfo Zevallos
Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023)

This paper reports on the shared tasks organized by the 20th IWSLT Conference. The shared tasks address 9 scientific challenges in spoken language translation: simultaneous and offline translation, automatic subtitling and dubbing, speech-to-speech translation, multilingual, dialect and low-resource speech translation, and formality control. The shared tasks attracted a total of 38 submissions by 31 teams. The growing interest towards spoken language translation is also witnessed by the constantly increasing number of shared task organizers and contributors to the overview paper, almost evenly distributed across industry and academia.

Co-authors

Luisa Bentivogli 1

Ondřej Bojar 1

Marine Carpuat 1

Roldano Cattoni 1

Mauro Cettolo 1

Peng-Jen Chen 1

Khalid Choukri 1

Alexandra Chronopoulou 1

Thierry Declerck 1

Qianqian Dong 1

Paul-Ambroise Duquenne 1

Yannick Estève 1

Marcello Federico 1

Souhir Gahbiche 1

Dávid Javorský 1

Yasumasa Kano 1

Prashant Mathur 1

Evgeny Matusov 1

John Philip McCrae 1

Kenton Murray 1

Maria Nadejde 1

Satoshi Nakamura 1

Atul Kr. Ojha 1

John E. Ortega 1

Sravya Popuri 1

Elijah Rippeth 1

Elizabeth Salesky 1

Holger Schwenk 1

Matthias Sperber 1

Sebastian Stüker 1

Katsuhito Sudoh 1

Brian Thompson 1

Paden Tomasello 1

Changhan Wang 1

Mingxuan Wang 1

Shinji Watanabe 1

Rodolfo Zevallos 1

Lonneke van der Plas 1

Venues

findings1
iwslt1