Linguistically Inspired Language Model Augmentation for MT

George Tambouratzis, Vasiliki Pouli


Abstract
The present article reports on efforts to improve the translation accuracy of a corpus―based Machine Translation (MT) system. In order to achieve that, an error analysis performed on past translation outputs has indicated the likelihood of improving the translation accuracy by augmenting the coverage of the Target-Language (TL) side language model. The method adopted for improving the language model is initially presented, based on the concatenation of consecutive phrases. The algorithmic steps are then described that form the process for augmenting the language model. The key idea is to only augment the language model to cover the most frequent cases of phrase sequences, as counted over a TL-side corpus, in order to maximize the cases covered by the new language model entries. Experiments presented in the article show that substantial improvements in translation accuracy are achieved via the proposed method, when integrating the grown language model to the corpus-based MT system.
Anthology ID:
L16-1091
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
573–577
Language:
URL:
https://aclanthology.org/L16-1091/
DOI:
Bibkey:
Cite (ACL):
George Tambouratzis and Vasiliki Pouli. 2016. Linguistically Inspired Language Model Augmentation for MT. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 573–577, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Linguistically Inspired Language Model Augmentation for MT (Tambouratzis & Pouli, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1091.pdf