First Steps Towards Coverage-Based Sentence Alignment

Luís Gomes, Gabriel Pereira Lopes


Abstract
In this paper, we introduce a coverage-based scoring function that discriminates between parallel and non-parallel sentences. When plugged into Bleualign, a state-of-the-art sentence aligner, our function improves both precision and recall of alignments over the originally proposed BLEU score. Furthermore, since our scoring function uses Moses phrase tables directly we avoid the need to translate the texts to be aligned, which is time-consuming and a potential source of alignment errors.
Anthology ID:
L16-1354
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2228–2231
Language:
URL:
https://aclanthology.org/L16-1354
DOI:
Bibkey:
Cite (ACL):
Luís Gomes and Gabriel Pereira Lopes. 2016. First Steps Towards Coverage-Based Sentence Alignment. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2228–2231, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
First Steps Towards Coverage-Based Sentence Alignment (Gomes & Lopes, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1354.pdf