Cognates alignment

António Ribeiro, Gaël Dias, Gabriel Lopes, João Mexia


Abstract
Some authors (Simard et al.; Melamed; Danielsson & Mühlenbock) have suggested measures of similarity of words in different languages so as to find extra clues for alignment of parallel texts. Cognate words, like ‘Parliament’ and ‘Parlement’, in English and French respectively, provide extra anchors that help to improve the quality of the alignment. In this paper, we will extend an alignment algorithm proposed by Ribeiro et al. using typical contiguous and non-contiguous sequences of characters extracted using a statistically sound method (Dias et al.). With these typical sequences, we are able to find more reliable correspondence points and improve the alignment quality without recurring to heuristics to identify cognates.
Anthology ID:
2001.mtsummit-papers.52
Volume:
Proceedings of Machine Translation Summit VIII
Month:
September 18-22
Year:
2001
Address:
Santiago de Compostela, Spain
Editor:
Bente Maegaard
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2001.mtsummit-papers.52
DOI:
Bibkey:
Cite (ACL):
António Ribeiro, Gaël Dias, Gabriel Lopes, and João Mexia. 2001. Cognates alignment. In Proceedings of Machine Translation Summit VIII, Santiago de Compostela, Spain.
Cite (Informal):
Cognates alignment (Ribeiro et al., MTSummit 2001)
Copy Citation:
PDF:
https://aclanthology.org/2001.mtsummit-papers.52.pdf