Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment

Natalie Vargas, Carlos Ramisch, Helena Caseli


Abstract
We propose a method for joint unsupervised discovery of multiword expressions (MWEs) and their translations from parallel corpora. First, we apply independent monolingual MWE extraction in source and target languages simultaneously. Then, we calculate translation probability, association score and distributional similarity of co-occurring pairs. Finally, we rank all translations of a given MWE using a linear combination of these features. Preliminary experiments on light verb constructions show promising results.
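The ranking step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `Candidate` class, the `rank_translations` function, the equal default weights, and the toy feature values are all assumptions introduced here. The abstract specifies only that the three features are combined linearly.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A candidate target-language translation of one source MWE (hypothetical structure)."""
    target: str
    translation_prob: float  # translation probability of the co-occurring pair
    association: float       # association score of the pair
    similarity: float        # distributional similarity of the pair


def rank_translations(candidates, weights=(1.0, 1.0, 1.0)):
    """Sort candidates by a weighted sum of the three features, best first.

    The weights are placeholders; the abstract does not state how they are set.
    """
    w_prob, w_assoc, w_sim = weights

    def score(c):
        return (w_prob * c.translation_prob
                + w_assoc * c.association
                + w_sim * c.similarity)

    return sorted(candidates, key=score, reverse=True)


# Toy values made up for illustration only; they are not results from the paper.
candidates = [
    Candidate("cand_b", translation_prob=0.2, association=0.9, similarity=0.1),
    Candidate("cand_a", translation_prob=0.6, association=0.5, similarity=0.7),
    Candidate("cand_c", translation_prob=0.3, association=0.3, similarity=0.3),
]
ranked = rank_translations(candidates)
```

Changing `weights` shifts the ranking toward whichever feature is weighted most heavily, for example `rank_translations(candidates, weights=(0.0, 1.0, 0.0))` ranks by association score alone.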
Anthology ID:
W17-1711
Volume:
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
Month:
April
Year:
2017
Address:
Valencia, Spain
Venue:
MWE
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
91–96
URL:
https://aclanthology.org/W17-1711
DOI:
10.18653/v1/W17-1711
Cite (ACL):
Natalie Vargas, Carlos Ramisch, and Helena Caseli. 2017. Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 91–96, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment (Vargas et al., MWE 2017)
PDF:
https://aclanthology.org/W17-1711.pdf