Collocation or Free Combination? — Applying Machine Translation Techniques to identify collocations in Japanese

Lis Pereira, Elga Strafella, Yuji Matsumoto


Abstract
This work presents an initial investigation on how to distinguish collocations from free combinations. The assumption is that, while free combinations can be literally translated, the overall meaning of collocations is different from the sum of the translation of its parts. Based on that, we verify whether a machine translation system can help us perform such distinction. Results show that it improves the precision compared with standard methods of collocation identification through statistical association measures.
Anthology ID:
L14-1430
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
736–739
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/519_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Lis Pereira, Elga Strafella, and Yuji Matsumoto. 2014. Collocation or Free Combination? — Applying Machine Translation Techniques to identify collocations in Japanese. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 736–739, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Collocation or Free Combination? — Applying Machine Translation Techniques to identify collocations in Japanese (Pereira et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/519_Paper.pdf