A Distributional and Orthographic Aggregation Model for English Derivational Morphology

Daniel Deutsch, John Hewitt, Dan Roth


Abstract
Modeling derivational morphology to generate words with particular semantics is useful in many text generation tasks, such as machine translation or abstractive question answering. In this work, we tackle the task of derived word generation. That is, we attempt to generate the word “runner” for “someone who runs.” We identify two key problems in generating derived words from root words and transformations. We contribute a novel aggregation model of derived word generation that learns derivational transformations both as orthographic functions using sequence-to-sequence models and as functions in distributional word embedding space. The model then learns to choose between the hypotheses of the two systems. We also present two ways of incorporating corpus information into derived word generation.
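The aggregation idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the two-dimensional embeddings are invented, the orthographic component is a naive suffix rule standing in for a sequence-to-sequence model, and the selector is a fixed similarity threshold rather than a learned chooser. All names (`EMBEDDINGS`, `learn_offset`, `aggregate`, and so on) are hypothetical.

```python
import math

# Toy 2-d embeddings, invented purely for illustration (an assumption).
EMBEDDINGS = {
    "run": (1.0, 0.0), "runner": (1.0, 1.0),
    "teach": (0.0, 1.0), "teacher": (0.0, 2.0),
    "bake": (2.0, 0.0), "baker": (2.0, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def learn_offset(pairs):
    """Average the derived-minus-root vector over (root, derived) pairs."""
    dims = len(EMBEDDINGS[pairs[0][0]])
    total = [0.0] * dims
    for root, derived in pairs:
        for i in range(dims):
            total[i] += EMBEDDINGS[derived][i] - EMBEDDINGS[root][i]
    return tuple(t / len(pairs) for t in total)


def distributional_hypothesis(root, offset):
    """Return the vocabulary word nearest to root + offset, with its score."""
    if root not in EMBEDDINGS:
        return None, 0.0  # no vector for this root, so no hypothesis
    target = tuple(a + b for a, b in zip(EMBEDDINGS[root], offset))
    candidates = [w for w in EMBEDDINGS if w != root]
    best = max(candidates, key=lambda w: cosine(EMBEDDINGS[w], target))
    return best, cosine(EMBEDDINGS[best], target)


def orthographic_hypothesis(root, suffix="er"):
    """Naive suffix rule; a stand-in for a sequence-to-sequence model."""
    if root.endswith("e"):
        return root + suffix[1:], 0.5  # placeholder confidence
    return root + suffix, 0.5


def aggregate(root, offset, threshold=0.9):
    """Pick one hypothesis; a fixed threshold replaces the learned chooser."""
    dist_word, dist_score = distributional_hypothesis(root, offset)
    orth_word, _ = orthographic_hypothesis(root)
    if dist_word is not None and dist_score >= threshold:
        return dist_word
    return orth_word


# Learn an agentive offset from two training pairs.
OFFSET = learn_offset([("run", "runner"), ("teach", "teacher")])
```

In this toy setup, the suffix rule alone misses the consonant doubling in "runner", while the embedding offset recovers it; for a root with no vector, such as "sing", the sketch falls back to the orthographic hypothesis.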
Anthology ID:
P18-1180
Volume:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Iryna Gurevych, Yusuke Miyao
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
1938–1947
URL:
https://aclanthology.org/P18-1180
DOI:
10.18653/v1/P18-1180
Cite (ACL):
Daniel Deutsch, John Hewitt, and Dan Roth. 2018. A Distributional and Orthographic Aggregation Model for English Derivational Morphology. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1938–1947, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
A Distributional and Orthographic Aggregation Model for English Derivational Morphology (Deutsch et al., ACL 2018)
PDF:
https://aclanthology.org/P18-1180.pdf
Presentation:
 P18-1180.Presentation.pdf
Video:
 https://aclanthology.org/P18-1180.mp4
Code:
 danieldeutsch/acl2018