Deep Learning Models For Multiword Expression Identification

Waseem Gharbieh, Virendrakumar Bhavsar, Paul Cook


Abstract
Multiword expressions (MWEs) are lexical items that can be decomposed into multiple component words, but have properties that are unpredictable with respect to their component words. In this paper we propose the first deep learning models for token-level identification of MWEs. Specifically, we consider a layered feedforward network, a recurrent neural network, and convolutional neural networks. In experimental results we show that convolutional neural networks are able to outperform the previous state-of-the-art for MWE identification, with a convolutional neural network with three hidden layers giving the best performance.
Anthology ID:
S17-1006
Volume:
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017)
Month:
August
Year:
2017
Address:
Vancouver, Canada
Editors:
Nancy Ide, Aurélie Herbelot, Lluís Màrquez
Venue:
*SEM
SIGs:
SIGLEX | SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
54–64
Language:
URL:
https://aclanthology.org/S17-1006/
DOI:
10.18653/v1/S17-1006
Bibkey:
Cite (ACL):
Waseem Gharbieh, Virendrakumar Bhavsar, and Paul Cook. 2017. Deep Learning Models For Multiword Expression Identification. In Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017), pages 54–64, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
Deep Learning Models For Multiword Expression Identification (Gharbieh et al., *SEM 2017)
Copy Citation:
PDF:
https://aclanthology.org/S17-1006.pdf