A MWE lexicon formalism optimised for observational adequacy

Adam Lion-Bouton, Agata Savary, Jean-Yves Antoine


Abstract
Past research argues that, to handle the unpredictable nature of multiword expressions (MWEs), their identification should be assisted by lexicons. The choice of format for such lexicons, however, is far from obvious. We propose what is, to our knowledge, the first method to quantitatively evaluate some MWE lexicon formalisms, based on the notion of observational adequacy. We apply it to derive a simple yet adequate MWE lexicon formalism, dubbed λ-CSS, based on syntactic dependencies. It proves competitive with lexicons based on sequential representations of MWEs, and even comparable to a state-of-the-art MWE identifier.
Anthology ID:
2023.mwe-1.16
Volume:
Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023)
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Archna Bhatia, Kilian Evang, Marcos Garcia, Voula Giouli, Lifeng Han, Shiva Taslimipoor
Venue:
MWE
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
121–130
URL:
https://aclanthology.org/2023.mwe-1.16
DOI:
10.18653/v1/2023.mwe-1.16
Cite (ACL):
Adam Lion-Bouton, Agata Savary, and Jean-Yves Antoine. 2023. A MWE lexicon formalism optimised for observational adequacy. In Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023), pages 121–130, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
A MWE lexicon formalism optimised for observational adequacy (Lion-Bouton et al., MWE 2023)
PDF:
https://aclanthology.org/2023.mwe-1.16.pdf