It Takes Two to Tango – Towards a Multilingual MWE Resource

Svetlozara Leseva, Verginica Barbu Mititelu, Ivelina Stoyanova


Abstract
Mature wordnets make it possible to extract linguistic information that is not explicitly marked in the network. This paper focuses on how results already obtained at two levels, derivation and multiword expressions (MWEs), can be put to further use. The recent parallel development of the two resources under discussion, the Bulgarian and the Romanian wordnets, has enabled interlingual analyses that reveal similarities and differences in the linguistic knowledge the two wordnets encode. We show how the resources developed and the knowledge gained are combined to devise a linked MWE resource informed by layered dictionary representation and by corpus annotation and analysis. The work is a proof of concept for the adopted method of compiling a multilingual MWE resource from information extracted from the Bulgarian, Romanian and Princeton wordnets, together with additional language resources and automatic procedures.
Anthology ID:
2020.clib-1.11
Volume:
Proceedings of the 4th International Conference on Computational Linguistics in Bulgaria (CLIB 2020)
Month:
September
Year:
2020
Address:
Sofia, Bulgaria
Venue:
CLIB
Publisher:
Department of Computational Linguistics, IBL -- BAS
Pages:
101–111
URL:
https://aclanthology.org/2020.clib-1.11
Cite (ACL):
Svetlozara Leseva, Verginica Barbu Mititelu, and Ivelina Stoyanova. 2020. It Takes Two to Tango – Towards a Multilingual MWE Resource. In Proceedings of the 4th International Conference on Computational Linguistics in Bulgaria (CLIB 2020), pages 101–111, Sofia, Bulgaria. Department of Computational Linguistics, IBL -- BAS.
Cite (Informal):
It Takes Two to Tango – Towards a Multilingual MWE Resource (Leseva et al., CLIB 2020)
PDF:
https://aclanthology.org/2020.clib-1.11.pdf