Inducing Discourse Marker Inventories from Lexical Knowledge Graphs

Christian Chiarcos


Abstract
Discourse marker inventories are important tools for the development of both discourse parsers and corpora with discourse annotations. In this paper we explore the potential of massively multilingual lexical knowledge graphs to induce multilingual discourse marker lexicons using concept propagation methods as previously developed in the context of translation inference across dictionaries. Given one or multiple source languages with discourse marker inventories that discourse relations as senses of potential discourse markers, as well as a large number of bilingual dictionaries that link them – directly or indirectly – with the target language, we specifically study to what extent discourse marker induction can benefit from the integration of information from different sources, the impact of sense granularity and what limiting factors may need to be considered. Our study uses discourse marker inventories from nine European languages normalized against the discourse relation inventory of the Penn Discourse Treebank (PDTB), as well as three collections of machine-readable dictionaries with different characteristics, so that the interplay of a large number of factors can be studied.
Anthology ID:
2022.lrec-1.257
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2401–2412
Language:
URL:
https://aclanthology.org/2022.lrec-1.257
DOI:
Bibkey:
Cite (ACL):
Christian Chiarcos. 2022. Inducing Discourse Marker Inventories from Lexical Knowledge Graphs. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2401–2412, Marseille, France. European Language Resources Association.
Cite (Informal):
Inducing Discourse Marker Inventories from Lexical Knowledge Graphs (Chiarcos, LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.257.pdf
Code
 acoli-repo/rdf4discourse