MASALA: Modelling and Analysing the Semantics of Adpositions in Linguistic Annotation of Hindi

Aryaman Arora, Nitin Venkateswaran, Nathan Schneider


Abstract
We present a completed, publicly available corpus of annotated semantic relations of adpositions and case markers in Hindi. We used the multilingual SNACS annotation scheme, which has been applied to a variety of typologically diverse languages. Building on past work examining linguistic problems in SNACS annotation, we use language models to attempt automatic labelling of SNACS supersenses in Hindi and achieve results competitive with past work on English. We look towards upstream applications in semantic role labelling and extension to related languages such as Gujarati.
Anthology ID:
2022.lrec-1.612
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5696–5704
Language:
URL:
https://aclanthology.org/2022.lrec-1.612
DOI:
Bibkey:
Cite (ACL):
Aryaman Arora, Nitin Venkateswaran, and Nathan Schneider. 2022. MASALA: Modelling and Analysing the Semantics of Adpositions in Linguistic Annotation of Hindi. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5696–5704, Marseille, France. European Language Resources Association.
Cite (Informal):
MASALA: Modelling and Analysing the Semantics of Adpositions in Linguistic Annotation of Hindi (Arora et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.612.pdf
Data
STREUSLE