PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, Nathan Schneider


Abstract
We present the Prepositions Annotated with Supsersense Tags in Reddit International English (“PASTRIE”) corpus, a new dataset containing manually annotated preposition supersenses of English data from presumed speakers of four L1s: English, French, German, and Spanish. The annotations are comprehensive, covering all preposition types and tokens in the sample. Along with the corpus, we provide analysis of distributional patterns across the included L1s and a discussion of the influence of L1s on L2 preposition choice.
Anthology ID:
2020.law-1.10
Volume:
Proceedings of the 14th Linguistic Annotation Workshop
Month:
December
Year:
2020
Address:
Barcelona, Spain
Editors:
Stefanie Dipper, Amir Zeldes
Venue:
LAW
SIG:
SIGANN
Publisher:
Association for Computational Linguistics
Note:
Pages:
105–116
Language:
URL:
https://aclanthology.org/2020.law-1.10
DOI:
Bibkey:
Cite (ACL):
Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, and Nathan Schneider. 2020. PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English. In Proceedings of the 14th Linguistic Annotation Workshop, pages 105–116, Barcelona, Spain. Association for Computational Linguistics.
Cite (Informal):
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English (Kranzlein et al., LAW 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.law-1.10.pdf
Code
 nert-nlp/pastrie
Data
PASTRIESTREUSLE