Improving corpus search via parsing

Natalia Klyueva, Pavel Straňák


Abstract
In this paper, we describe an addition to the corpus query system Kontext that enables to enhance the search using syntactic attributes in addition to the existing features, mainly lemmas and morphological categories. We present the enhancements of the corpus query system itself, the attributes we use to represent syntactic structures in data, and some examples of querying the syntactically annotated corpora, such as treebanks in various languages as well as an automatically parsed large corpus.
Anthology ID:
L16-1457
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2862–2866
Language:
URL:
https://aclanthology.org/L16-1457
DOI:
Bibkey:
Cite (ACL):
Natalia Klyueva and Pavel Straňák. 2016. Improving corpus search via parsing. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2862–2866, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Improving corpus search via parsing (Klyueva & Straňák, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1457.pdf