Access control by query rewriting: the case of KorAP

Piotr Bański, Nils Diewald, Michael Hanl, Marc Kupietz, Andreas Witt


Abstract
We present an approach to one aspect of managing complex access scenarios for large and heterogeneous corpora: handling user queries that, intentionally or because of the complexity of the queried resource, target texts or annotations outside the given user’s permissions. We first outline the overall architecture of the corpus analysis platform KorAP, devoting some attention to how it handles multiple query languages by implementing ISO CQLF (Corpus Query Lingua Franca), which in turn is a component crucial for the functionality discussed here. Next, we look at query rewriting as it is used by KorAP and zoom in on one kind of this procedure, namely the rewriting of queries that is forced by data access restrictions.
Anthology ID:
L14-1582
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3817–3822
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/743_Paper.pdf
Cite (ACL):
Piotr Bański, Nils Diewald, Michael Hanl, Marc Kupietz, and Andreas Witt. 2014. Access control by query rewriting: the case of KorAP. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3817–3822, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Access control by query rewriting: the case of KorAP (Bański et al., LREC 2014)