‘interHist’ - an interactive visual interface for corpus exploration

Verena Lyding, Lionel Nicolas, Egon Stemle


Abstract
In this article, we present interHist, a compact visualization for the interactive exploration of results of complex corpus queries. Integrated with a search interface to the PAISA corpus of Italian web texts, interHist aims to facilitate the exploration of large result sets returned by linguistic corpus searches. It does so by providing an interactive visual overview of the data that supports user-steered navigation through interactive filtering. Users can switch dynamically between an overview of the data and a detailed view of results in their immediate textual context, which helps them detect and inspect relevant hits more efficiently. We provide background information on corpus linguistics and related work on visualizations for language and linguistic data. We introduce the architecture of interHist by detailing the data structure it relies on, describing the visualization design, and providing technical details of the implementation and its integration with the corpus querying environment. Finally, we illustrate its usage with a use case on the analysis of the composition of Italian noun phrases.
Anthology ID:
L14-1428
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
635–641
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/517_Paper.pdf
Cite (ACL):
Verena Lyding, Lionel Nicolas, and Egon Stemle. 2014. ‘interHist’ - an interactive visual interface for corpus exploration. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 635–641, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
‘interHist’ - an interactive visual interface for corpus exploration (Lyding et al., LREC 2014)