The Lexometer: A Shiny Application for Exploratory Analysis and Visualization of Corpus Data

Oufan Hai, Matthew Sundberg, Katherine Trice, Rebecca Friedman, Scott Grimm


Abstract
Performing even simple data science tasks with corpus data often requires significant expertise in data science and in programming languages such as R and Python. With the aim of making quantitative research more accessible to researchers in the language sciences, we present the Lexometer, a Shiny application that integrates numerous data analysis and visualization functions into an easy-to-use graphical user interface. The Lexometer's functions include filtering large databases to generate subsets of the data and variables of interest, providing a range of graphing techniques for both single- and multiple-variable analysis, and presenting the data in a table format that can be further filtered and that offers methods for cleaning the data. The Lexometer aims to be useful to language researchers with differing levels of programming expertise and to help broaden the inclusion of corpus-based empirical evidence in the language sciences.
Anthology ID:
2022.lrec-1.684
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
6370–6376
URL:
https://aclanthology.org/2022.lrec-1.684
Cite (ACL):
Oufan Hai, Matthew Sundberg, Katherine Trice, Rebecca Friedman, and Scott Grimm. 2022. The Lexometer: A Shiny Application for Exploratory Analysis and Visualization of Corpus Data. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 6370–6376, Marseille, France. European Language Resources Association.
Cite (Informal):
The Lexometer: A Shiny Application for Exploratory Analysis and Visualization of Corpus Data (Hai et al., LREC 2022)
PDF:
https://aclanthology.org/2022.lrec-1.684.pdf