The Gavagai Living Lexicon
Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan, Anders Holst
Abstract
This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 20 different languages. We describe the underlying distributional semantic model, and how we have solved some of the challenges in applying such a model to large amounts of streaming data. We also describe the architecture of our implementation, and discuss how we deal with continuous quality assurance of the lexicon.
- Anthology ID: L16-1053
- Volume: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month: May
- Year: 2016
- Address: Portorož, Slovenia
- Editors: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue: LREC
- Publisher: European Language Resources Association (ELRA)
- Pages: 344–350
- URL: https://aclanthology.org/L16-1053
- Cite (ACL): Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan, and Anders Holst. 2016. The Gavagai Living Lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 344–350, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal): The Gavagai Living Lexicon (Sahlgren et al., LREC 2016)
- PDF: https://aclanthology.org/L16-1053.pdf
Export citation
@inproceedings{sahlgren-etal-2016-gavagai,
    title = "The Gavagai Living Lexicon",
    author = "Sahlgren, Magnus and Gyllensten, Amaru Cuba and Espinoza, Fredrik and Hamfors, Ola and Karlgren, Jussi and Olsson, Fredrik and Persson, Per and Viswanathan, Akshay and Holst, Anders",
    editor = "Calzolari, Nicoletta and Choukri, Khalid and Declerck, Thierry and Goggi, Sara and Grobelnik, Marko and Maegaard, Bente and Mariani, Joseph and Mazo, Helene and Moreno, Asuncion and Odijk, Jan and Piperidis, Stelios",
    booktitle = "Proceedings of the Tenth International Conference on Language Resources and Evaluation ({LREC}'16)",
    month = may,
    year = "2016",
    address = "Portoro{\v{z}}, Slovenia",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://aclanthology.org/L16-1053",
    pages = "344--350",
    abstract = "This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 20 different languages. We describe the underlying distributional semantic model, and how we have solved some of the challenges in applying such a model to large amounts of streaming data. We also describe the architecture of our implementation, and discuss how we deal with continuous quality assurance of the lexicon.",
}