Crowdsourcing Ontology Lexicons

Bettina Lanser, Christina Unger, Philipp Cimiano


Abstract
In order to make the growing amount of conceptual knowledge available through ontologies and datasets accessible to humans, NLP applications need access to information on how this knowledge can be verbalized in natural language. One way to provide this kind of information are ontology lexicons, which apart from the actual verbalizations in a given target language can provide further, rich linguistic information about them. Compiling such lexicons manually is a very time-consuming task and requires expertise both in Semantic Web technologies and lexicon engineering, as well as a very good knowledge of the target language at hand. In this paper we present an alternative approach to generating ontology lexicons by means of crowdsourcing: We use CrowdFlower to generate a small Japanese ontology lexicon for ten exemplary ontology elements from the DBpedia ontology according to a two-stage workflow, the main underlying idea of which is to turn the task of generating lexicon entries into a translation task; the starting point of this translation task is a manually created English lexicon for DBpedia. Comparison of the results to a manually created Japanese lexicon shows that the presented workflow is a viable option if an English seed lexicon is already available.
Anthology ID:
L16-1554
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3477–3484
Language:
URL:
https://aclanthology.org/L16-1554
DOI:
Bibkey:
Cite (ACL):
Bettina Lanser, Christina Unger, and Philipp Cimiano. 2016. Crowdsourcing Ontology Lexicons. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3477–3484, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Crowdsourcing Ontology Lexicons (Lanser et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1554.pdf