A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon

Tibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic, Johanna Marie Poppek


Abstract
The present paper describes the current release of the Bochum English Countability Lexicon (BECL 2.1), a large empirical database consisting of lemmata from Open ANC (http://www.anc.org) with added senses from WordNet (Fellbaum 1998). BECL 2.1 contains ≈ 11,800 annotated noun-sense pairs, divided in four major countability classes and 18 fine-grained subclasses. In the current version, BECL also provides information on nouns whose senses occur in more than one class allowing a closer look on polysemy and homonymy with regard to countability. Further included are sets of similar senses using the Leacock and Chodorow (LCH) score for semantic similarity (Leacock & Chodorow 1998), information on orthographic variation, on the completeness of all WordNet senses in the database and an annotated representation of different types of proper names. The further development of BECL will investigate the different countability classes of proper names and the general relation between semantic similarity and countability as well as recurring syntactic patterns for noun-sense pairs. The BECL 2.1 database is also publicly available via http://count-and-mass.org.
Anthology ID:
L16-1447
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2810–2814
Language:
URL:
https://aclanthology.org/L16-1447
DOI:
Bibkey:
Cite (ACL):
Tibor Kiss, Francis Jeffry Pelletier, Halima Husic, Roman Nino Simunic, and Johanna Marie Poppek. 2016. A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2810–2814, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
A sense-based lexicon of count and mass expressions: The Bochum English Countability Lexicon (Kiss et al., LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1447.pdf