Mapping and Generating Classifiers using an Open Chinese Ontology

Luis Morgado Da Costa, Francis Bond, Helena Gao


Abstract
In languages such as Chinese, classifiers (CLs) play a central role in the quantification of noun phrases. This can be a problem when generating text from input that does not specify the classifier, as in machine translation (MT) from English to Chinese. Many solutions to this problem rely on dictionaries of noun-CL pairs. However, there is no open, large-scale, machine-tractable dictionary of noun-CL associations. Many published resources exist, but they tend to focus on how a CL is used (e.g. what kinds of nouns can be used with it, or what features seem to be selected by each CL). Moreover, because nouns are open-class words, producing an exhaustive, definitive list of noun-CL associations is not possible: any such list would quickly become out of date. Our work addresses this problem by providing an algorithm for automatically building a frequency-based dictionary of noun-CL pairs, mapped to concepts in the Chinese Open Wordnet (Wang and Bond, 2013), an open machine-tractable dictionary for Chinese. All results will be released under an open license.
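As a rough illustration of what a frequency-based dictionary of noun-CL pairs might look like, the following is a minimal sketch and not the authors' algorithm. It assumes that noun-CL pairs have already been extracted from a corpus; the function names and the toy data are hypothetical, and the mapping to Chinese Open Wordnet concepts is not shown.

```python
from collections import Counter, defaultdict


def build_noun_cl_counts(pairs):
    """Count how often each classifier is observed with each noun.

    `pairs` is an iterable of (noun, classifier) tuples, assumed to have
    been extracted from a corpus beforehand (hypothetical input format).
    """
    counts = defaultdict(Counter)
    for noun, classifier in pairs:
        counts[noun][classifier] += 1
    return counts


def rank_classifiers(counts, noun):
    """Return the classifiers seen with `noun`, most frequent first."""
    return [cl for cl, _ in counts.get(noun, Counter()).most_common()]


# Toy observations, invented purely for illustration.
observed = [("狗", "只"), ("狗", "只"), ("狗", "条"), ("书", "本")]
counts = build_noun_cl_counts(observed)
ranked = rank_classifiers(counts, "狗")
```

A generator lacking an explicit classifier could then pick the first entry of the ranked list for a given noun; how to handle unseen nouns is left open in this sketch.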
Anthology ID:
2016.gwc-1.36
Volume:
Proceedings of the 8th Global WordNet Conference (GWC)
Month:
27–30 January
Year:
2016
Address:
Bucharest, Romania
Editors:
Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Pages:
249–256
URL:
https://aclanthology.org/2016.gwc-1.36
Cite (ACL):
Luis Morgado Da Costa, Francis Bond, and Helena Gao. 2016. Mapping and Generating Classifiers using an Open Chinese Ontology. In Proceedings of the 8th Global WordNet Conference (GWC), pages 249–256, Bucharest, Romania. Global Wordnet Association.
Cite (Informal):
Mapping and Generating Classifiers using an Open Chinese Ontology (Costa et al., GWC 2016)
PDF:
https://aclanthology.org/2016.gwc-1.36.pdf