CatVar: a database of categorial variations for English

Nizar Habash, Bonnie Dorr


Abstract
We present a new large-scale database called “CatVar” (Habash and Dorr, 2003) which contains categorial variations of English lexemes. Due to the prevalence of cross-language categorial variation in multilingual applications, our categorial-variation resource may serve as an integral part of a diverse range of natural language applications. Thus, the research reported herein overlaps heavily with that of the machine-translation, lexicon-construction, and information-retrieval communities. We demonstrate this database, embedded in a graphical interface; we also show a GUI for user input of corrections to the database.
Anthology ID:
2003.mtsummit-systems.9
Volume:
Proceedings of Machine Translation Summit IX: System Presentations
Month:
September 23-27
Year:
2003
Address:
New Orleans, USA
Venue:
MTSummit
URL:
https://aclanthology.org/2003.mtsummit-systems.9
Cite (ACL):
Nizar Habash and Bonnie Dorr. 2003. CatVar: a database of categorial variations for English. In Proceedings of Machine Translation Summit IX: System Presentations, New Orleans, USA.
Cite (Informal):
CatVar: a database of categorial variations for English (Habash & Dorr, MTSummit 2003)
PDF:
https://aclanthology.org/2003.mtsummit-systems.9.pdf