CCGweb: a New Annotation Tool and a First Quadrilingual CCG Treebank

Kilian Evang, Lasha Abzianidze, Johan Bos


Abstract
We present the first open-source graphical annotation tool for combinatory categorial grammar (CCG), and the first set of detailed guidelines for syntactic annotation with CCG, for four languages: English, German, Italian, and Dutch. We also release a parallel pilot CCG treebank based on these guidelines, with 4x100 adjudicated sentences, 10K single-annotator fully corrected sentences, and 82K single-annotator partially corrected sentences.
Anthology ID:
W19-4005
Original:
W19-4005v1
Version 2:
W19-4005v2
Version 3:
W19-4005v3
Volume:
Proceedings of the 13th Linguistic Annotation Workshop
Month:
August
Year:
2019
Address:
Florence, Italy
Venues:
ACL | LAW | WS
SIG:
SIGANN
Publisher:
Association for Computational Linguistics
Pages:
37–42
URL:
https://aclanthology.org/W19-4005
DOI:
10.18653/v1/W19-4005
Cite (ACL):
Kilian Evang, Lasha Abzianidze, and Johan Bos. 2019. CCGweb: a New Annotation Tool and a First Quadrilingual CCG Treebank. In Proceedings of the 13th Linguistic Annotation Workshop, pages 37–42, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
CCGweb: a New Annotation Tool and a First Quadrilingual CCG Treebank (Evang et al., 2019)
PDF:
https://aclanthology.org/W19-4005.pdf
Data
Penn Treebank