Parallel Chinese-English Entities, Relations and Events Corpora

Justin Mott, Ann Bies, Zhiyi Song, Stephanie Strassel


Abstract
This paper introduces the parallel Chinese-English Entities, Relations and Events (ERE) corpora developed by the Linguistic Data Consortium under the DARPA Deep Exploration and Filtering of Text (DEFT) Program. Original Chinese newswire and discussion forum documents are annotated for two versions of the ERE task. The texts are manually translated into English and then annotated for the same ERE tasks on the English translation, resulting in a rich parallel resource that has utility for performers within the DEFT program, for participants in NIST’s Knowledge Base Population evaluations, and for cross-language projection research more generally.
Anthology ID:
L16-1589
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3717–3722
URL:
https://aclanthology.org/L16-1589
Cite (ACL):
Justin Mott, Ann Bies, Zhiyi Song, and Stephanie Strassel. 2016. Parallel Chinese-English Entities, Relations and Events Corpora. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3717–3722, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Parallel Chinese-English Entities, Relations and Events Corpora (Mott et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1589.pdf