Annotation Guidelines for Czech-English Word Alignment

Ivana Kruijff-Korbayová, Klára Chvátalová, Oana Postolache


Abstract
We report on our experience with manual alignment of Czech and English parallel corpus text. We applied existing guidelines for English and French and augmented them to cover systematically occurring cases in our corpus. We describe the main extensions covered in our guidelines and provide examples. We evaluated both intra- and inter-annotator agreement and obtained very good results of Kappa well above 0.9 and agreement of 95% and 93%, respectively.
Anthology ID:
L06-1346
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Editors:
Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/575_pdf.pdf
DOI:
Bibkey:
Cite (ACL):
Ivana Kruijff-Korbayová, Klára Chvátalová, and Oana Postolache. 2006. Annotation Guidelines for Czech-English Word Alignment. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
Annotation Guidelines for Czech-English Word Alignment (Kruijff-Korbayová et al., LREC 2006)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/575_pdf.pdf