Annotating Relation Mentions in Tabloid Press

Hong Li, Sebastian Krause, Feiyu Xu, Hans Uszkoreit, Robert Hummel, Veselina Mironova


Abstract
This paper presents a new resource for the training and evaluation needed by relation extraction experiments. The corpus consists of annotations of mentions for three semantic relations: marriage, parent―child, siblings, selected from the domain of biographic facts about persons and their social relationships. The corpus contains more than one hundred news articles from Tabloid Press. In the current corpus, we only consider the relation mentions occurring in the individual sentences. We provide multi-level annotations which specify the marked facts from relation, argument, entity, down to the token level, thus allowing for detailed analysis of linguistic phenomena and their interactions. A generic markup tool Recon developed at the DFKI LT lab has been utilised for the annotation task. The corpus has been annotated by two human experts, supported by additional conflict resolution conducted by a third expert. As shown in the evaluation, the annotation is of high quality as proved by the stated inter-annotator agreements both on sentence level and on relationmention level. The current corpus is already in active use in our research for evaluation of the relation extraction performance of our automatically learned extraction patterns.
Anthology ID:
L14-1234
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3253–3257
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/250_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Hong Li, Sebastian Krause, Feiyu Xu, Hans Uszkoreit, Robert Hummel, and Veselina Mironova. 2014. Annotating Relation Mentions in Tabloid Press. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3253–3257, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Annotating Relation Mentions in Tabloid Press (Li et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/250_Paper.pdf