Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank

Rui Wang; Caroline Sporleder

Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank

Abstract

In this paper, we present our work on constructing a textual semantic relation corpus by making use of an existing treebank annotated with discourse relations. We extract adjacent text span pairs and group them into six categories according to the different discourse relations between them. After that, we present the details of our annotation scheme, which includes six textual semantic relations, 'backward entailment', 'forward entailment', 'equality', 'contradiction', 'overlapping', and 'independent'. We also discuss some ambiguous examples to show the difficulty of such annotation task, which cannot be easily done by an automatic mapping between discourse relations and semantic relations. We have two annotators and each of them performs the task twice. The basic statistics on the constructed corpus looks promising: we achieve 81.17% of agreement on the six semantic relation annotation with a .718 kappa score, and it increases to 91.21% if we collapse the last two labels with a .775 kappa score.

Anthology ID:: L10-1565
Volume:: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
Month:: May
Year:: 2010
Address:: Valletta, Malta
Editors:: Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias
Venue:: LREC
SIG:
Publisher:: European Language Resources Association (ELRA)
Note:
Pages:
Language:
External URL:: http://www.lrec-conf.org/proceedings/lrec2010/pdf/820_Paper.pdf
DOI:
Bibkey:
Cite (ACL):: Rui Wang and Caroline Sporleder. 2010. Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta. European Language Resources Association (ELRA).
Cite (Informal):: Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank (Wang & Sporleder, LREC 2010)
Copy Citation:

External Cite Search Fix data