Building an Annotated Corpus for Text Summarization and Question Answering

Patcharee Varasai, Chaveevan Pechsiri, Thana Sukvari, Vee Satayamas, Asanee Kawtrakul


Abstract
We describe ongoing work on semi-automatic corpus annotation, with the goal of answering why-questions in a question answering system and constructing a coherent tree for text summarization. In this paper we present annotation schemas for identifying the discourse relations that hold between parts of a text, as well as the particular text spans that are related via each discourse relation. Furthermore, we address several tasks in building a discourse-level annotated corpus, namely creating annotation guidelines, ensuring annotation accuracy, and evaluation.
Anthology ID:
L08-1613
Volume:
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Month:
May
Year:
2008
Address:
Marrakech, Morocco
Editors:
Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
URL:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/870_paper.pdf
Cite (ACL):
Patcharee Varasai, Chaveevan Pechsiri, Thana Sukvari, Vee Satayamas, and Asanee Kawtrakul. 2008. Building an Annotated Corpus for Text Summarization and Question Answering. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
Cite (Informal):
Building an Annotated Corpus for Text Summarization and Question Answering (Varasai et al., LREC 2008)
PDF:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/870_paper.pdf