QUEMDISSE? Reported speech in Portuguese

Cláudia Freitas, Bianca Freitas, Diana Santos


Abstract
This paper presents corpus-based work on direct and indirect speech in Portuguese. We report on a study whose aim was to identify (i) the Portuguese verbs used to introduce reported speech and (ii) the syntactic patterns used to convey it, in order to improve the performance of a quotation extraction system dubbed QUEMDISSE?. In addition, (iii) we present a Portuguese corpus annotated for reported speech using the lexicon and rules obtained in (i) and (ii), and discuss the annotation process and what was learned from it.
Anthology ID:
L16-1698
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
4410–4416
URL:
https://aclanthology.org/L16-1698
Cite (ACL):
Cláudia Freitas, Bianca Freitas, and Diana Santos. 2016. QUEMDISSE? Reported speech in Portuguese. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4410–4416, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
QUEMDISSE? Reported speech in Portuguese (Freitas et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1698.pdf