Attribution of Quoted Speech in Portuguese Text

Eckhard Bick


Abstract
This paper describes and evaluates a rule-based system implementing a novel method for quote attribution in Portuguese text, working on top of a Constraint-Grammar parse. Both direct and indirect speech are covered, as well as certain other text- embedded quote sources. In a first step, the system performs quote segmentation and identifies speech verbs, taking into account the different styles used in literature and news text. Speakers are then identified using syntactically and semantically grounded Constraint-Grammar rules. We rely on relational links and stream variables to handle anaphorical mentions and to recover the names of implied or underspecified speakers. In an evaluation including both literature and news text, the system performed well on both the segmentation and attribution tasks, achieving F-scores of 98-99% for the former and 89-94% for the latter.
Anthology ID:
2023.nodalida-cgmta.1
Volume:
Proceedings of the NoDaLiDa 2023 Workshop on Constraint Grammar - Methods, Tools and Applications
Month:
May
Year:
2023
Address:
Tórshavn, Faroe Islands
Editors:
Eckhard Bick, Trond Trosterud, Tanel Alumäe
Venue:
WS
SIG:
Publisher:
Association of Computational Linguistics
Note:
Pages:
1–9
Language:
URL:
https://aclanthology.org/2023.nodalida-cgmta.1
DOI:
Bibkey:
Cite (ACL):
Eckhard Bick. 2023. Attribution of Quoted Speech in Portuguese Text. In Proceedings of the NoDaLiDa 2023 Workshop on Constraint Grammar - Methods, Tools and Applications, pages 1–9, Tórshavn, Faroe Islands. Association of Computational Linguistics.
Cite (Informal):
Attribution of Quoted Speech in Portuguese Text (Bick, 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.nodalida-cgmta.1.pdf