Novel Event Detection and Classification for Historical Texts

Rachele Sprugnoli, Sara Tonelli


Abstract
Event processing is an active area of research in the Natural Language Processing community, but resources and automatic systems developed so far have mainly addressed contemporary texts. However, the recognition and elaboration of events is a crucial step when dealing with historical texts Particularly in the current era of massive digitization of historical sources: Research in this domain can lead to the development of methodologies and tools that can assist historians in enhancing their work, while having an impact also on the field of Natural Language Processing. Our work aims at shedding light on the complex concept of events when dealing with historical texts. More specifically, we introduce new annotation guidelines for event mentions and types, categorized into 22 classes. Then, we annotate a historical corpus accordingly, and compare two approaches for automatic event detection and classification following this novel scheme. We believe that this work can foster research in a field of inquiry as yet underestimated in the area of Temporal Information Processing. To this end, we release new annotation guidelines, a corpus, and new models for automatic annotation.
Anthology ID:
J19-2002
Volume:
Computational Linguistics, Volume 45, Issue 2 - June 2019
Month:
June
Year:
2019
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
229–265
Language:
URL:
https://aclanthology.org/J19-2002
DOI:
10.1162/coli_a_00347
Bibkey:
Cite (ACL):
Rachele Sprugnoli and Sara Tonelli. 2019. Novel Event Detection and Classification for Historical Texts. Computational Linguistics, 45(2):229–265.
Cite (Informal):
Novel Event Detection and Classification for Historical Texts (Sprugnoli & Tonelli, CL 2019)
Copy Citation:
PDF:
https://aclanthology.org/J19-2002.pdf