Corpus Annotation Graph Builder (CAG): An Architectural Framework to Create and Annotate a Multi-source Graph

Roxanne El Baff, Tobias Hecking, Andreas Hamm, Jasper W. Korte, Sabine Bartsch


Abstract
Graphs are a natural representation of complex data as their structure allows users to discover (often implicit) relations among the nodes intuitively. Applications build graphs in an ad-hoc fashion, usually tailored to specific use cases, limiting their reusability. To account for this, we present the Corpus Annotation Graph (CAG) architectural framework based on a create-and-annotate pattern that enables users to build uniformly structured graphs from diverse data sources and extend them with automatically extracted annotations (e.g., named entities, topics). The resulting graphs can be used for further analyses across multiple downstream tasks (e.g., node classification). Code and resources are publicly available on GitHub, and downloadable via PyPi with the command pip install cag.
Anthology ID:
2023.eacl-demo.28
Volume:
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Danilo Croce, Luca Soldaini
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
248–255
Language:
URL:
https://aclanthology.org/2023.eacl-demo.28
DOI:
10.18653/v1/2023.eacl-demo.28
Bibkey:
Cite (ACL):
Roxanne El Baff, Tobias Hecking, Andreas Hamm, Jasper W. Korte, and Sabine Bartsch. 2023. Corpus Annotation Graph Builder (CAG): An Architectural Framework to Create and Annotate a Multi-source Graph. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 248–255, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Corpus Annotation Graph Builder (CAG): An Architectural Framework to Create and Annotate a Multi-source Graph (El Baff et al., EACL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.eacl-demo.28.pdf
Video:
 https://aclanthology.org/2023.eacl-demo.28.mp4