AniEE: A Dataset of Animal Experimental Literature for Event Extraction

Dohee Kim, Ra Yoo, Soyoung Yang, Hee Yang, Jaegul Choo


Abstract
Event extraction (EE), a crucial information extraction (IE) task, aims to identify event triggers and their associated arguments in unstructured text and to classify them into pre-defined types and roles. In the biomedical domain, EE is widely used to extract complex structures representing biological events from the literature. Because biomedical text has complicated semantics and requires specialized domain knowledge, constructing biomedical event extraction datasets is challenging. In addition, most existing biomedical EE datasets focus on cell experiments or on the overall experimental procedure. We therefore introduce AniEE, an event extraction dataset concentrated on the animal experiment stage. In collaboration with domain experts, we establish a novel entity and event scheme customized for animal experiments. We then create a high-quality, expert-annotated dataset containing discontinuous entities and nested events, and evaluate recent strong named entity recognition (NER) and EE models on it.
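
The terms used in the abstract (trigger, argument, role, discontinuous entity, nested event) can be made concrete with a small data-structure sketch. Everything below is an illustrative assumption, not the AniEE annotation scheme or file format: the class names, the type and role labels ("Chemical", "Growth", "Cause", and so on), and the example sentence are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple, Union


def span(source: str, word: str) -> Tuple[int, int]:
    """Return the end-exclusive character span of the first occurrence of word."""
    start = source.index(word)
    return (start, start + len(word))


@dataclass
class Entity:
    """A typed mention; more than one span makes it discontinuous."""
    entity_type: str              # hypothetical label, not an AniEE type
    spans: List[Tuple[int, int]]  # end-exclusive character offsets

    def text(self, source: str) -> str:
        return " ".join(source[s:e] for s, e in self.spans)

    def is_discontinuous(self) -> bool:
        return len(self.spans) > 1


@dataclass
class Argument:
    """A role-labelled argument; the filler may itself be an event."""
    role: str
    filler: Union[Entity, "Event"]


@dataclass
class Event:
    event_type: str               # hypothetical label, not an AniEE type
    trigger: Tuple[int, int]
    arguments: List[Argument] = field(default_factory=list)

    def depth(self) -> int:
        """Nesting depth: 1 for a flat event, +1 per level of event arguments."""
        nested = [a.filler.depth() for a in self.arguments
                  if isinstance(a.filler, Event)]
        return 1 + max(nested, default=0)


# Made-up sentence, used only to illustrate the structures.
sentence = "Drug X inhibited liver and kidney tumor growth in mice."

drug = Entity("Chemical", [span(sentence, "Drug X")])
# "liver ... tumor" skips over "and kidney", so the mention is discontinuous.
liver_tumor = Entity("Disease", [span(sentence, "liver"), span(sentence, "tumor")])

# Inner event: the tumor grows.
growth = Event("Growth", span(sentence, "growth"), [Argument("Theme", liver_tumor)])
# Outer event takes the inner event as its Theme, which makes it nested.
inhibition = Event(
    "NegativeRegulation",
    span(sentence, "inhibited"),
    [Argument("Cause", drug), Argument("Theme", growth)],
)
```

In this sketch, `liver_tumor.text(sentence)` rebuilds the mention from its two spans and `inhibition.depth()` reports the nesting level. A flat list of contiguous spans could not represent either case, which is one reason datasets with discontinuous entities and nested events need a richer representation.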
Anthology ID:
2023.findings-emnlp.863
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
12959–12971
URL:
https://aclanthology.org/2023.findings-emnlp.863
DOI:
10.18653/v1/2023.findings-emnlp.863
Cite (ACL):
Dohee Kim, Ra Yoo, Soyoung Yang, Hee Yang, and Jaegul Choo. 2023. AniEE: A Dataset of Animal Experimental Literature for Event Extraction. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 12959–12971, Singapore. Association for Computational Linguistics.
Cite (Informal):
AniEE: A Dataset of Animal Experimental Literature for Event Extraction (Kim et al., Findings 2023)
PDF:
https://aclanthology.org/2023.findings-emnlp.863.pdf