Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts

Hai-Long Trieu, Nhung T. H. Nguyen, Makoto Miwa, Sophia Ananiadou


Abstract
Existing biomedical coreference resolution systems depend on features and/or rules based on syntactic parsers. In this paper, we investigate the utility of the state-of-the-art general domain neural coreference resolution system on biomedical texts. The system is an end-to-end system without depending on any syntactic parsers. We also investigate the domain specific features to enhance the system for biomedical texts. Experimental results on the BioNLP Protein Coreference dataset and the CRAFT corpus show that, with no parser information, the adapted system compared favorably with the systems that depend on parser information on these datasets, achieving 51.23% on the BioNLP dataset and 36.33% on the CRAFT corpus in F1 score. In-domain embeddings and domain-specific features helped improve the performance on the BioNLP dataset, but they did not on the CRAFT corpus.
Anthology ID:
W18-2324
Volume:
Proceedings of the BioNLP 2018 workshop
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
183–188
Language:
URL:
https://aclanthology.org/W18-2324
DOI:
10.18653/v1/W18-2324
Bibkey:
Cite (ACL):
Hai-Long Trieu, Nhung T. H. Nguyen, Makoto Miwa, and Sophia Ananiadou. 2018. Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts. In Proceedings of the BioNLP 2018 workshop, pages 183–188, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
Investigating Domain-Specific Information for Neural Coreference Resolution on Biomedical Texts (Trieu et al., BioNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-2324.pdf