DORE: Document Ordered Relation Extraction based on Generative Framework

Qipeng Guo; Yuqing Yang; Hang Yan (航 颜); Xipeng Qiu (邱锡鹏); Zheng Zhang

doi:10.18653/v1/2022.findings-emnlp.253

DORE: Document Ordered Relation Extraction based on Generative Framework

Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, Zheng Zhang

Abstract

In recent years, there is a surge of generation-based information extraction work, which allows a more direct use of pre-trained language models and efficiently captures output dependencies. However, previous generative methods using lexical representation do not naturally fit document-level relation extraction (DocRE) where there are multiple entities and relational facts. In this paper, we investigate the root cause of the underwhelming performance of the existing generative DocRE models and discover that the culprit is the inadequacy of the training paradigm, instead of the capacities of the models. We propose to generate a symbolic and ordered sequence from the relation matrix which is deterministic and easier for model to learn. Moreover, we design a parallel row generation method to process overlong target sequences. Besides, we introduce several negative sampling strategies to improve the performance with balanced signals. Experimental results on four datasets show that our proposed method can improve the performance of the generative DocRE models.

Anthology ID:: 2022.findings-emnlp.253
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2022
Month:: December
Year:: 2022
Address:: Abu Dhabi, United Arab Emirates
Editors:: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3463–3474
Language:
URL:: https://aclanthology.org/2022.findings-emnlp.253/
DOI:: 10.18653/v1/2022.findings-emnlp.253
Bibkey:
Cite (ACL):: Qipeng Guo, Yuqing Yang, Hang Yan, Xipeng Qiu, and Zheng Zhang. 2022. DORE: Document Ordered Relation Extraction based on Generative Framework. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 3463–3474, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):: DORE: Document Ordered Relation Extraction based on Generative Framework (Guo et al., Findings 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.findings-emnlp.253.pdf
Video:: https://aclanthology.org/2022.findings-emnlp.253.mp4

PDF Cite Search Video Fix data