Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding

Yuan Hua, Zheng Huang, Jie Guo, Weidong Qiu


Abstract
Information extraction from documents such as receipts or invoices is a fundamental and crucial step for office automation. Many approaches focus on extracting entities and relationships from plain texts, however, when it comes to document images, such demand becomes quite challenging since visual and layout information are also of great significance to help tackle this problem. In this work, we propose the attention-based graph neural network to combine textual and visual information from document images. Moreover, the global node is introduced in our graph construction algorithm which is used as a virtual hub to collect the information from all the nodes and edges to help improve the performance. Extensive experiments on real-world datasets show that our method outperforms baseline methods by significant margins.
Anthology ID:
2020.ccl-1.79
Volume:
Proceedings of the 19th Chinese National Conference on Computational Linguistics
Month:
October
Year:
2020
Address:
Haikou, China
Editors:
Maosong Sun (孙茂松), Sujian Li (李素建), Yue Zhang (张岳), Yang Liu (刘洋)
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
853–862
Language:
English
URL:
https://aclanthology.org/2020.ccl-1.79
DOI:
Bibkey:
Cite (ACL):
Yuan Hua, Zheng Huang, Jie Guo, and Weidong Qiu. 2020. Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding. In Proceedings of the 19th Chinese National Conference on Computational Linguistics, pages 853–862, Haikou, China. Chinese Information Processing Society of China.
Cite (Informal):
Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding (Hua et al., CCL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.ccl-1.79.pdf