ActiveEA: Active Learning for Neural Entity Alignment

Bing Liu, Harrisen Scells, Guido Zuccon, Wen Hua, Genghong Zhao


Abstract
Entity Alignment (EA) aims to match equivalent entities across different Knowledge Graphs (KGs) and is an essential step of KG fusion. Current mainstream methods, neural EA models, rely on training with seed alignment, i.e., a set of pre-aligned entity pairs that is very costly to annotate. In this paper, we devise a novel Active Learning (AL) framework for neural EA, aiming to create highly informative seed alignment to obtain more effective EA models with less annotation cost. Our framework tackles two main challenges encountered when applying AL to EA: (1) How to exploit dependencies between entities within the AL strategy. Most AL strategies assume that the data instances to sample are independent and identically distributed. However, entities in KGs are related. To address this challenge, we propose a structure-aware uncertainty sampling strategy that measures the uncertainty of each entity as well as its impact on its neighbour entities in the KG. (2) How to recognise entities that appear in one KG but not in the other KG (i.e., bachelors). Identifying bachelors would likely save annotation budget. To address this challenge, we devise a bachelor recognizer designed to alleviate the effect of sampling bias. Empirical results show that our proposed AL strategy can significantly improve sampling quality, with good generality across different datasets, EA models, and numbers of bachelors.
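The abstract does not state how the structure-aware score is computed, so the following is only a minimal sketch of the general idea, not the paper's formula. It assumes each entity comes with a probability distribution over candidate matches, mixes an entity's own entropy with the mean entropy of its one-hop neighbours, and skips entities that a bachelor recognizer flags. The names `structure_aware_scores`, `select_batch`, `alpha`, and `threshold` are hypothetical and chosen for illustration.

```python
import math


def entropy(probs):
    """Shannon entropy (natural log) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def structure_aware_scores(match_probs, neighbours, alpha=0.5):
    """Mix each entity's own uncertainty with that of its KG neighbours.

    match_probs: entity -> probabilities over candidate matches (assumed input)
    neighbours:  entity -> list of neighbouring entities in the same KG
    alpha:       hypothetical weight on the neighbourhood term
    """
    own = {e: entropy(p) for e, p in match_probs.items()}
    scores = {}
    for e, u in own.items():
        nbrs = neighbours.get(e, [])
        # Mean neighbour uncertainty; 0.0 for isolated entities.
        nbr_u = sum(own[n] for n in nbrs) / len(nbrs) if nbrs else 0.0
        scores[e] = (1 - alpha) * u + alpha * nbr_u
    return scores


def select_batch(scores, bachelor_prob, budget, threshold=0.5):
    """Pick the highest-scoring entities not predicted to be bachelors."""
    candidates = [e for e in scores if bachelor_prob.get(e, 0.0) < threshold]
    return sorted(candidates, key=lambda e: scores[e], reverse=True)[:budget]


# Toy KG with three entities; all values are made up for illustration.
match_probs = {"a": [0.5, 0.5], "b": [1.0, 0.0], "c": [0.9, 0.1]}
neighbours = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
scores = structure_aware_scores(match_probs, neighbours)
batch = select_batch(scores, bachelor_prob={"a": 0.9}, budget=2)
```

In this toy example, entity "b" has no uncertainty of its own but still ranks above "c" because its neighbours are uncertain, and "a" is excluded from the batch because it is assumed to be a likely bachelor.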
Anthology ID:
2021.emnlp-main.270
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
3364–3374
URL:
https://aclanthology.org/2021.emnlp-main.270
DOI:
10.18653/v1/2021.emnlp-main.270
Cite (ACL):
Bing Liu, Harrisen Scells, Guido Zuccon, Wen Hua, and Genghong Zhao. 2021. ActiveEA: Active Learning for Neural Entity Alignment. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 3364–3374, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
ActiveEA: Active Learning for Neural Entity Alignment (Liu et al., EMNLP 2021)
PDF:
https://aclanthology.org/2021.emnlp-main.270.pdf
Video:
https://aclanthology.org/2021.emnlp-main.270.mp4
Code:
uq-neusoft-health-data-science/activeea