There is No Big Brother or Small Brother:Knowledge Infusion in Language Models for Link Prediction and Question Answering

Ankush Agarwal, Sakharam Gawade, Sachin Channabasavarajendra, Pushpak Bhattacharya


Abstract
The integration of knowledge graphs with deep learning is thriving in improving the performance of various natural language processing (NLP) tasks. In this paper, we focus on knowledge-infused link prediction and question answering using language models, T5, and BLOOM across three domains:Aviation, Movie, and Web. In this context, we infuse knowledge in large and small language models and study their performance, and find the performance to be similar. For the link prediction task on the Aviation Knowledge Graph, we obtain a 0.2 hits@1 score using T5-small, T5-base, T5-large, and BLOOM. Using template-based scripts, we create a set of 1 million synthetic factoid QA pairs in the aviation domain from National Transportation Safety Board (NTSB) reports. On our curated QA pairs, the three models of T5 achieve a 0.7 hits@1 score. We validate our findings with the paired student t test and Cohen’s kappa scores. For link prediction on Aviation Knowledge Graph using T5-small and T5-large, we obtain a Cohen’s kappa score of 0.76, showing substantial agreement between the models. Thus, we infer that small language models perform similar to large language models with the infusion of knowledge.
Anthology ID:
2022.icon-main.26
Volume:
Proceedings of the 19th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2022
Address:
New Delhi, India
Editors:
Md. Shad Akhtar, Tanmoy Chakraborty
Venue:
ICON
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
204–211
Language:
URL:
https://aclanthology.org/2022.icon-main.26
DOI:
Bibkey:
Cite (ACL):
Ankush Agarwal, Sakharam Gawade, Sachin Channabasavarajendra, and Pushpak Bhattacharya. 2022. There is No Big Brother or Small Brother:Knowledge Infusion in Language Models for Link Prediction and Question Answering. In Proceedings of the 19th International Conference on Natural Language Processing (ICON), pages 204–211, New Delhi, India. Association for Computational Linguistics.
Cite (Informal):
There is No Big Brother or Small Brother:Knowledge Infusion in Language Models for Link Prediction and Question Answering (Agarwal et al., ICON 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.icon-main.26.pdf
Document:
 2022.icon-main.26.document.pdf