Automatic Detection of Entity-Manipulated Text using Factual Knowledge

Ganesh Jawahar, Muhammad Abdul-Mageed, Laks Lakshmanan


Abstract
In this work, we focus on the problem of distinguishing a human-written news article from a news article that is created by manipulating entities in a human-written news article (e.g., replacing entities with factually incorrect entities). Such manipulated articles can mislead the reader by posing as human-written news articles. We propose a neural-network-based detector that detects manipulated news articles by reasoning about the facts mentioned in the article. Our proposed detector exploits factual knowledge via a graph convolutional neural network along with the textual information in the news article. We also create challenging datasets for this task by considering various strategies to generate the new replacement entity (e.g., entity generation from GPT-2). In all the settings, our proposed model either matches or outperforms the state-of-the-art detector in terms of accuracy. Our code and data are available at https://github.com/UBC-NLP/manipulated_entity_detection.
Anthology ID:
2022.acl-short.10
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
86–93
URL:
https://aclanthology.org/2022.acl-short.10
DOI:
10.18653/v1/2022.acl-short.10
Cite (ACL):
Ganesh Jawahar, Muhammad Abdul-Mageed, and Laks Lakshmanan. 2022. Automatic Detection of Entity-Manipulated Text using Factual Knowledge. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 86–93, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Automatic Detection of Entity-Manipulated Text using Factual Knowledge (Jawahar et al., ACL 2022)
PDF:
https://aclanthology.org/2022.acl-short.10.pdf
Code:
ubc-nlp/manipulated_entity_detection
Data:
RealNews