Enhancing Cohesion and Coherence of Fake Text to Improve Believability for Deceiving Cyber Attackers

Prakruthi Karuna, Hemant Purohit, Özlem Uzuner, Sushil Jajodia, Rajesh Ganesan


Abstract
Ever-increasing ransomware attacks and thefts of intellectual property demand cybersecurity solutions to protect critical documents. One emerging solution is to place fake text documents in the repository of critical documents to deceive and catch cyber attackers. We can generate fake text documents by obscuring the salient information in legitimate text documents. However, the obscuring process can introduce linguistic inconsistencies, such as broken co-references and an illogical flow of ideas across sentences, which can expose the document as fake and render it unbelievable. In this paper, we propose a novel method to generate believable fake text documents by automatically improving the linguistic consistency of computer-generated fake text. Our method focuses on enhancing syntactic cohesion and semantic coherence across discourse segments. We conduct experiments with human subjects to evaluate the effect of believability improvements on distinguishing legitimate texts from fake texts. Results show that the probability of distinguishing legitimate texts from believable fake texts is consistently lower than that of distinguishing them from fake texts that have not been improved in believability. This indicates the effectiveness of our method in generating believable fake text.
Anthology ID: W18-4104
Volume: Proceedings of the First International Workshop on Language Cognition and Computational Models
Month: August
Year: 2018
Address: Santa Fe, New Mexico, USA
Editors: Manjira Sinha, Tirthankar Dasgupta
Venue: LCCM
Publisher: Association for Computational Linguistics
Pages: 31–40
URL: https://aclanthology.org/W18-4104
Cite (ACL): Prakruthi Karuna, Hemant Purohit, Özlem Uzuner, Sushil Jajodia, and Rajesh Ganesan. 2018. Enhancing Cohesion and Coherence of Fake Text to Improve Believability for Deceiving Cyber Attackers. In Proceedings of the First International Workshop on Language Cognition and Computational Models, pages 31–40, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal): Enhancing Cohesion and Coherence of Fake Text to Improve Believability for Deceiving Cyber Attackers (Karuna et al., LCCM 2018)
PDF: https://aclanthology.org/W18-4104.pdf