Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset

Sheza Munir; Wassay Sajjad; Mukeet Raza; Emaan Abbas; Abdul Hameed Azeemi; Ihsan Ayyub Qazi; Agha Ali Raza

Abstract

Deepfakes, particularly in the auditory domain, have become a significant threat, necessitating the development of robust countermeasures. This paper addresses the escalating challenges posed by deepfake attacks on Automatic Speaker Verification (ASV) systems. We present a novel Urdu deepfake audio dataset for deepfake detection, focusing on two spoofing attacks: Tacotron and VITS TTS. The dataset construction involves careful consideration of phonemic coverage and balance, and a comparison with existing corpora such as PRUS and PronouncUR. Evaluation with the AASIST-L model yields EERs of 0.495 and 0.524 for VITS TTS and Tacotron-generated audio, respectively, with variability across speakers. Further, this research includes a detailed human evaluation: a user study that gauges whether people can distinguish deepfake audio from real (bonafide) audio. The ROC curve analysis shows an area under the curve (AUC) of 0.63, indicating that individuals have a limited ability to detect deepfakes (approximately 1 in 3 fake audio samples is judged to be real). Our work contributes a valuable resource for training deepfake detection models in low-resource languages like Urdu, addressing a critical gap in existing datasets. The dataset is publicly available at: https://github.com/CSALT-LUMS/urdu-deepfake-dataset.
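The two metrics quoted in the abstract, equal error rate (EER) and ROC AUC, can both be read off the same ROC sweep. The following is a minimal sketch of how they might be computed from detector scores; it is not the authors' evaluation code, and the function names, the label convention (1 = deepfake, 0 = bonafide), and the toy scores are assumptions made for illustration. For reference, a detector that scores at random would obtain an EER of about 0.5.

```python
def roc_points(labels, scores):
    """Sweep thresholds from high to low and return (fpr, tpr) lists.

    Assumed convention: label 1 = deepfake (positive), 0 = bonafide,
    and a higher score means "more likely deepfake".
    """
    pos = sum(labels)
    neg = len(labels) - pos
    ranked = sorted(zip(scores, labels), key=lambda p: -p[0])
    fpr, tpr = [0.0], [0.0]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Consume all samples tied at this score before emitting a point.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        fpr.append(fp / neg)
        tpr.append(tp / pos)
    return fpr, tpr


def auc(fpr, tpr):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum(
        (fpr[k] - fpr[k - 1]) * (tpr[k] + tpr[k - 1]) / 2
        for k in range(1, len(fpr))
    )


def eer(fpr, tpr):
    """Approximate EER: the ROC point where the false positive rate and
    the false negative rate (1 - TPR) are closest, averaged."""
    best = min(range(len(fpr)), key=lambda k: abs(fpr[k] - (1 - tpr[k])))
    return (fpr[best] + (1 - tpr[best])) / 2


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    toy_labels = [1, 1, 1, 0, 0, 0]
    toy_scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
    f, t = roc_points(toy_labels, toy_scores)
    print("AUC:", auc(f, t))
    print("EER:", eer(f, t))
```

This version picks the nearest ROC point rather than interpolating between points, so on small samples the EER is only approximate.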

Anthology ID: 2024.findings-acl.861
Volume: Findings of the Association for Computational Linguistics ACL 2024
Month: August
Year: 2024
Address: Bangkok, Thailand and virtual meeting
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 14470–14480
URL: https://aclanthology.org/2024.findings-acl.861
Cite (ACL): Sheza Munir, Wassay Sajjad, Mukeet Raza, Emaan Abbas, Abdul Hameed Azeemi, Ihsan Ayyub Qazi, and Agha Ali Raza. 2024. Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset. In Findings of the Association for Computational Linguistics ACL 2024, pages 14470–14480, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal): Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset (Munir et al., Findings 2024)
PDF: https://aclanthology.org/2024.findings-acl.861.pdf