Feature-Level Ensemble Learning for Robust Synthetic Text Detection with DeBERTaV3 and XLM-RoBERTa

Saman Sarker Joy; Tanusree Das Aishi

Feature-Level Ensemble Learning for Robust Synthetic Text Detection with DeBERTaV3 and XLM-RoBERTa

Abstract

As large language models, or LLMs, continue to advance in recent years, they require the development of a potent system to detect whether a text was created by a human or an LLM in order to prevent the unethical use of LLMs. To address this challenge, ALTA Shared Task 2023 introduced a task to build an automatic detection system that can discriminate between human-authored and synthetic text generated by LLMs. In this paper, we present our participation in this task where we proposed a feature-level ensemble of two transformer models namely DeBERTaV3 and XLM-RoBERTa to come up with a robust system. The given dataset consisted of textual data with two labels where the task was binary classification. Experimental results show that our proposed method achieved competitive performance among the participants. We believe this solution would make an impact and provide a feasible solution for detection of synthetic text detection.

Anthology ID:: 2023.alta-1.21
Volume:: Proceedings of the 21st Annual Workshop of the Australasian Language Technology Association
Month:: November
Year:: 2023
Address:: Melbourne, Australia
Editors:: Smaranda Muresan, Vivian Chen, Kennington Casey, Vandyke David, Dethlefs Nina, Inoue Koji, Ekstedt Erik, Ultes Stefan
Venue:: ALTA
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 169–172
Language:
URL:: https://aclanthology.org/2023.alta-1.21/
DOI:
Bibkey:
Cite (ACL):: Saman Sarker Joy and Tanusree Das Aishi. 2023. Feature-Level Ensemble Learning for Robust Synthetic Text Detection with DeBERTaV3 and XLM-RoBERTa. In Proceedings of the 21st Annual Workshop of the Australasian Language Technology Association, pages 169–172, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Feature-Level Ensemble Learning for Robust Synthetic Text Detection with DeBERTaV3 and XLM-RoBERTa (Joy & Aishi, ALTA 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.alta-1.21.pdf

PDF Cite Search Fix data