Rinaldo Gagiano
2023
A Prompt in the Right Direction: Prompt Based Classification of Machine-Generated Text Detection
Rinaldo Gagiano | Lin Tian
Proceedings of the 21st Annual Workshop of the Australasian Language Technology Association
The goal of the ALTA 2023 Shared Task is to distinguish between human-authored text and synthetic text generated by Large Language Models (LLMs). Given the growing societal concerns surrounding LLMs, this task addresses the urgent need for robust text verification strategies. In this paper, we describe our method: a fine-tuned Falcon-7B model with label smoothing incorporated into the training process. We applied model prompting to samples with lower confidence scores to enhance prediction accuracy. Our model achieved a statistically significant accuracy of 0.991.
2021
Robustness Analysis of Grover for Machine-Generated News Detection
Rinaldo Gagiano | Maria Myung-Hee Kim | Xiuzhen Zhang | Jennifer Biggs
Proceedings of the 19th Annual Workshop of the Australasian Language Technology Association
Advancements in Natural Language Generation have raised concerns about its potential misuse for deep fake news. Grover is a model for both generation and detection of neural fake news. While its performance on automatically discriminating neural fake news surpassed GPT-2 and BERT, Grover could face a variety of adversarial attacks designed to deceive detection. In this work, we present an investigation of Grover's susceptibility to adversarial attacks such as character-level and word-level perturbations. The experimental results show that even a single character alteration can cause Grover to fail, affecting up to 97% of target articles with unlimited attack attempts, exposing a lack of robustness. We further analyse these misclassified cases to highlight affected words, identify vulnerability within Grover's encoder, and perform a novel visualisation of cumulative classification scores to assist in interpreting model behaviour.