ChatGPT as Your n-th Annotator: Experiments in Leveraging Large Language Models for Social Science Text Annotation in Slovak Language

Endre Hamerlik; Marek Šuppa; Miroslav Blšták; Jozef Kubík; Martin Takáč; Marian Simko; Andrej Findor

ChatGPT as Your n-th Annotator: Experiments in Leveraging Large Language Models for Social Science Text Annotation in Slovak Language

Endre Hamerlik, Marek Šuppa, Miroslav Blšták, Jozef Kubík, Martin Takáč, Marián Šimko, Andrej Findor

Abstract

Large Language Models (LLMs) are increasingly influential in Computational Social Science, offering new methods for processing and analyzing data, particularly in lower-resource language contexts. This study explores the use of OpenAI’s GPT-3.5 Turbo and GPT-4 for automating annotations for a unique news media dataset in a lower resourced language, focusing on stance classification tasks. Our results reveal that prompting in the native language, explanation generation, and advanced prompting strategies like Retrieval Augmented Generation and Chain of Thought prompting enhance LLM performance, particularly noting GPT-4’s superiority in predicting stance. Further evaluation indicates that LLMs can serve as a useful tool for social science text annotation in lower resourced languages, notably in identifying inconsistencies in annotation guidelines and annotated datasets.

Anthology ID:: 2024.cpss-1.6
Volume:: Proceedings of the 4th Workshop on Computational Linguistics for the Political and Social Sciences: Long and short papers
Month:: Sep
Year:: 2024
Address:: Vienna, Austria
Editors:: Christopher Klamm, Gabriella Lapesa, Simone Paolo Ponzetto, Ines Rehbein, Indira Sen
Venues:: cpss | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 81–89
Language:
URL:: https://aclanthology.org/2024.cpss-1.6/
DOI:
Bibkey:
Cite (ACL):: Endre Hamerlik, Marek Šuppa, Miroslav Blšták, Jozef Kubík, Martin Takáč, Marián Šimko, and Andrej Findor. 2024. ChatGPT as Your n-th Annotator: Experiments in Leveraging Large Language Models for Social Science Text Annotation in Slovak Language. In Proceedings of the 4th Workshop on Computational Linguistics for the Political and Social Sciences: Long and short papers, pages 81–89, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: ChatGPT as Your n-th Annotator: Experiments in Leveraging Large Language Models for Social Science Text Annotation in Slovak Language (Hamerlik et al., cpss 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.cpss-1.6.pdf

PDF Cite Search Fix data