Can GPT4 Detect Euphemisms across Multiple Languages?

Todd Firsich, Anthony Rios


Abstract
A euphemism is a word or phrase used in place of another word or phrase that might be considered harsh, blunt, unpleasant, or offensive. Euphemisms generally soften the impact of what is being said, making it more palatable or appropriate for the context or audience. Euphemisms can vary significantly between languages, reflecting cultural sensitivities and taboos, and what might be a mild expression in one language could carry a stronger connotation or be completely misunderstood in another. This paper uses prompting techniques to evaluate OpenAI’s GPT4 for detecting euphemisms across multiple languages as part of the 2024 FigLang shared task. We evaluate both zero-shot and few-shot approaches. Our method achieved an average macro F1 of .732, ranking first in the competition. Moreover, we found that GPT4 does not perform uniformly across all languages, with a difference of .233 between the best (English .831) and the worst (Spanish .598) languages.
Anthology ID:
2024.figlang-1.9
Volume:
Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024)
Month:
June
Year:
2024
Address:
Mexico City, Mexico (Hybrid)
Editors:
Debanjan Ghosh, Smaranda Muresan, Anna Feldman, Tuhin Chakrabarty, Emmy Liu
Venues:
Fig-Lang | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
65–72
Language:
URL:
https://aclanthology.org/2024.figlang-1.9
DOI:
10.18653/v1/2024.figlang-1.9
Bibkey:
Cite (ACL):
Todd Firsich and Anthony Rios. 2024. Can GPT4 Detect Euphemisms across Multiple Languages?. In Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024), pages 65–72, Mexico City, Mexico (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Can GPT4 Detect Euphemisms across Multiple Languages? (Firsich & Rios, Fig-Lang-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.figlang-1.9.pdf