Optimizing Multilingual Euphemism Detection using Low-Rank Adaption Within and Across Languages

Nicholas Hankins


Abstract
This short paper presents an investigation into the effectiveness of various classification methods as a submission in the Multilingual Euphemism Detection Shared Task for the Fourth Workshop on Figurative Language Processing co-located with NAACL 2024. The process used by the participant utilizes pre-trained large language models combined with parameter efficient fine-tuning methods, specifically Low-Rank Adaptation (LoRA), in classifying euphemisms across four different languages - Mandarin Chinese, American English, Spanish, and Yorùbá. The study is comprised of three main components that aim to explore heuristic methods to navigate how base models can most efficiently be fine-tuned into classifiers to learn figurative language. Multilingual labeled training data was utilized to fine-tune classifiers for each language, and later combined for one large classifier, while unseen test data was finally used to evaluate the accuracy of the best performing classifiers. In addition, cross-lingual tests were conducted by applying each language’s data on each of the other language’s classifiers. All of the results provide insights into the potential of pre-trained base models combined with LoRA fine-tuning methods in accurately classifying euphemisms across and within different languages.
Anthology ID:
2024.figlang-1.2
Volume:
Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024)
Month:
June
Year:
2024
Address:
Mexico City, Mexico (Hybrid)
Editors:
Debanjan Ghosh, Smaranda Muresan, Anna Feldman, Tuhin Chakrabarty, Emmy Liu
Venues:
Fig-Lang | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8–14
Language:
URL:
https://aclanthology.org/2024.figlang-1.2
DOI:
10.18653/v1/2024.figlang-1.2
Bibkey:
Cite (ACL):
Nicholas Hankins. 2024. Optimizing Multilingual Euphemism Detection using Low-Rank Adaption Within and Across Languages. In Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024), pages 8–14, Mexico City, Mexico (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Optimizing Multilingual Euphemism Detection using Low-Rank Adaption Within and Across Languages (Hankins, Fig-Lang-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.figlang-1.2.pdf