A Unified Generative Framework for Bilingual Euphemism Detection and Identification

Yuxue Hu; Junsong Li; Tongguan Wang; Dongyu Su; Guixin Su; Ying Sha

doi:10.18653/v1/2024.findings-acl.403

A Unified Generative Framework for Bilingual Euphemism Detection and Identification

Yuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, Ying Sha

Abstract

Various euphemisms are emerging in social networks, attracting widespread attention from the natural language processing community. However, existing euphemism datasets are only domain-specific or language-specific. In addition, existing approaches to the study of euphemisms are one-sided. Either only the euphemism detection task or only the euphemism identification task is accomplished, lacking a unified framework. To this end, we construct a large-scale Bilingual Multi-category dataset of Euphemisms named BME, which covers a total of 12 categories for two languages, English and Chinese. Then, we first propose a unified generative model to Jointly conduct the tasks of bilingual Euphemism Detection and Identification named JointEDI. By comparing with LLMs and human evaluation, we demonstrate the effectiveness of the proposed JointEDI and the feasibility of unifying euphemism detection and euphemism identification tasks. Moreover, the BME dataset also provides a new reference standard for euphemism detection and euphemism identification.

Anthology ID:: 2024.findings-acl.403
Volume:: Findings of the Association for Computational Linguistics: ACL 2024
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6753–6766
Language:
URL:: https://aclanthology.org/2024.findings-acl.403/
DOI:: 10.18653/v1/2024.findings-acl.403
Bibkey:
Cite (ACL):: Yuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, and Ying Sha. 2024. A Unified Generative Framework for Bilingual Euphemism Detection and Identification. In Findings of the Association for Computational Linguistics: ACL 2024, pages 6753–6766, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: A Unified Generative Framework for Bilingual Euphemism Detection and Identification (Hu et al., Findings 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.findings-acl.403.pdf

PDF Cite Search Fix data