Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness

Manas Madine


Abstract
This paper investigates the robustness of Large Language Models (LLMs) against out-of-distribution (OOD) data within the context of sentiment analysis. Traditional fine-tuning approaches often fail to generalize effectively across different data distributions, limiting the practical deployment of LLMs in dynamic real-world scenarios. To address this challenge, we introduce a novel method called “Semantic Rewriting,” which leverages the inherent flexibility of LLMs to align both in-distribution (ID) and OOD data with the LLM's own distribution. By semantically transforming sentences to minimize linguistic discrepancies, our approach helps to standardize features across datasets, thus enhancing model robustness. We conduct extensive experiments with several benchmark datasets and LLMs to validate the efficacy of our method. The results demonstrate that Semantic Rewriting significantly improves the performance of models on OOD tasks, outperforming traditional methods in both robustness and generalization capabilities. Our findings suggest that Semantic Rewriting is a promising technique for developing more reliable and versatile NLP systems capable of performing robustly across diverse operational environments.
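The rewriting step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the prompt, the model, or the downstream classifier, so everything named here (`REWRITE_PROMPT`, `build_prompt`, `toy_rewriter`, `rewrite_all`) is a hypothetical stand-in; in particular, `toy_rewriter` is a small rule-based function used in place of a real LLM call so that the example runs on its own. The only point it makes is that the same rewriter is applied to both ID and OOD text before any sentiment model sees it.

```python
from typing import Callable, Iterable, List

# Hypothetical prompt: the wording used in the paper is not given in the abstract.
REWRITE_PROMPT = (
    "Rewrite the following sentence in plain, neutral English, "
    "preserving its sentiment:\n{text}"
)


def build_prompt(text: str) -> str:
    """Wrap a raw sentence in the (assumed) rewriting instruction."""
    return REWRITE_PROMPT.format(text=text)


def toy_rewriter(prompt: str) -> str:
    """Stand-in for an LLM call (assumption, not the paper's model).

    It lowercases the sentence and expands a few slang tokens, which is
    enough to show two differently styled inputs converging on one form.
    """
    text = prompt.split("\n", 1)[1]  # drop the instruction line
    slang = {"gr8": "great", "luv": "love", "meh": "mediocre"}
    return " ".join(slang.get(word, word) for word in text.lower().split())


def rewrite_all(texts: Iterable[str], llm: Callable[[str], str]) -> List[str]:
    """Apply the same rewriter to every sentence, ID or OOD alike."""
    return [llm(build_prompt(t)) for t in texts]


# ID-style (review prose) and OOD-style (informal post) inputs.
id_texts = ["This movie was great"]
ood_texts = ["This movie was GR8"]

id_rewritten = rewrite_all(id_texts, toy_rewriter)
ood_rewritten = rewrite_all(ood_texts, toy_rewriter)
print(id_rewritten, ood_rewritten)
```

In a real setup the `llm` argument would be a function that sends the prompt to a language model, and the rewritten ID text would be used for fine-tuning while the rewritten OOD text would be used at evaluation time; that split is an assumption drawn from the abstract's description, not a detail it states.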
Anthology ID:
2024.acl-srw.39
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Xiyan Fu, Eve Fleisig
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
334–344
URL:
https://aclanthology.org/2024.acl-srw.39
DOI:
10.18653/v1/2024.acl-srw.39
Cite (ACL):
Manas Madine. 2024. Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 334–344, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness (Madine, ACL 2024)
PDF:
https://aclanthology.org/2024.acl-srw.39.pdf