Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss

Bowen Zhang; Chunping Li

doi:10.18653/v1/2024.emnlp-main.663

Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss

Abstract

Since the introduction of BERT and RoBERTa, research on Semantic Textual Similarity (STS) has made groundbreaking progress. Particularly, the adoption of contrastive learning has substantially elevated state-of-the-art performance across various STS benchmarks. However, contrastive learning categorizes text pairs as either semantically similar or dissimilar, failing to leverage fine-grained annotated information and necessitating large batch sizes to prevent model collapse. These constraints pose challenges for researchers engaged in STS tasks that involve nuanced similarity levels or those with limited computational resources, compelling them to explore alternatives like Sentence-BERT. Despite its efficiency, Sentence-BERT tackles STS tasks from a classification perspective, overlooking the progressive nature of semantic relationships, which results in suboptimal performance. To bridge this gap, this paper presents an innovative regression framework and proposes two simple yet effective loss functions: Translated ReLU and Smooth K2 Loss. Experimental results demonstrate that our method achieves convincing performance across seven established STS benchmarks and offers the potential for further optimization of contrastive learning pre-trained models.

Anthology ID:: 2024.emnlp-main.663
Volume:: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2024
Address:: Miami, Florida, USA
Editors:: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 11882–11893
Language:
URL:: https://aclanthology.org/2024.emnlp-main.663/
DOI:: 10.18653/v1/2024.emnlp-main.663
Bibkey:
Cite (ACL):: Bowen Zhang and Chunping Li. 2024. Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 11882–11893, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):: Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss (Zhang & Li, EMNLP 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.emnlp-main.663.pdf
Software:: 2024.emnlp-main.663.software.zip
Data:: 2024.emnlp-main.663.data.zip

PDF Cite Search Software Data Fix data