MUSTS: MUltilingual Semantic Textual Similarity Benchmark

Tharindu Ranasinghe; Hansi Hettiarachchi; Constantin Orasan; Ruslan Mitkov

doi:10.18653/v1/2025.acl-short.27

MUSTS: MUltilingual Semantic Textual Similarity Benchmark

Tharindu Ranasinghe, Hansi Hettiarachchi, Constantin Orasan, Ruslan Mitkov

Abstract

Predicting semantic textual similarity (STS) is a complex and ongoing challenge in natural language processing (NLP). Over the years, researchers have developed a variety of supervised and unsupervised approaches to calculate STS automatically. Additionally, various benchmarks, which include STS datasets, have been established to consistently evaluate and compare these STS methods. However, they largely focus on high-resource languages, mixed with datasets annotated focusing on relatedness instead of similarity and containing automatically translated instances. Therefore, no dedicated benchmark for multilingual STS exists. To solve this gap, we introduce the Multilingual Semantic Textual Similarity Benchmark (MUSTS), which spans 13 languages, including low-resource languages. By evaluating more than 25 models on MUSTS, we establish the most comprehensive benchmark of multilingual STS methods. Our findings confirm that STS remains a challenging task, particularly for low-resource languages.

Anthology ID:: 2025.acl-short.27
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 331–353
Language:
URL:: https://aclanthology.org/2025.acl-short.27/
DOI:: 10.18653/v1/2025.acl-short.27
Bibkey:
Cite (ACL):: Tharindu Ranasinghe, Hansi Hettiarachchi, Constantin Orasan, and Ruslan Mitkov. 2025. MUSTS: MUltilingual Semantic Textual Similarity Benchmark. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 331–353, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: MUSTS: MUltilingual Semantic Textual Similarity Benchmark (Ranasinghe et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-short.27.pdf

PDF Cite Search Fix data