CUCLASIC at SemEval-2026 Task 5: LLM Prompting Strategies for Rating Ambiguous Word Senses

Federico Ortega Riba; Jasper Wilkerson; Kelsey Lafreniere Adams

CUCLASIC at SemEval-2026 Task 5: LLM Prompting Strategies for Rating Ambiguous Word Senses

Federico Ortega Riba, Jasper Wilkerson, Kelsey Lafreniere Adams

Abstract

Word sense disambiguation has been a foundational task in computational semantics since the 1990s, but remains an unsolved problem when it comes to bridging human and computational evaluation of ambiguity. The SemEval-2026 Task 5 attempts to address this gap. We test six Large Language Models (LLMs) from the Llama and Gemini families in order to evaluate LLMs’ ratings of ambiguous textual excerpts, experimenting with zero- and few-shot variants of prompts and analyzing how simple linguistic cues improve performance. We propose a methodology of eliciting human-like ratings from language models by using examples with low and high standard deviations between human ratings. We further evaluate and compare the prediction patterns of different models and how they align with the human generated ratings. Our best model (Gemini 3-Flash) achieves a 75% score combining Spearman correlation and accuracy within one standard deviation.

Anthology ID:: 2026.semeval-1.121
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 886–893
Language:
URL:: https://aclanthology.org/2026.semeval-1.121/
DOI:
Bibkey:
Cite (ACL):: Federico Ortega Riba, Jasper Wilkerson, and Kelsey Lafreniere Adams. 2026. CUCLASIC at SemEval-2026 Task 5: LLM Prompting Strategies for Rating Ambiguous Word Senses. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 886–893, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: CUCLASIC at SemEval-2026 Task 5: LLM Prompting Strategies for Rating Ambiguous Word Senses (Ortega Riba et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.121.pdf
Supplementarymaterial:: 2026.semeval-1.121.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data