ABDN-NLP at CoMeDi Shared Task: Predicting the Aggregated Human Judgment via Weighted Few-Shot Prompting

Ying Xuan Loke; Dominik Schlechtweg; Wei Zhao

ABDN-NLP at CoMeDi Shared Task: Predicting the Aggregated Human Judgment via Weighted Few-Shot Prompting

Ying Xuan Loke, Dominik Schlechtweg, Wei Zhao

Abstract

Human annotation is notorious for being subjective and expensive. Recently, (CITATION) introduced the CoMeDi shared task aiming to address this issue by predicting human annotations on the semantic proximity between word uses, and estimating the variation of the human annotations. However, distinguishing the proximity between word uses can be challenging, when their semantic difference is subtle. In this work, we focus on predicting the aggregated annotator judgment of semantic proximity by using a large language model fine-tuned on 20 examples with various proximity classes. To distinguish nuanced proximity, we propose a weighted few-shot approach that pays greater attention to the proximity classes identified as important during fine-tuning. We evaluate our approach in the CoMeDi shared task across 7 languages. Our results demonstrate the superiority of our approach over zero-shot and standard few-shot counterparts. While useful, the weighted few-shot should be applied with caution, given that it relies on development sets to compute the importance of proximity classes, and thus may not generalize well to real-world scenarios where the distribution of class importance is different.

Anthology ID:: 2025.comedi-1.12
Volume:: Proceedings of Context and Meaning: Navigating Disagreements in NLP Annotation
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Michael Roth, Dominik Schlechtweg
Venues:: CoMeDi | WS
SIG:
Publisher:: International Committee on Computational Linguistics
Note:
Pages:: 122–128
Language:
URL:: https://aclanthology.org/2025.comedi-1.12/
DOI:
Bibkey:
Cite (ACL):: Ying Xuan Loke, Dominik Schlechtweg, and Wei Zhao. 2025. ABDN-NLP at CoMeDi Shared Task: Predicting the Aggregated Human Judgment via Weighted Few-Shot Prompting. In Proceedings of Context and Meaning: Navigating Disagreements in NLP Annotation, pages 122–128, Abu Dhabi, UAE. International Committee on Computational Linguistics.
Cite (Informal):: ABDN-NLP at CoMeDi Shared Task: Predicting the Aggregated Human Judgment via Weighted Few-Shot Prompting (Loke et al., CoMeDi 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.comedi-1.12.pdf

PDF Cite Search Fix data