COGNAC at SemEval-2026 Task 4: Evaluating Narrative Components with LLMs for Hard Story Similarity Cases

Tisa Islam Erana; Azwad Anjum Islam; Anshu Sharma; Mark Finlayson

COGNAC at SemEval-2026 Task 4: Evaluating Narrative Components with LLMs for Hard Story Similarity Cases

Tisa Islam Erana, Azwad Anjum Islam, Anshu Sharma, Mark Finlayson

Abstract

This paper presents a two-stage system for the SemEval-2026 shared task on narrative similarity. The task defines similarity in terms of three components: abstract theme, course of action, and outcome. For Track A, the system first applies majority voting over multiple independent large language model (LLM) judgments to handle high-agreement (easy) cases. For low-agreement (difficult) cases, it routes examples to a second stage that decomposes stories into theme, course of action, and outcome, and either (i) scores these components individually with learned weights or (ii) uses structured chain-of-thought prompting to compare stories along the three dimensions. This two-stage approach improves robustness on difficult examples and achieves first place with 0.78 test accuracy. For Track B, the system generates embeddings of full stories and of individual narrative components using several embedding models. Experiments show that embeddings derived from the course-of-action component alone yield the best performance, achieving 0.72 accuracy and ranking first. Additional analyses reveal substantial annotation variability in the dataset and highlight the importance of handling ambiguity and disagreement when modeling narrative similarity.

Anthology ID:: 2026.semeval-1.290
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2290–2300
Language:
URL:: https://aclanthology.org/2026.semeval-1.290/
DOI:
Bibkey:
Cite (ACL):: Tisa Islam Erana, Azwad Anjum Islam, Anshu Sharma, and Mark Finlayson. 2026. COGNAC at SemEval-2026 Task 4: Evaluating Narrative Components with LLMs for Hard Story Similarity Cases. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2290–2300, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: COGNAC at SemEval-2026 Task 4: Evaluating Narrative Components with LLMs for Hard Story Similarity Cases (Erana et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.290.pdf
Supplementarymaterial:: 2026.semeval-1.290.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data