Team HausaNLP at SemEval-2026 Task 4: Narratives via Semantic Embeddings

Faisal Muhammad Adam; Lukman Aliyu; Sani Aji

Team HausaNLP at SemEval-2026 Task 4: Narratives via Semantic Embeddings

Abstract

This paper presents Team HausaNLP’s submission to SemEval-2026 Task 4 (Track A),which requires identifying the more narrativelysimilar of two candidate stories relative to ananchor. Narrative similarity is defined alongthree dimensions: abstract theme, course ofaction, and story outcomes. We conduct a systematic ablation comparing five approaches:a lexical TF-IDF baseline, two bi-encoderSBERT variants (all-MiniLM-L6-v2 andall-mpnet-base-v2), a paraphrase-focusedembedding model, and a cross-encoder reranker. On the 200-instance development set,all-mpnet-base-v2 achieves the best performance (61.5% accuracy, 61.48 macro-F1), outperforming both TF-IDF (54.5%) and the official SBERT baseline (55.0%). Surprisingly,the cross-encoder re-ranker (55.5%) does notimprove on the bi-encoders, which we attributeto the long-document nature of Wikipedia storysummaries exceeding the model’s effective context window. On the official test set, our primary SBERT MiniLM submission achieved61.50% accuracy (33rd of 44 teams). Our erroranalysis over 200 development instances identifies five systematic failure categories, distinctfrom the All Correct / Partial cases, including23 Lexical Trap cases, 23 Hard Cases, and 24Proposed-Recovery cases, thereby informingconcrete directions for future work.

Anthology ID:: 2026.semeval-1.7
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 48–53
Language:
URL:: https://aclanthology.org/2026.semeval-1.7/
DOI:
Bibkey:
Cite (ACL):: Faisal Adam, Lukman Aliyu, and Sani Aji. 2026. Team HausaNLP at SemEval-2026 Task 4: Narratives via Semantic Embeddings. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 48–53, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Team HausaNLP at SemEval-2026 Task 4: Narratives via Semantic Embeddings (Adam et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.7.pdf
Supplementarymaterial:: 2026.semeval-1.7.SupplementaryMaterial.txt
Supplementarymaterial:: 2026.semeval-1.7.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Supplementarymaterial Fix data