Clutch or Cry at SemEval-2026 Task 12: Offline Retrieval-Augmented Generation with Frozen DeBERTa for Abductive Event Reasoning

Aayush Prasad; Rudra Trivedi; Arshad Khatib; Shrikant Malviya; Naveen Kumar

Clutch or Cry at SemEval-2026 Task 12: Offline Retrieval-Augmented Generation with Frozen DeBERTa for Abductive Event Reasoning

Aayush Prasad, Rudra Trivedi, Arshad Khatib, Shrikant Malviya, Naveen Kumar

Abstract

We present our system for SemEval-2026 Task 12 on abductive event reasoning. Initial experiments with direct fine-tuning of large language models suffered from severe overfitting due to limited training data, while smaller models failed under context-length constraints, leading to random guessing under the strict Exact Match evaluation metric. To address these challenges, we propose a two-stage offline Retrieval-Augmented Generation (RAG) pipeline that separates semantic evidence retrieval from multi-label classification. We employ a dense retriever (all-MiniLM-L6-v2) to extract the single most relevant sentence (top-k=1) and feed it into a partially frozen DeBERTa-v3-Large classifier trained with BCEWithLogitsLoss. Freezing the lower 12 layers effectively mitigates overfitting while preserving pre-trained semantic knowledge. Our approach eliminates long-context truncation issues, reduces hallucination, and achieves a final Exact Match accuracy of 0.72 on the official test set.

Anthology ID:: 2026.semeval-1.261
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2078–2081
Language:
URL:: https://aclanthology.org/2026.semeval-1.261/
DOI:
Bibkey:
Cite (ACL):: Aayush Prasad, Rudra Trivedi, Arshad Khatib, Shrikant Malviya, and Naveen Kumar. 2026. Clutch or Cry at SemEval-2026 Task 12: Offline Retrieval-Augmented Generation with Frozen DeBERTa for Abductive Event Reasoning. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2078–2081, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Clutch or Cry at SemEval-2026 Task 12: Offline Retrieval-Augmented Generation with Frozen DeBERTa for Abductive Event Reasoning (Prasad et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.261.pdf
Supplementarymaterial:: 2026.semeval-1.261.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data