Deja Vu at SemEval 2024 Task 9: A Comparative Study of Advanced Language Models for Commonsense Reasoning

Trina Chakraborty; Marufur Rahman; Omar Riyad

doi:10.18653/v1/2024.semeval-1.180

Deja Vu at SemEval 2024 Task 9: A Comparative Study of Advanced Language Models for Commonsense Reasoning

Trina Chakraborty, Marufur Rahman, Omar Riyad

Abstract

This research systematically forms an impression of the capabilities of advanced language models in addressing the BRAINTEASER task introduced at SemEval 2024, which is specifically designed to explore the models’ proficiency in lateral commonsense reasoning. The task sets forth an array of Sentence and Word Puzzles, carefully crafted to challenge the models with scenarios requiring unconventional thought processes. Our methodology encompasses a holistic approach, incorporating pre-processing of data, fine-tuning of transformer-based language models, and strategic data augmentation to explore the depth and flexibility of each model’s understanding. The preliminary results of our analysis are encouraging, highlighting significant potential for advancements in the models’ ability to engage in lateral reasoning. Further insights gained from post-competition evaluations suggest scopes for notable enhancements in model performance, emphasizing the continuous evolution of the models in mastering complex reasoning tasks.

Anthology ID:: 2024.semeval-1.180
Volume:: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1239–1244
Language:
URL:: https://aclanthology.org/2024.semeval-1.180/
DOI:: 10.18653/v1/2024.semeval-1.180
Bibkey:
Cite (ACL):: Trina Chakraborty, Marufur Rahman, and Omar Riyad. 2024. Deja Vu at SemEval 2024 Task 9: A Comparative Study of Advanced Language Models for Commonsense Reasoning. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 1239–1244, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: Deja Vu at SemEval 2024 Task 9: A Comparative Study of Advanced Language Models for Commonsense Reasoning (Chakraborty et al., SemEval 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.semeval-1.180.pdf
Supplementarymaterial:: 2024.semeval-1.180.SupplementaryMaterial.txt

PDF Cite Search Supplementarymaterial Fix data