Assessing the Asymmetric Behaviour of Italian Large Language Models across Different Syntactic Structures

Elena Sofia Ruzzetti; Federico Ranaldi; Dario Onorati; Davide Venditti; Leonardo Ranaldi; Tommaso Caselli; Fabio Massimo Zanzotto

Assessing the Asymmetric Behaviour of Italian Large Language Models across Different Syntactic Structures

Elena Sofia Ruzzetti, Federico Ranaldi, Dario Onorati, Davide Venditti, Leonardo Ranaldi, Tommaso Caselli, Fabio Massimo Zanzotto

Abstract

While LLMs get more proficient at solving tasks and generating sentences, we aim to investigate the role that differentsyntactic structures have on models’ performances on a battery of Natural Language Understanding tasks. We analyze theperformance of five LLMs on semantically equivalent sentences that are characterized by different syntactic structures. Tocorrectly solve the tasks, a model is implicitly required to correctly parse the sentence. We found out that LLMs strugglewhen there are more complex syntactic structures, with an average drop of 16.13(±11.14) points in accuracy on Q&A task.Additionally, we propose a method based on token attribution to spot which area of the LLMs encode syntactic knowledge,by identifying model heads and layers responsible for the generation of a correct answer

Anthology ID:: 2024.clicit-1.92
Volume:: Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:: December
Year:: 2024
Address:: Pisa, Italy
Editors:: Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:: CLiC-it
SIG:
Publisher:: CEUR Workshop Proceedings
Note:
Pages:: 854–863
Language:
URL:: https://aclanthology.org/2024.clicit-1.92/
DOI:
Bibkey:
Cite (ACL):: Elena Sofia Ruzzetti, Federico Ranaldi, Dario Onorati, Davide Venditti, Leonardo Ranaldi, Tommaso Caselli, and Fabio Massimo Zanzotto. 2024. Assessing the Asymmetric Behaviour of Italian Large Language Models across Different Syntactic Structures. In Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), pages 854–863, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):: Assessing the Asymmetric Behaviour of Italian Large Language Models across Different Syntactic Structures (Ruzzetti et al., CLiC-it 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.clicit-1.92.pdf

PDF Cite Search Fix data