La Non Canonica L’hai Studiata? Exploring LLMs and Sentence Canonicity in Italian

Claudiu Hromei, Danilo Croce, Rodolfo Delmonte, Roberto Basili


Abstract
This paper investigates the ability of Large Language Models (LLMs) to differentiate between canonical and non-canonical sentences in Italian, employing advanced neural architectures like LLaMA and its adaptations. Canonical sentences adhere to the standard Subject-Verb-Object (SVO) structure. We hypothesize that recent generative LLMs are influenced heavily by the English language, where non-canonical structures are very rare. Using the in-context learning technique, we probe these models and further fine-tune them for this specific task. Initial results indicate that these models continue to struggle with this task even after fine-tuning. Additionally, we introduce a new dataset comprising several hundred sentences from the poetry domain, which presents significant challenges for the canonical structure task.
Anthology ID:
2024.clicit-1.52
Volume:
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:
December
Year:
2024
Address:
Pisa, Italy
Editors:
Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:
CLiC-it
SIG:
Publisher:
CEUR Workshop Proceedings
Note:
Pages:
431–439
Language:
URL:
https://aclanthology.org/2024.clicit-1.52/
DOI:
Bibkey:
Cite (ACL):
Claudiu Hromei, Danilo Croce, Rodolfo Delmonte, and Roberto Basili. 2024. La Non Canonica L’hai Studiata? Exploring LLMs and Sentence Canonicity in Italian. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 431–439, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):
La Non Canonica L’hai Studiata? Exploring LLMs and Sentence Canonicity in Italian (Hromei et al., CLiC-it 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clicit-1.52.pdf