LLMs’ morphological analyses of complex FST-generated Finnish words

Anssi Moisio, Mathias Creutz, Mikko Kurimo


Abstract
Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbohas some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely.
Anthology ID:
2024.cmcl-1.21
Volume:
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Tatsuki Kuribayashi, Giulia Rambelli, Ece Takmaz, Philipp Wicke, Yohei Oseki
Venues:
CMCL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
242–254
Language:
URL:
https://aclanthology.org/2024.cmcl-1.21
DOI:
Bibkey:
Cite (ACL):
Anssi Moisio, Mathias Creutz, and Mikko Kurimo. 2024. LLMs’ morphological analyses of complex FST-generated Finnish words. In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 242–254, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
LLMs’ morphological analyses of complex FST-generated Finnish words (Moisio et al., CMCL-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.cmcl-1.21.pdf