LLMs’ morphological analyses of complex FST-generated Finnish words

Anssi Moisio; Mathias Creutz; Mikko Kurimo

doi:10.18653/v1/2024.cmcl-1.21

Abstract

Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbo has some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely.

Anthology ID: 2024.cmcl-1.21
Volume: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Month: August
Year: 2024
Address: Bangkok, Thailand
Editors: Tatsuki Kuribayashi, Giulia Rambelli, Ece Takmaz, Philipp Wicke, Yohei Oseki
Venues: CMCL | WS
Publisher: Association for Computational Linguistics
Pages: 242–254
URL: https://aclanthology.org/2024.cmcl-1.21/
DOI: 10.18653/v1/2024.cmcl-1.21
Bibkey: moisio-etal-2024-llms
Cite (ACL): Anssi Moisio, Mathias Creutz, and Mikko Kurimo. 2024. LLMs’ morphological analyses of complex FST-generated Finnish words. In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 242–254, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal): LLMs’ morphological analyses of complex FST-generated Finnish words (Moisio et al., CMCL 2024)
PDF: https://aclanthology.org/2024.cmcl-1.21.pdf


@inproceedings{moisio-etal-2024-llms,
    title = "{LLM}s' morphological analyses of complex {FST}-generated {F}innish words",
    author = "Moisio, Anssi  and
      Creutz, Mathias  and
      Kurimo, Mikko",
    editor = "Kuribayashi, Tatsuki  and
      Rambelli, Giulia  and
      Takmaz, Ece  and
      Wicke, Philipp  and
      Oseki, Yohei",
    booktitle = "Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.cmcl-1.21/",
    doi = "10.18653/v1/2024.cmcl-1.21",
    pages = "242--254",
    abstract = "Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbo has some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="moisio-etal-2024-llms">
    <titleInfo>
        <title>LLMs’ morphological analyses of complex FST-generated Finnish words</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Anssi</namePart>
        <namePart type="family">Moisio</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Mathias</namePart>
        <namePart type="family">Creutz</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Mikko</namePart>
        <namePart type="family">Kurimo</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2024-08</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Tatsuki</namePart>
            <namePart type="family">Kuribayashi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Giulia</namePart>
            <namePart type="family">Rambelli</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ece</namePart>
            <namePart type="family">Takmaz</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Philipp</namePart>
            <namePart type="family">Wicke</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Yohei</namePart>
            <namePart type="family">Oseki</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Bangkok, Thailand</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbo has some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely.</abstract>
    <identifier type="citekey">moisio-etal-2024-llms</identifier>
    <identifier type="doi">10.18653/v1/2024.cmcl-1.21</identifier>
    <location>
        <url>https://aclanthology.org/2024.cmcl-1.21/</url>
    </location>
    <part>
        <date>2024-08</date>
        <extent unit="page">
            <start>242</start>
            <end>254</end>
        </extent>
    </part>
</mods>
</modsCollection>

%0 Conference Proceedings
%T LLMs’ morphological analyses of complex FST-generated Finnish words
%A Moisio, Anssi
%A Creutz, Mathias
%A Kurimo, Mikko
%Y Kuribayashi, Tatsuki
%Y Rambelli, Giulia
%Y Takmaz, Ece
%Y Wicke, Philipp
%Y Oseki, Yohei
%S Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
%D 2024
%8 August
%I Association for Computational Linguistics
%C Bangkok, Thailand
%F moisio-etal-2024-llms
%X Rule-based language processing systems have been overshadowed by neural systems in terms of utility, but it remains unclear whether neural NLP systems, in practice, learn the grammar rules that humans use. This work aims to shed light on the issue by evaluating state-of-the-art LLMs in a task of morphological analysis of complex Finnish noun forms. We generate the forms using an FST tool, and they are unlikely to have occurred in the training sets of the LLMs, therefore requiring morphological generalisation capacity. We find that GPT-4-turbo has some difficulties in the task while GPT-3.5-turbo struggles and smaller models Llama2-70B and Poro-34B fail nearly completely.
%R 10.18653/v1/2024.cmcl-1.21
%U https://aclanthology.org/2024.cmcl-1.21/
%U https://doi.org/10.18653/v1/2024.cmcl-1.21
%P 242-254

Markdown (Informal)

[LLMs’ morphological analyses of complex FST-generated Finnish words](https://aclanthology.org/2024.cmcl-1.21/) (Moisio et al., CMCL 2024)
