Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark

Enrico Mensa; Lorenzo Zane; Calogero Jerik Scozzaro; Matteo Delsanto; Tommaso Milani; Daniele P. Radicioni

Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark

Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, Daniele P. Radicioni

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Anthology ID:: 2025.clicit-1.69
Volume:: Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Month:: September
Year:: 2025
Address:: Cagliari, Italy
Editors:: Cristina Bosco, Elisabetta Jezek, Marco Polignano, Manuela Sanguinetti
Venue:: CLiC-it
SIG:
Publisher:: CEUR Workshop Proceedings
Note:
Pages:: 722–734
Language:
URL:: https://aclanthology.org/2025.clicit-1.69/
DOI:
Bibkey:
Cite (ACL):: Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, and Daniele P. Radicioni. 2025. Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), pages 722–734, Cagliari, Italy. CEUR Workshop Proceedings.
Cite (Informal):: Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark (Mensa et al., CLiC-it 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.clicit-1.69.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{mensa-etal-2025-easy,
    title = "Easy to Complete, Hard to Choose: Investigating {LLM} Performance on the {P}roverb{IT} Benchmark",
    author = "Mensa, Enrico  and
      Zane, Lorenzo  and
      Scozzaro, Calogero Jerik  and
      Delsanto, Matteo  and
      Milani, Tommaso  and
      Radicioni, Daniele P.",
    editor = "Bosco, Cristina  and
      Jezek, Elisabetta  and
      Polignano, Marco  and
      Sanguinetti, Manuela",
    booktitle = "Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)",
    month = sep,
    year = "2025",
    address = "Cagliari, Italy",
    publisher = "CEUR Workshop Proceedings",
    url = "https://aclanthology.org/2025.clicit-1.69/",
    pages = "722--734",
    ISBN = "979-12-243-0587-3"
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="mensa-etal-2025-easy">
    <titleInfo>
        <title>Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Enrico</namePart>
        <namePart type="family">Mensa</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Lorenzo</namePart>
        <namePart type="family">Zane</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Calogero</namePart>
        <namePart type="given">Jerik</namePart>
        <namePart type="family">Scozzaro</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Matteo</namePart>
        <namePart type="family">Delsanto</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Tommaso</namePart>
        <namePart type="family">Milani</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Daniele</namePart>
        <namePart type="given">P</namePart>
        <namePart type="family">Radicioni</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2025-09</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Cristina</namePart>
            <namePart type="family">Bosco</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Elisabetta</namePart>
            <namePart type="family">Jezek</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Marco</namePart>
            <namePart type="family">Polignano</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Manuela</namePart>
            <namePart type="family">Sanguinetti</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>CEUR Workshop Proceedings</publisher>
            <place>
                <placeTerm type="text">Cagliari, Italy</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
        <identifier type="isbn">979-12-243-0587-3</identifier>
    </relatedItem>
    <identifier type="citekey">mensa-etal-2025-easy</identifier>
    <location>
        <url>https://aclanthology.org/2025.clicit-1.69/</url>
    </location>
    <part>
        <date>2025-09</date>
        <extent unit="page">
            <start>722</start>
            <end>734</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark
%A Mensa, Enrico
%A Zane, Lorenzo
%A Scozzaro, Calogero Jerik
%A Delsanto, Matteo
%A Milani, Tommaso
%A Radicioni, Daniele P.
%Y Bosco, Cristina
%Y Jezek, Elisabetta
%Y Polignano, Marco
%Y Sanguinetti, Manuela
%S Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
%D 2025
%8 September
%I CEUR Workshop Proceedings
%C Cagliari, Italy
%@ 979-12-243-0587-3
%F mensa-etal-2025-easy
%U https://aclanthology.org/2025.clicit-1.69/
%P 722-734

Download as File

Markdown (Informal)

[Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark](https://aclanthology.org/2025.clicit-1.69/) (Mensa et al., CLiC-it 2025)

Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark (Mensa et al., CLiC-it 2025)

ACL

Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, and Daniele P. Radicioni. 2025. Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), pages 722–734, Cagliari, Italy. CEUR Workshop Proceedings.