Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark

Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, Daniele P. Radicioni


Anthology ID:
2025.clicit-1.69
Volume:
Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025)
Month:
September
Year:
2025
Address:
Cagliari, Italy
Editors:
Cristina Bosco, Elisabetta Jezek, Marco Polignano, Manuela Sanguinetti
Venue:
CLiC-it
SIG:
Publisher:
CEUR Workshop Proceedings
Note:
Pages:
722–734
Language:
URL:
https://aclanthology.org/2025.clicit-1.69/
DOI:
Bibkey:
Cite (ACL):
Enrico Mensa, Lorenzo Zane, Calogero Jerik Scozzaro, Matteo Delsanto, Tommaso Milani, and Daniele P. Radicioni. 2025. Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025), pages 722–734, Cagliari, Italy. CEUR Workshop Proceedings.
Cite (Informal):
Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark (Mensa et al., CLiC-it 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.clicit-1.69.pdf