Holmes \ensuremath\recorder A Benchmark to Assess the Linguistic Competence of Language Models

Holmes \ensuremath\recorder A Benchmark to Assess the Linguistic Competence of Language Models Andreas Waldis author Yotam Perlitz author Leshem Choshen author Yufang Hou author Iryna Gurevych author 2024 text journal article Transactions of the Association for Computational Linguistics continuing MIT Press Cambridge, MA periodical academic journal waldis-etal-2024-holmes 10.1162/tacl_a_00718 https://aclanthology.org/2024.tacl-1.88/ 2024 12 1616 1647