Computational Expressivity of Neural Language Models

Alexandra Butoi, Ryan Cotterell, Anej Svete


Abstract
Language models (LMs) are currently at the forefront of NLP researchdue to their remarkable versatility across diverse tasks. However, a largegap exists between their observed capabilities and the explanations proposedby established formal machinery. To motivate a better theoreticalcharacterization of LMs’ abilities and limitations, this tutorial aims toprovide a comprehensive introduction to a specific framework for formalanalysis of modern LMs using tools from formal language theory (FLT). Wepresent how tools from FLT can be useful in understanding the inner workingsand predicting the capabilities of modern neural LM architectures. We willcover recent results using FLT to make precise and practically relevantstatements about LMs based on recurrent neural networks and transformers byrelating them to formal devices such as finite-state automata, Turingmachines, and analog circuits. Altogether, the results covered in thistutorial will allow us to make precise statements and explanations about theobserved as well as predicted behaviors of LMs, as well as providetheoretically motivated suggestions on the aspects of the architectures thatcould be improved.
Anthology ID:
2024.acl-tutorials.3
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Luis Chiruzzo, Hung-yi Lee, Leonardo Ribeiro
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5–5
Language:
URL:
https://aclanthology.org/2024.acl-tutorials.3
DOI:
Bibkey:
Cite (ACL):
Alexandra Butoi, Ryan Cotterell, and Anej Svete. 2024. Computational Expressivity of Neural Language Models. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), pages 5–5, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Computational Expressivity of Neural Language Models (Butoi et al., ACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.acl-tutorials.3.pdf