Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models

Tharindu Madusanka, Ian Pratt-Hartmann, Riza Batista-Navarro


Abstract
Efforts to apply transformer-based language models (TLMs) to the problem of reasoning in natural language have enjoyed ever-increasing success in recent years. The most fundamental task in this area to which nearly all others can be reduced is that of determining satisfiability. However, from a logical point of view, satisfiability problems vary along various dimensions, which may affect TLMs’ ability to learn how to solve them. The problem instances of satisfiability in natural language can belong to different computational complexity classes depending on the language fragment in which they are expressed. Although prior research has explored the problem of natural language satisfiability, the above-mentioned point has not been discussed adequately. Hence, we investigate how problem instances from varying computational complexity classes and having different grammatical constructs impact TLMs’ ability to learn rules of inference. Furthermore, to faithfully evaluate TLMs, we conduct an empirical study to explore the distribution of satisfiability problems.
Anthology ID:
2024.acl-long.815
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15278–15294
Language:
URL:
https://aclanthology.org/2024.acl-long.815
DOI:
Bibkey:
Cite (ACL):
Tharindu Madusanka, Ian Pratt-Hartmann, and Riza Batista-Navarro. 2024. Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15278–15294, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models (Madusanka et al., ACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.acl-long.815.pdf