Can Language Models Be Specific? How?

Jie Huang, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-mei Hwu


Abstract
“He is a person”, “Paris is located on the earth”. Both statements are correct but meaningless - due to lack of specificity. In this paper, we propose to measure how specific the language of pre-trained language models (PLMs) is. To achieve this, we introduce a novel approach to build a benchmark for specificity testing by forming masked token prediction tasks with prompts. For instance, given “Toronto is located in [MASK].”, we want to test whether a more specific answer will be better filled in by PLMs, e.g., Ontario instead of Canada. From our evaluations, we show that existing PLMs have only a slight preference for more specific answers. We identify underlying factors affecting the specificity and design two prompt-based methods to improve the specificity. Results show that the specificity of the models can be improved by the proposed methods without additional training. We hope this work can bring to awareness the notion of specificity of language models and encourage the research community to further explore this important but understudied problem.
Anthology ID:
2023.findings-acl.45
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
716–727
Language:
URL:
https://aclanthology.org/2023.findings-acl.45
DOI:
10.18653/v1/2023.findings-acl.45
Bibkey:
Cite (ACL):
Jie Huang, Kevin Chen-Chuan Chang, Jinjun Xiong, and Wen-mei Hwu. 2023. Can Language Models Be Specific? How?. In Findings of the Association for Computational Linguistics: ACL 2023, pages 716–727, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Can Language Models Be Specific? How? (Huang et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.45.pdf