Bangzhao Shu, Lechen Zhang, Minje Choi, Lavinia Dunagan, Lajanugen Logeswaran, Moontae Lee, Dallas Card, and David Jurgens. 2024. You don’t need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), edited by Kevin Duh, Helena Gomez, and Steven Bethard, pages 5263-5281, Mexico City, Mexico, June 2024. Association for Computational Linguistics. Conference publication.
Anthology ID: shu-etal-2024-dont
DOI: 10.18653/v1/2024.naacl-long.295
URL: https://aclanthology.org/2024.naacl-long.295/