Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem YuHong Sun author Zhangyue Yin author Qipeng Guo author Jiawen Wu author Xipeng Qiu author Hui Zhao author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication sun-etal-2024-benchmarking https://aclanthology.org/2024.lrec-main.196/ 2024-05 2178 2188