Title: A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang
Published in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Publisher: Association for Computational Linguistics
Location: Miami, Florida, USA
Date: 2024-11
Type: conference publication
Pages: 13785-13816
Anthology ID: laskar-etal-2024-systematic
DOI: 10.18653/v1/2024.emnlp-main.764
URL: https://aclanthology.org/2024.emnlp-main.764/