Can Large Language Models Be Good Language Teachers?

LiQing Xu; Qiwei Li; Tianshuo Peng; Zuchao Li; Hai Zhao; Ping Wang

doi:10.18653/v1/2025.emnlp-main.1222

Can Large Language Models Be Good Language Teachers?

LiQing Xu, Qiwei Li, Tianshuo Peng, Zuchao Li, Hai Zhao, Ping Wang

Abstract

Large language models (LLMs) have achieved remarkable success across diverse domains. However, their potential as effective language teachers—particularly in complex pedagogical scenarios like teaching Chinese as a second language—remains inadequately assessed. To address this gap, we propose the first pedagogical competence benchmark for LLMs, rigorously evaluating their performance against international standards for Chinese language teachers. Our framework spans three core dimensions: (1) basic knowledge evaluation, covering 32 subtopics across five major categories; (2) international teacher examination, based on data collected from international Chinese teacher certification exams; and (3) teaching practice evaluation, where target LLMs summarize knowledge points and design instructional content for student models, followed by testing the student models to assess the LLM’s ability to distill and teach key concepts.We conduct a comprehensive evaluation of 13 latest multilingual and Chinese LLMs. While most models demonstrate promising pedagogical potential, there remains substantial room for improvement in their teaching capabilities. This study contributes to the development of AI-assisted language education tools capable of rivaling human teaching excellence. The benchmark dataset and evaluation scripts used in this study are publicly available at https://github.com/Line-Kite/CLTE.

Anthology ID:: 2025.emnlp-main.1222
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 23957–23971
Language:
URL:: https://aclanthology.org/2025.emnlp-main.1222/
DOI:: 10.18653/v1/2025.emnlp-main.1222
Bibkey:
Cite (ACL):: LiQing Xu, Qiwei Li, Tianshuo Peng, Zuchao Li, Hai Zhao, and Ping Wang. 2025. Can Large Language Models Be Good Language Teachers?. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 23957–23971, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Can Large Language Models Be Good Language Teachers? (Xu et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.1222.pdf
Checklist:: 2025.emnlp-main.1222.checklist.pdf

PDF Cite Search Checklist Fix data