Phun-Bench: Evaluating LLMs on Phonological Understanding in Chinese

Xing Yue; Yongliang Shen; Weiming Lu

Phun-Bench: Evaluating LLMs on Phonological Understanding in Chinese

Abstract

Language is a vehicle for thought, intricately tied to sounds, symbols, and meaning. However, most large language model (LLM) research focuses on meaning (semantics) and symbols (spelling) while largely overlooking sounds. Existing benchmarks on LLMs’ phonological abilities are either solvable through rote memorization or intertwined with other abilities, making them inadequate to measure LLMs’ genuine ability in *phonological understanding*. Here, we present Phun-Bench, a purpose-built Chinese benchmark with diverse tasks and settings across three dimensions (Homophony, Rhyme, and Phonetic Similarity), designed to systematically evaluate LLMs’ phonological understanding. Our results show that while LLMs excel at recalling correct pronunciations, they generally struggle to leverage phonological knowledge in the flexible and intuitive way that human speakers do. Moreover, through detailed analyses, we propose a hypothesis regarding the underlying mechanism of LLMs’ phonological understanding and “perception”, highlighting an underexplored frontier for future research.

Anthology ID:: 2026.acl-long.1041
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22723–22768
Language:
URL:: https://aclanthology.org/2026.acl-long.1041/
DOI:
Bibkey:
Cite (ACL):: Xing Yue, Yongliang Shen, and Weiming Lu. 2026. Phun-Bench: Evaluating LLMs on Phonological Understanding in Chinese. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 22723–22768, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Phun-Bench: Evaluating LLMs on Phonological Understanding in Chinese (Yue et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1041.pdf
Checklist:: 2026.acl-long.1041.checklist.pdf

PDF Cite Search Checklist Fix data