Do Language Models Use Logophoric Cues? Evidence from Mandarin Chinese Long-Distance Reflexive

Yunfang Dong

Do Language Models Use Logophoric Cues? Evidence from Mandarin Chinese Long-Distance Reflexive

Abstract

Resolving anaphora requires integrating syntactic, semantic, and discourse information. Mandarin Chinese offers a particularly revealing case through the reflexive ziji, whose interpretation permits long-distance binding licensed by logophoric cues (i.e., cues relevant to discourse perspective). While these cues have been extensively studied in linguistic theory and psycholinguistic experiments, it remains an open question to what extent such cues are captured by computational models.We investigate this question by probing large language models’ sensitivity to four logophoric cues known to license long-distance binding of ziji: predicate type, perspective marking, discourse topicality, and discourse relation. Using minimal pairs and surprisal-based measures, we assess whether models exhibit systematic biases toward non-local antecedents in logophoric contexts.Across two model families, we find that (i) models exhibit above-chance sensitivity to all four cues; (ii) lexically anchored cues are more robustly captured than discourse-level cues; and (iii) some cues generalize cross-lingually, whereas others appear to depend on language-specific training data. Taken together, these findings provide non-English evidence that large language models capture certain aspects of logophoricity, yet continue to struggle with discourse-level representations that are central to human anaphora resolution. Code and data are available at: https://github.com/yunfang-dong/mandarin-logophoricity-llm

Anthology ID:: 2026.findings-acl.2135
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 43061–43072
Language:
URL:: https://aclanthology.org/2026.findings-acl.2135/
DOI:
Bibkey:
Cite (ACL):: Yunfang Dong. 2026. Do Language Models Use Logophoric Cues? Evidence from Mandarin Chinese Long-Distance Reflexive. In Findings of the Association for Computational Linguistics: ACL 2026, pages 43061–43072, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Do Language Models Use Logophoric Cues? Evidence from Mandarin Chinese Long-Distance Reflexive (Dong, Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.2135.pdf
Checklist:: 2026.findings-acl.2135.checklist.pdf

PDF Cite Search Checklist Fix data