@inproceedings{mohammadi-2026-large,
    title = "Do Large Language Models Understand Double Mismatches? Evidence from {Farsi}",
    author = "Mohammadi, Maryam",
    editor = "Merchant, Rayyan and
      Megerdoomian, Karine",
    booktitle = "The Proceedings of the First Workshop on {NLP} and {LLMs} for the {Iranian} Language Family",
    month = mar,
    year = "2026",
    address = "Rabat, Morocco",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2026.silkroadnlp-1.3/",
    pages = "24--28",
    isbn = "979-8-89176-371-5",
    abstract = "Large language models (LLMs) are increasingly used for communication in many languages, therefore, understanding their limitations with respect to culture-specific pragmatics is important. While LLMs perform well on statistically frequent structures, their shortcomings are most evident in rare pragmatic phenomena. This study investigates whether LLMs can generate a (rare) complex honorific mismatch in Farsi. The pattern arises at two levels: (i) a plural pronoun disagrees with a singular referent for the sake of honorification, and (ii) the related components violate the Polite Plural Generalization due to intimacy implication. This double mismatch pattern is attested in everyday speech, though it is statistically sparse. We tested GPT-4 across multiple scenarios. The results reveal that the model successfully employs the first mismatch to indicate honorific, but fails to adopt the second mismatch that simultaneously conveys intimacy. The model thus deviates from humanlike behavior at the syntax{--}pragmatics interface. These findings suggest that, while machine models demonstrate partial success in generating honorifics, they rely primarily on statistical patterns and lack the deeper pragmatic understanding necessary for contextual competence."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="mohammadi-2026-large">
<titleInfo>
<title>Do Large Language Models Understand Double Mismatches? Evidence from Farsi</title>
</titleInfo>
<name type="personal">
<namePart type="given">Maryam</namePart>
<namePart type="family">Mohammadi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2026-03</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>The Proceedings of the First Workshop on NLP and LLMs for the Iranian Language Family</title>
</titleInfo>
<name type="personal">
<namePart type="given">Rayyan</namePart>
<namePart type="family">Merchant</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Karine</namePart>
<namePart type="family">Megerdoomian</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Rabat, Morocco</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-371-5</identifier>
</relatedItem>
<abstract>Large language models (LLMs) are increasingly used for communication in many languages, therefore, understanding their limitations with respect to culture-specific pragmatics is important. While LLMs perform well on statistically frequent structures, their shortcomings are most evident in rare pragmatic phenomena. This study investigates whether LLMs can generate a (rare) complex honorific mismatch in Farsi. The pattern arises at two levels: (i) a plural pronoun disagrees with a singular referent for the sake of honorification, and (ii) the related components violate the Polite Plural Generalization due to intimacy implication. This double mismatch pattern is attested in everyday speech, though it is statistically sparse. We tested GPT-4 across multiple scenarios. The results reveal that the model successfully employs the first mismatch to indicate honorific, but fails to adopt the second mismatch that simultaneously conveys intimacy. The model thus deviates from humanlike behavior at the syntax–pragmatics interface. These findings suggest that, while machine models demonstrate partial success in generating honorifics, they rely primarily on statistical patterns and lack the deeper pragmatic understanding necessary for contextual competence.</abstract>
<identifier type="citekey">mohammadi-2026-large</identifier>
<location>
<url>https://aclanthology.org/2026.silkroadnlp-1.3/</url>
</location>
<part>
<date>2026-03</date>
<extent unit="page">
<start>24</start>
<end>28</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Do Large Language Models Understand Double Mismatches? Evidence from Farsi
%A Mohammadi, Maryam
%Y Merchant, Rayyan
%Y Megerdoomian, Karine
%S The Proceedings of the First Workshop on NLP and LLMs for the Iranian Language Family
%D 2026
%8 March
%I Association for Computational Linguistics
%C Rabat, Morocco
%@ 979-8-89176-371-5
%F mohammadi-2026-large
%X Large language models (LLMs) are increasingly used for communication in many languages, therefore, understanding their limitations with respect to culture-specific pragmatics is important. While LLMs perform well on statistically frequent structures, their shortcomings are most evident in rare pragmatic phenomena. This study investigates whether LLMs can generate a (rare) complex honorific mismatch in Farsi. The pattern arises at two levels: (i) a plural pronoun disagrees with a singular referent for the sake of honorification, and (ii) the related components violate the Polite Plural Generalization due to intimacy implication. This double mismatch pattern is attested in everyday speech, though it is statistically sparse. We tested GPT-4 across multiple scenarios. The results reveal that the model successfully employs the first mismatch to indicate honorific, but fails to adopt the second mismatch that simultaneously conveys intimacy. The model thus deviates from humanlike behavior at the syntax–pragmatics interface. These findings suggest that, while machine models demonstrate partial success in generating honorifics, they rely primarily on statistical patterns and lack the deeper pragmatic understanding necessary for contextual competence.
%U https://aclanthology.org/2026.silkroadnlp-1.3/
%P 24-28
Markdown (Informal)
[Do Large Language Models Understand Double Mismatches? Evidence from Farsi](https://aclanthology.org/2026.silkroadnlp-1.3/) (Mohammadi, SilkRoadNLP 2026)
ACL