What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence

Manodyna K H


Abstract
Field linguistics increasingly relies on computational tools to organize, analyze, and preserve linguistic data, yet the classificatory assumptions embedded in these tools are rarely examined. A pervasive assumption is that languages can be treated as discrete, genealogically defined units, with relatedness modeled as tree-structured descent. We argue that this assumption misrepresents linguistic evidence in contact-heavy regions and risks distorting the computational mediation of field linguistic data. Focusing on South Asia, we show that widely assumed boundaries—such as the Indo-Aryan–Dravidian divide—collapse in long-standing contact zones characterized by convergence, dialect continua, and institutional multilingualism. Through historically grounded case studies including Kannada–Telugu and Tamil–Malayalam, we demonstrate how convergence, script-mediated distance, and post-hoc standardization reshape how field data is segmented, compared, and interpreted when organized through genealogical labels. We argue that contact-aware, relational models of linguistic relatedness are necessary if NLP tools are to support, rather than distort, the documentation and analysis of linguistic diversity.
Anthology ID:
2026.fieldmatters-1.2
Volume:
Proceedings of the Fifth Workshop on NLP Applications to Field Linguistics
Month:
March
Year:
2026
Address:
Rabat, Morocco
Venues:
FieldMatters | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8–15
Language:
URL:
https://aclanthology.org/2026.fieldmatters-1.2/
DOI:
Bibkey:
Cite (ACL):
Manodyna K H. 2026. What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence. In Proceedings of the Fifth Workshop on NLP Applications to Field Linguistics, pages 8–15, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence (K H, FieldMatters 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.fieldmatters-1.2.pdf