Documenting Endangered Languages with LangDoc: A Wordlist-Based System and A Case Study on Moklen

Piyapath Spencer


Abstract
Language documentation, especially languages lacking standardised writing systems, is a laborious and time-consuming process. This paper introduces LangDoc, a comprehensive system designed to address challenges and improve the efficiency and accuracy of language documentation projects. LangDoc offers several features, including tools for managing, recording, and reviewing the collected data. It operates both online and offline, crucial for fieldwork in remote locations. The paper also presents a comparative analysis demonstrating LangDoc’s efficiency compared to other methods. A case study of the Moklen language documentation project demonstrates how the features address the specific challenges of working with endangered languages and remote communities. Future development areas include integrating with NLP tools for advanced linguistic analysis and emphasising its potential to support the preservation of language diversity.
Anthology ID:
2024.fieldmatters-1.4
Volume:
Proceedings of the 3rd Workshop on NLP Applications to Field Linguistics (Field Matters 2024)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Oleg Serikov, Ekaterina Voloshina, Anna Postnikova, Saliha Muradoglu, Eric Le Ferrand, Elena Klyachko, Ekaterina Vylomova, Tatiana Shavrina, Francis Tyers
Venues:
FieldMatters | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
28–36
Language:
URL:
https://aclanthology.org/2024.fieldmatters-1.4
DOI:
Bibkey:
Cite (ACL):
Piyapath Spencer. 2024. Documenting Endangered Languages with LangDoc: A Wordlist-Based System and A Case Study on Moklen. In Proceedings of the 3rd Workshop on NLP Applications to Field Linguistics (Field Matters 2024), pages 28–36, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Documenting Endangered Languages with LangDoc: A Wordlist-Based System and A Case Study on Moklen (Spencer, FieldMatters-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.fieldmatters-1.4.pdf