Implicit Discourse Relation Classification For Nigerian Pidgin

Muhammed Yahia Gaffar Saeed Saeed, Peter Bourgonje, Vera Demberg


Abstract
Nigerian Pidgin (NP) is an English-based creole language spoken by nearly 100 million people across Nigeria, and is still low-resource in NLP. In particular, there are currently no available discourse parsing tools, which, if available, would have the potential to improve various downstream tasks. Our research focuses on implicit discourse relation classification (IDRC) for NP, a task which, even in English, is not easily solved by prompting LLMs, but requires supervised training. % With this in mind, we have developed a framework for the task, which could also be used by researchers for other English-lexified languages. We systematically compare different approaches to the low resource IDRC task: in one approach, we use English IDRC tools directly on the NP text as well as on their English translations (followed by a back-projection of labels). In another approach, we create a synthetic discourse corpus for NP, in which we automatically translate the English discourse-annotated corpus PDTB to NP, project PDTB labels, and then train an NP IDR classifier. The latter approach of training a “native” NP classifier outperforms our baseline by 13.27% and 33.98% in f1 score for 4-way and 11-way classification, respectively.
Anthology ID:
2025.coling-main.174
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2561–2574
Language:
URL:
https://aclanthology.org/2025.coling-main.174/
DOI:
Bibkey:
Cite (ACL):
Muhammed Yahia Gaffar Saeed Saeed, Peter Bourgonje, and Vera Demberg. 2025. Implicit Discourse Relation Classification For Nigerian Pidgin. In Proceedings of the 31st International Conference on Computational Linguistics, pages 2561–2574, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Implicit Discourse Relation Classification For Nigerian Pidgin (Saeed et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.174.pdf