Exploiting machine algorithms in vocalic quantification of African English corpora

Lasisi Adeiza Isiaka

Abstract

Towards procedural fidelity in the processing of African English speech corpora, this work demonstrates how adapting machine-assisted phoneme segmentation and automatic extraction of acoustic values can significantly speed up the processing of naturalistic data and make vocalic analysis of these varieties less impressionistic. Research in African English phonology has, to date, rarely been data-driven, and comparative corpora have been used even less for cross-varietal assessment. Using over 30 hours of naturalistic data (from 28 speakers in 5 Nigerian cities), the work explains the procedures for segmenting audio files into phonemic units via the Munich Automatic Segmentation System (MAUS) and for extracting their spectral values in Praat. Evidence from the speech corpora supports a more complex vocalic inventory than attested in previous auditory or manual accounts, reinforcing the usefulness of the algorithms for the current data and cognate varieties.

Keywords: machine algorithms; naturalistic data; African English phonology; vowel segmentation

Anthology ID: W19-3647
Volume: Proceedings of the 2019 Workshop on Widening NLP
Month: August
Year: 2019
Address: Florence, Italy
Editors: Amittai Axelrod, Diyi Yang, Rossana Cunha, Samira Shaikh, Zeerak Waseem
Venue: WiNLP
Publisher: Association for Computational Linguistics
Pages: 149–151
URL: https://aclanthology.org/W19-3647/
Cite (ACL): Lasisi Adeiza Isiaka. 2019. Exploiting machine algorithms in vocalic quantification of African English corpora. In Proceedings of the 2019 Workshop on Widening NLP, pages 149–151, Florence, Italy. Association for Computational Linguistics.
Cite (Informal): Exploiting machine algorithms in vocalic quantification of African English corpora (Isiaka, WiNLP 2019)