Rauno Jaaska

2023

Automatic Transcription for Estonian Children’s Speech
Agnes Luhtaru | Rauno Jaaska | Karl Kruusamäe | Mark Fishel
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)

We evaluate the impact of recent improvements in Automatic Speech Recognition (ASR) on transcribing Estonian children’s speech. Our research focuses on fine-tuning large ASR models with a 10-hour Estonian children’s speech dataset to create accurate transcriptions. Our results show that large pre-trained models hold great potential when fine-tuned first with a more substantial Estonian adult speech corpus and then further trained with children’s speech.

Co-authors

Mark Fishel 1
Karl Kruusamäe 1
Agnes Luhtaru 1

Venues

NoDaLiDa1

Fix author