Introducing a web application for labeling, visualizing speech and correcting derived speech signals

Raphael Winkelmann, Georg Raess


Abstract
The advent of HTML5 has sparked a great increase in interest in the web as a development platform for a variety of different research applications. Due to its ability to easily deploy software to remote clients and the recent development of standardized browser APIs, we argue that the browser has become a good platform to develop a speech labeling tool for. This paper introduces a preliminary version of an open-source client-side web application for labeling speech data, visualizing speech and segmentation information and manually correcting derived speech signals such as formant trajectories. The user interface has been designed to be as user-friendly as possible in order to make the sometimes tedious task of transcribing as easy and efficient as possible. The future integration into the next iteration of the EMU speech database management system and its general architecture will also be outlined, as the work presented here is only one of several components contributing to the future system.
Anthology ID:
L14-1367
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4129–4133
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/429_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Raphael Winkelmann and Georg Raess. 2014. Introducing a web application for labeling, visualizing speech and correcting derived speech signals. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 4129–4133, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Introducing a web application for labeling, visualizing speech and correcting derived speech signals (Winkelmann & Raess, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/429_Paper.pdf