TC-STAR: New language resources for ASR and SLT purposes

Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, Djamel Mostefa


Abstract
In TC-STAR a variety of Language Resources (LR) is being produced. In this contribution we address the resources that have been created for Automatic Speech Recrognition and Spoken Language Translation. As yet, these are 14 LR in total: two training SLR for ASR (English and Spanish), three development LR and three evaluation LR for ASR (English, Spanish, Mandarin), and three development LR and three evaluation LR for SLT (English-Spanish, Spanish-English, Mandarin-English). In this paper we describe the properties, validation, and availability of these resources.
Anthology ID:
L06-1039
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Editors:
Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/81_pdf.pdf
DOI:
Bibkey:
Cite (ACL):
Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, and Djamel Mostefa. 2006. TC-STAR: New language resources for ASR and SLT purposes. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
TC-STAR: New language resources for ASR and SLT purposes (van den Heuvel et al., LREC 2006)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/81_pdf.pdf